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NEISSERIA MENINGITIDIS ANTIGENS AND COMPOSITIONS 



reference in its entirety: 60/083,758 (filed May 1, 1998); 60/094,869 (filed July 31, 1998); 
60/098,994 (filed September 2, 1998); 60/099,062 (filed September 2, 1998); 60/103,749 
(filed October 9, 1998); 60/103,794 (filed October 9, 1998); 60/103,796 (filed October 9, 
1998); and 60/121,528 (filed February 25. 1999). 

This invention relates to antigens from the bacterial species: Neisseria meningitidis 
and Neisseria gonorrhoeae. 

BACKGROUND 

Neisseria meningitidis is a non-motile, gram negative diplococcus human pathogen. It 
colonizes the pharynx, causing meningitis and, occasionally, septicaemia in the absence of 
meningitis. It is closely related to N gonorrhoea, although one feature that clearly 
differentiates meningococcus from gonococcus is the presence of a polysaccharide capsule 
that is present in all pathogenic meningococci. 

N. meningitidis causes both endemic and epidemic disease. In the United States the 
attack rate is 0.6-1 per 100,000 persons per year, and it can be much greater during outbreaks, 
(see Lieberman et ai (1996) Safety and Immunogenic ty of a Serogroups A/C Neisseria 
meningitidis Oligosaccharide-Protein Conjugate Vaccine in Young Children. JAMA 
275(19):1499-1503; Schuchat et al (1997) Bacterial Meningitis in the United States in 1995. 
N Engl J Med 337(14):970-976). In developing countries, endemic disease rates are much 
higher and during epidemics incidence rates can reach 500 cases per 100,000 persons per 
year. Mortality is extremely high, at 10-20% in the United States, and much higher in 
developing countries. Following the introduction of the conjugate vaccine against 
Haemophilus influenzae, N. meningitidis is the major cause of bacterial meningitis at all ages 
in the United States (Schuchat et al (1997) supra). 

Based on the organism's capsular polysaccharide, 12 serogroups of N meningitidis 
have been identified. Group A is the pathogen most often implicated in epidemic disease in 
sub-Saharan Africa. Serogroups B and C are responsible for the vast majority of cases in the 
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United States and in most developed countries. Serogroups W135 and Y are responsible for 
the rest of the cases in the United States and developed countries. The meningococcal vaccine 
currently in use is a tetravalent polysaccharide vaccine composed of serogroups A, C Y and 
W135. Although efficacious in adolescents and adults, it induces a poor immune response and 
short duration of protection, and cannot be used in infants [eg. Morbidity and Mortality 
weekly report, Vol.46, No. RR-5 (1997)], This is because polysaccharides are T-cell 
independent antigens that induce a weak immune response that cannot be boosted by repeated 
immunization. Following the success of the vaccination against H. influenzae, conjugate 
vaccines against serogroups A and C have been developed and are at the final stage of clinical 
testing (Zollinger WD "New and Improved Vaccines Against Meningococcal Disease". In: 
New Generation Vaccines, supra, pp. 469-488; Lieberman et al (1996) supra; Costantino et al 
(1992) Development and phase I clinical testing of a conjugate vaccine against 
meningococcus A and C. Vaccine 10:691-698). 

Meningococcus B (meriB) remains a problem, however. This serotype currently is 
responsible for approximately 50% of total meningitis in the United States, Europe, and South 
America. The polysaccharide approach cannot be used because the menB capsular 
polysaccharide is a polymer of a(2-8)-linked N-acetyl neuraminic acid that is also present in 
mammalian tissue. This results in tolerance to the antigen; indeed, if an immune response 
were elicited, it would be anti-self, and therefore undesirable. In order to avoid induction of 
autoimmunity and to induce a protective immune response, the capsular polysaccharide has, 
for instance, been chemically modified substituting the jV-acetyl groups with V-propionyl 
groups, leaving the specific antigenicity unaltered (Romero & Outschoorn (1994) Current 
status of Meningococcal group B vaccine candidates: capsular or non-capsular? Clin 
Microbiol Rev 7(4):559-575). 

Alternative approaches to menB vaccines have used complex mixtures of outer 
membrane proteins (OMPs), containing either the OMPs alone, or OMPs enriched in porins, 
or deleted of the class 4 OMPs that are believed to induce antibodies that block bactericidal 
activity. This approach produces vaccines that are not well characterized. They are able to 
protect against the homologous strain, but are not effective at large where there are many 
antigenic variants of the outer membrane proteins. To overcome the antigenic variability, 
multivalent vaccines containing up to nine different porins have been constructed (eg. 
Poolman JT (1992) Development of a meningococcal vaccine. Infect. Agents Dis. 4:13-28). 
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Additional proteins to be used in outer membrane vaccines have been the opa and opc 
proteins, but none of these approaches have been able to overcome the antigenic variability 
(eg. Ala'Aldeen & Borriello (1996) The meningococcal transferrin-binding proteins 1 and 2 
are both surface exposed and generate bactericidal antibodies capable of killing homologous 
and heterologous strains. Vaccine 14(1 ):49-53). 

A certain amount of sequence data is available for meningococcal and gonoccocal 
genes and proteins {e.g. EP-A-0467714, W096/29412). but this is by no means complete. 
Other men B proteins may include those listed in WO 97/28273, WO 96/29412, WO 
95/03413, US 5,439,808, and US 5,879,686. 

The provision of further sequences could provide an opportunity to identify secreted 
or surface-exposed proteins that are presumed targets for the immune system and which are 
not antigenically variable. For instance, some of the identified proteins could be components 
of efficacious vaccines against meningococcus B, some could be components of vaccines 
against all meningococcal serotypes, and others could be components of vaccines against all 
pathogenic Neisseriae including Neisseria meningitidis or Neisseria gonorrhoeae. Those 
sequences specific to N. meningitidis or N. gonorrhoeae that are more highly conserved are 
further preferred sequences. 

It is thus an object of the invention is to provide Neisseria! DNA sequences which 
encode proteins that are antigenic or immunogenic. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 illustrates the products of protein expression and purification of the predicted 
ORF 919 as cloned and expressed in E. coli. 

Fig. 2 illustrates the products of protein expression and purification of the predicted 
ORF 279 as cloned and expressed in E. coli. 

Fig. 3 illustrates the products of protein expression and purification of the predicted 
ORF 576-1 as cloned and expressed in E. coli. 

Fig. 4 illustrates the products of protein expression and purification of the predicted 
ORF 519-1 as cloned and expressed in E. coli. 

Fig. 5 illustrates the products of protein expression and purification of the predicted 
ORF 121-1 as cloned and expressed in E. coli. 
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Fig. 6 illustrates the products of protein expression and purification of the predicted 
ORF 128-1 as cloned and expressed in£. coli. 

Fig. 7 illustrates the products of protein expression and purification of the predicted 
ORF 206 as cloned and expressed in E. coli. 

Fig. 8 illustrates the products of protein expression and purification of the predicted 
ORF 287 as cloned and expressed in E. coli. 

Fig. 9 illustrates the products of protein expression and purification of the predicted 
ORF 406 as cloned and expressed in E. coli. 

Fig. 10 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 919 as cloned and expressed in E. coli. 

Fig. 1 1 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 279 as cloned and expressed in E. coli. 

Fig. 12 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 576-1 as cloned and expressed in E. coli. 

Fig. 13 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 519-1 as cloned and expressed in E. coli. 

Fig. 14 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 121-1 as cloned and expressed in E. coli. 

Fig. 15 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 128-1 as cloned and expressed in E. coli. 

Fig. 16 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 206 as cloned and expressed in E. coli. 

Fig. 17 illustrates the hydrophilicity plot, antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 287 as cloned and expressed in E. coli. 

Fig. 18 illustrates the hydrophilicity plot antigenic index and AMPHI regions of the 
products of protein expression the predicted ORF 406 as cloned and expressed in E. coli. 

Fig. 19 shows an alignment comparison of amino acid sequences for ORF 225 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation amo.ig the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 
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Fig. 20 shows an alignment comparison of amino acid sequences for ORF 235 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of ammo acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 21 shows an alignment comparison of amino acid sequences for ORF 287 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 22 shows an alignment comparison of amino acid sequences for ORF 519 for 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

Fig. 23 shows an alignment comparison of amino acid sequences for ORP 919 L; 
several strains of Neisseria. Dark shading indicates regions of homology, and gray shading 
indicates the conservation of amino acids with similar characteristics. The Figure 
demonstrates a high degree of conservation among the various strains, further confirming its 
utility as an antigen for both vaccines and diagnostics. 

THE INVENTION 

The invention provides proteins comprising the N. meningitidis amino acid sequences 
and N. gonorrhoeae amino acid sequences disclosed in the examples. 

It also provides proteins comprising sequences homologous (i.e., those having 
sequence identity) to the N. meningitidis amino acid sequences disclosed in the examples. 
Depending on the particular sequence, the degree of homology (sequence identity) is 
preferably greater than 50% (eg. 60%, 70%, 80%, 90%, 95%, 99% or more). These proteins 
include mutants and allelic variants of the sequences disclosed in the examples. Typically, 
50% identity or more between two proteins is considered to be an indication of functional 
equivalence. Identity between proteins is preferably determined by the Smith-Waterman 



WO 99/57280 



6 



PCT/US99/09346 



homology search algorithm as implemented in MPSRCH program (Oxford Molecular) using 
an affine gap search with parameters:gap penalty 12, gap extension penalty 1. 

The invention further provides proteins comprising fragments of the N. meningitidis 
amino acid sequences and N. gonorrhoeae amino acid sequences disclosed in the examples. 
The fragments should comprise at least n consecutive amino acids from the sequences and, 
depending on the particular sequence, n is 7 or more (eg. S, 1 0, 1 2, 1 4, 1 6, 1 8, 20 or more). 
Preferably the fragments compnse an epitope from the sequence. 

The proteins of the invention can, of course, be prepared by various means (eg. 
recombinant expression, purification from cell culture, chemical synthesis etc.) and in various 
forms (eg. native, fusions etc.). They are preferably prepared in substantially pure or isolated 
form (ie. substantially free from other N. meningitidis or N. gonorrhoeae host cell proteins) 

According to a further aspect, the invention provides antibodies which bind to these 
proteins. These may be polyclonal or monoclonal and may be produced by any suitable 
means. 

According to a further aspect, the invention provides nucleic acid comprising the 
N. meningitidis nucleotide sequences and N. gonorrhoeae nucleotide sequences disclosed in 
the examples. 

According to a further aspect, the invention comprises nucleic acids having sequence 
identity of greater than 50% (e.g., 60%, 70%, 80%, 90%, 95%, 99% or more) to the nucleic 
acid sequences herein. Sequence identity is determined as above-discussed. 

According to a further aspect, the invention comprises nucleic acid that hybridizes to 
the sequences provided herein. Conditions for hybridization are set forth herein. 

Nucleic acid comprising fragments of these sequences are also provided. These should 
comprise at least n consecutive nucleotides from the N. meningitidis sequences or N. 
gonorrhoeae sequences and depending on the particular sequence, n is 10 or more (eg 12, 14, 
15, 18, 20, 25, 30, 35, 40 or more). 

According to a further aspect, the invention provides nucleic acid encoding the 
proteins and protein fragments of the invention. 

It should also be appreciated that the invention provides nucleic acid comprising 
sequences complementary to those described above (eg. for antisense or probing purposes). 

Nucleic acid according to the invention can, of course, be prepared in many ways (eg. 
bv chemical synthesis, in part or in whole, from genomic or cDNA libraries, from the 
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organism itself etc.) and can take various forms (eg. single stranded, double stranded, vectors, 
probes etc.). 

In addition, the term "nucleic acid 1 ' includes DNA and RNA, and also their analogues, 
such as those containing modified backbones, and also protein nucleic acids (PNA) etc. 

According to a further aspect, the invention provides vectors comprising nucleotide 
sequences of the invention (eg. expression vectors) and host cells transformed with such 
vectors. 

According to a further aspect, the invention provides compositions comprising protein, 
antibody, and/or nucleic acid according to the invention. These compositions may be suitable 
as vaccines, for instance, or as diagnostic reagents or as immunogenic compositions. 

The invention also provides nucleic acid, protein, or antibody according to the 
invention for use as medicaments (eg. as vaccines) or as diagnostic reagents. It also provides 
the use of nucleic acid, protein, or antibody according to the invention in the manufacture of 
(I) a medicament for treating or preventing infection due to Neisserial bacteria (ii) a 
diagnostic reagent for detecting the presence of Neisserial bacteria or of antibodies raised 
against Neisserial bacteria or (iii) for raising antibodies. Said Neisserial bacteria may be any 
species or strain (such as N. gonorrhoeae) but are preferably N. meningitidis, especially strain 
B or strain C. 

The invention also provides a method of treating a patient, compnsing administering 
to the patient a therapeutically effective amount of nucleic acid, protein, and/or antibody 
according to the invention. 

According to further aspects, the invention provides various processes. 

A process for producing proteins of the invention is provided, comprising the step of 
culturing a host cell according to the invention under conditions which induce protein 
expression. 

A process for detecting polynucleotides of the invention is provided, comprising the 
steps of: (a) contacting a nucleic probe according to the invention with a biological sample 
under hybridizing conditions to form duplexes; and (b) detecting said duplexes. 

A process for detecting proteins of the invention is provided, comprising the steps of: 
(a) contacting an antibody according to the invention with a biological sample under 
conditions suitable for the formation of an antibody-antigen complexes; and (b) detecting 
said complexes. 
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Having now generally described the inventiion, the same will be more readily 
understood through reference to the following examples which are provided by way of 
illustration, and are not intended to be limiting of the present invention, unless specified. 

Methodology - Summary of standard procedures and techniques. 
General 

This invention provides Neisseria meningitidis menB nucleotide sequences, amino 
acid sequences encoded therein. With these disclosed sequences, nucleic acid probe assays 
and expression cassettes and vectors can be produced. The expression vectors can be 
transformed into host cells to produce proteins. The purified or isolated polypeptides (which 
may also be chemically synthesized) can be used to produce antibodies to detect menB 
proteins. Also, the host cells or extracts can be utilized for biological assays to isolate 
agonists or antagonists. In addition, with these sequences one can search to identify open 
reading frames and identify amino acid sequences. The proteins may also be used in 
immunogenic compositions, antigenic compositions and as vaccine components. 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of molecular biology, microbiology, recombinant DNA, and 
immunology, which are within the skill of the art. Such techniques are explained fully in the 
literature e.g., Sambrook Molecular Cloning; A Laboratoiy Manual Second Edition (1989); 
DNA Cloning, Volumes I and ii (D.N Glover ed. 1985); Oligonucleotide Synthesis (MJ. Gait 
ed, 1984); Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins eds. 1984); Transcription 
and Translation (B.D. Hames & S.J. Higgins eds. 1984); Animal Cell Culture (R.I. Freshney 
ed. 1986); Immobilized Cells and Enzymes (1RL Press, 1986); B. Perbal, A Practical Guide to 
Molecular Cloning (1984); the Methods in Enzymology series (Academic Press, Inc.), 
especially volumes 154 & 155; Gene Transfer Vectors for Mammalian Cells (J.H. Miller and 
M.P. Calos eds. 1987, Cold Spring Harbor Laboratory); Mayer and Walker, eds. (1987), 
Immunochemical Methods in Cell and Molecular Biology (Academic Press, London); Scopes, 
(1987) Protein Purification: Principles and Practice, Second Edition (Springer- Verlag, 
N.Y.), and Handbook of Experimental Immunology?, Volumes I~IV (D.M. Weir and C.C. 
Blackwell eds 1986). 

Standard abbreviations for nucleotides and amino acids are used in this specification. 
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All publications, patents, and patent applications cited herein are incorporated in full 
by reference. 

Expression systems 

The Neisseria menB nucleotide sequences can be expressed in a variety of different 
expression systems; for example those used with mammalian cells, plant cells, baculoviruses, 
bacteria, and yeast. 

i. Mammalian Systems 

Mammalian expression systems are known in the art. A mammalian promoter is any 
DNA sequence capable of binding mammalian RNA polymerase and initiating the 
downstream (3 1 ) transcription of a coding sequence (e.g., structural gene) into mRNA. A 
promoter will have a transcription initiating region, which is usually placed proximal to the 5' 
end of the coding sequence, and a TATA box, usually located 25-30 base pairs (bp) upstream 
of the transcription initiation site. The TATA box is thought to direct RNA polymerase II to 
begin RNA synthesis at the correct site. A mammalian promoter will also contain an 
upstream promoter element, usually located within 100 to 200 bp upstream of the TATA box. 
An upstream promoter element determines the rate at which transcription is initiated and can 
act in either orientation (Sambrook et al. (1989) "Expression of Cloned Genes in Mammalian 
Cells." In Molecular Cloning: A Laboratory Manual, 2nd ed.). 

Mammalian viral genes are often highly expressed and have a broad host range; 
therefore sequences encoding mammalian viral genes provide particularly useful promoter 
sequences. Examples include the SV40 early promoter, mouse mammary tumor virus LTR 
promoter, adenovirus major late promoter (Ad MLP), and herpes simplex virus promoter. In 
addition, sequences derived from non-viral genes, such as the murine metallothionein gene, 
also provide useful promoter sequences. Expression may be either constitutive or regulated 
(inducible). Depending on the promoter selected, many promotes may be inducible using 
known substrates, such as the use of the mouse mammary tumor virus (MMTV) promoter 
with the glucocorticoid responsive element (GRE) that is induced by glucocorticoid in 
hormone-responsive transformed cells (see for example, U.S. Patent 5,783,681). 

The presence of an enhancer element (enhancer), combined with the promoter 
elements described above, will usually increase expression levels. An enhancer is a 
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regulatory DNA sequence that can stimulate transcription up to 1000-fold when linked to 
homologous or heterologous promoters, with synthesis beginning at the normal RNA start 
site. Enhancers are also active when they are placed upstream or downstream from the 
transcription initiation site, in either normal or flipped orientation, or at a distance of more 
than 1000 nucleotides from the promoter (Maniatis et al. (1987) Science 236:1231; Alberts et 
al. (1989) Molecular Biology? of the Cell, 2nd ed.). Enhancer elements derived from viruses 
may be particularly useful, because they usually have a broader host range. Examples include 
the SV40 early gene enhancer (Dijkema et al (1985) EMBO J. 4:761) and the 
enhancer/promoters derived from the long terminal repeat (LTR) of the Rous Sarcoma Virus 
(Gorman et al. (1982b) Proc. Natl. Acad. Sci. 79:6777) and from human cytomegalovirus 
(Boshart et al. (1985) Cell 47:521). Additionally, some enhancers are regulatable and become 
active only in the presence of an inducer, such as a hormone or metal ion (Sassone-Corsi and 
Borelli (1986) Trends Genet. 2:215; Maniatis et al. (1987) Science 236:1237). 

A DNA molecule may be expressed intracellularly in mammalian cells. A promoter 
sequence may be directly linked with the DNA molecule, in which case the first amino acid at 
the N-terminus of the recombinant protein will always be a methionine, which is encoded by 
the ATG start codon. If desired, the N-terminus may be cleaved from the protein by in vitro 
incubation with cyanogen bromide. 

Alternatively, foreign proteins can also be secreted from the cell into the growth media 
by creating chimeric DNA molecules that encode a fusion protein comprised of a leader 
sequence fragment that provides for secretion of the foreign protein in mammalian cells. 
Preferably, there are processing sites encoded between the leader fragment and the foreign 
gene that can be cleaved either in vivo or in vitro. The leader sequence fragment usually 
encodes a signal peptide comprised of hydrophobic amino acids which direct the secretion of 
the protein from the cell. The adenovirus tripartite leader is an example of a leader sequence 
that provides for secretion of a foreign protein in mammalian cells. 

Usually, transcription termination and polyadenylation sequences recognized by 
mammalian cells are regulatory regions located 3' to the translation stop codon and thus, 
together with the promoter elements, flank the coding sequence. The 3' terminus of the 
mature mRNA is formed by site-specific post-transcriptional cleavage and polyadenylation 
(Birnstiel et al. (1985) Cell 47:349; Proudfoot and Whitelaw (1988) "Termination and 3' end 
processing of eukaryotic RNA. In Transcription and splicing (ed. B.D. Hames and D.M. 
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Glover); Proudfoot (1989) Trends Biochem. Sci. 7-/: 105). These sequences direct the 
transcription of an mRNA which can be translated into the polypeptide encoded by the DNA. 
Examples of transcription terminator/polyadenylation signals include those derived from 
SV40 (Sambrook et al (1989) "Expression of cloned genes in cultured mammalian cells." In 
Molecular Cloning: A Laboratory Manual). 

Usually, the above described components, comprising a promoter, polyadenylation 
signal, and transcnption termination sequence are put together into expression constructs. 
Enhancers, introns with functional splice donor and acceptor sites, and leader sequences may 
also be included in an expression construct, if desired. Expression constructs are often 
maintained in a replicon, such as an cxtrachromosomal element (e.g., plasmids) capable of 
stable maintenance in a host, such as mammalian cells or bacteria. Mammalian replication 
systems include those derived from animal viruses, which require trans-acting factors to 
replicate. For example, plasmids containing the replication systems of papovaviruses, such as 
SV40 (Gluzman (1981) Cell 25:175) or polyomavirus, replicate to extremely high copy 
number in the presence of the appropriate viral T antigen. Additional examples of 
mammalian replicons include those derived from bovine papillomavirus and Epstein-Barr 
virus. Additionally, the replicon may have two replication systems, thus allowing it to be 
maintained, for example, in mammalian cells for expression and in a prokaryotic host for 
cloning and amplification. Examples of such mammalian-bacteria shuttle vectors include 
pMT2 (Kaufman et al (1989) MoL Cell Biol 9:946) and pHEBO (Shimizu et al. (1986) Mol 
Cell Biol. 6:1074). 

The transformation procedure used depends upon the host to be transformed. Methods 
for introduction of heterologous polynucleotides into mammalian cells are known in the art 
and include dextran-mediated transfection, calcium phosphate precipitation, polybrene 
mediated transfection, protoplast fusion, electroporation, encapsulation of the 
polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei. 

Mammalian cell lines available as hosts for expression are known in the art and 
include many immortalized cell lines available from the American Type Culture Collection 
(ATCC), including but not limited to, Chinese hamster ovary (CHO) cells, HeLa cells, baby 
hamster kidney (BHK) cells, monkey kidney cells (COS), human hepatocellular carcinoma 
cells (e.g., Hep G2), and a number of other cell lines. 
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ii. Plant Cellular Expression Systems 

There are many plant cell culture and whole plant genetic expression systems known 
in the art. Exemplary plant cellular genetic expression systems include those described in 
patents, such as: U.S. 5,693,506; US 5,659,122; and US 5,608,143. Additional examples of 
genetic expression in plant cell culture has been described by Zenk, Phytochemistfy 30:3861 - 
3863 (1991). Descriptions of plant protein signal peptides may be found in addition to the 
references described above in Vaulcombe et al., Mol. Gen. Genet. 209:33-40 (1987); Chandler 
et al, Plant Molecular Biology 3:407-418 (1984); Rogers, J. Biol Chem. 260:3731-3738 
(1985); Rothstein et al., Gene 55:353-356 (1987); Whittier et al., Nucleic Acids Research 
15:2515-2535 (1987); Wirsel et al., Molecular Microbiology 3:3-14 (1989); Yu et aL, Gene 
122:247-253 (1992). A description of the regulation of plant gene expression by the 
phytohormone, gibberellic acid and secreted enzymes induced by gibberellic acid can be 
found in R.L. Jones and J. MacMillin, Gibberellins: in: Advanced Plant Physiology*,. Malcolm 
B. Wilkins, ed. ; 1984 Pitman Publishing Limited, London, pp. 21-52. References that 
describe other metabolically-regulated genes: Sheen, Plant Cell, 2:1027-1038(1990); Maas et 
aL, EMBOJ. 9:3447-3452 (1990); Benkel and Hickey, Proc. Natl. Acad. Set 84:1337-1339 
(1987) 

Typically, using techniques known in the art, a desired polynucleotide sequence is 
inserted into an expression cassette comprising genetic regulatory elements designed for 
operation in plants. The expression cassette is inserted into a desired expression vector with 
companion sequences upstream and downstream from the expression cassette suitable for 
expression in a plant host. The companion sequences will be of plasmid or viral origin and 
provide necessary characteristics to the vector to permit the vectors to move DNA from an 
original cloning host, such as bacteria, to the desired plant host. The basic bacterial/plant 
vector construct will preferably provide a broad host range prokaryote replication origin; a 
prokaryote selectable marker; and, for Agrobacterium transformations, T DNA sequences for 
Agrobacterium-mediated transfer to plant chromosomes. Where the heterologous gene is not 
readily amenable to detection, the construct will preferably also have a selectable marker gene 
suitable for determining if a plant cell has been transformed. A general review of suitable 
markers, for example for the members of the gra^: family, is found in Wilmink and Dons, 
1993, Plant Mol. BioL Reptr, 1 1(2): 1 65-1 85. 
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Sequences suitable for permitting integration of the heterologous sequence into the 
plant genome are also recommended. These might include transposon sequences and the like 
for homologous recombination as well as Ti sequences which permit random insertion of a 
heterologous expression cassette into a plant genome. Suitable prokaryote selectable markers 
include resistance toward antibiotics such as ampicillin or tetracycline. Other DNA sequences 
encoding additional functions may also be present in the vector, as is known in the art. 

The nucleic acid molecules of the subject invention may be included into an 
expression cassette for expression of the protein(s) of interest. Usually, there will be only one 
expression cassette, although two or more are feasible. The recombinant expression cassette 
will contain in addition to the heterologous protein encoding sequence the following 
elements, a promoter region, plant 5' untranslated sequences, initiation codon depending upon 
whether or not the structural gene comes equipped with one, and a transcription and 
translation termination sequence. Unique restriction enzyme sites at the 5' and 3' ends of the 
cassette allow for easy insertion into a pre-existing vector. 

A heterologous coding sequence may be for any protein relating to the present 
invention. The sequence encoding the protein of interest will encode a signal peptide whL!: 
allows processing and translocation of the protein, as appropriate, and will usually lack any 
sequence which might result in the binding of the desired protein of the invention to a 
membrane. Since, for the most part, the transcriptional initiation region will be for a gene 
which is expressed and translocated during germination, by employing the signal peptide 
which provides for translocation, one may also provide for translocation of the protein of 
interest. In this way, the protein(s) of interest will be translocated from the cells in which they 
are expressed and may be efficiently harvested. Typically secretion in seeds are across the 
aleurone or scutellar epithelium layer into the endosperm of the seed. While it is not required 
that the protein be secreted from the cells in which the protein is produced, this facilitates the 
isolation and purification of the recombinant protein. 

Since the ultimate expression of the desired gene product will be in a eucaryotic cell it 
is desirable to determine whether any portion of the cloned gene contains sequences which 
will be processed out as introns by the host's splicosome machinery. If so, site-directed 
mutagenesis of the "intron" region may be conducted to prevent losing a portion of the genetic 
message as a false intron code, Reed and Maniatis, Cell 41:95-105, 1985. 
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The vector can be microinjected directly into plant cells by use of micropipettes to 
mechanically transfer the recombinant DNA. Crossway, Mol. Gen. Genet, 202:179-185, 
1985. The genetic material may also be transferred into the plant cell by using polyethylene 
glycol, Krens, et aL Nature, 296, 72-74, 1982. Another method of introduction of nucleic 
acid segments is high velocity ballistic penetration by small particles with the nucleic acid 
either within the matrix of small beads or particles, or on the surface, Klein, et al., Nature, 
327, 70-73, 1987 and Knudsen and Muller, 1991, Planta, 185:330-336 teaching particle 
bombardment of barley endosperm to create transgenic barley. Yet another method of 
introduction would be fusion of protoplasts with other entities, either minicells, cells, 
lysosomes or other fusible lipid-surfaced bodies, Fraley, et al., Proa Natl. Acad. ScL USA, 79, 
1859-1863, 1982. 

The vector may also be introduced into the plant cells by electroporation. (Fromm et 
al., Proc. Natl Acad. Sci. USA 82:5824, 1985). In this technique, plant protoplasts are 
electroporated in the presence of plasmids containing the gene construct. Electrical impulses 
of high field strength reversibly permeabilize biomembranes allowing the introduction of the 
plasmids. Electroporated plant protoplasts reform the cell wall, divide, and form plant callus. 

All plants from which protoplasts can be isolated and cultured to give whole 
regenerated plants can be transformed by the present invention so that whole plants are 
recovered which contain the transferred gene. It is known that practically all plants can be 
regenerated from cultured cells or tissues, including but not limited to all major species of 
sugarcane, sugar beet, cotton, fruit and other trees, legumes and vegetables. Some suitable 
plants include, for example, species from the genera Fragaria, Lotus, Medicago, Onobrychis, 
Trifolium, Trigonella, Vigna, Citrus, Linum, Geranium, Manihot, Daucus, Arabidopsis, 
Brassica, Raphamts, Sinapis, Atropa, Capsicum, Datura, Hyoscyamus, Lycopersion, 
Nicotiana, Solarium, Petunia, Digitalis, Majorana, Cichorium, Helianthus, Lactuca, Bromus, 
Asparagus, Antirrhinum, Hererocallis, Nemesia, Pelargonium, Panicum, Pennisetum, 
Ranunculus, Senecio, Salpiglossis, Cucumis, Browaalia, Glycine, Lolium, Zea, Triticum, 
Sorghum, and Datura. 

Means for regeneration vary from species to species of plants, but generally a 
suspension of transformed protoplasts containing copies of the heterologous gene is first 
provided. Callus tissue is formed and shoots may be induced from callus and subsequently 
rooted. Alternatively, embryo formation can be induced from the protoplast suspension. 
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These embryos germinate as natural embryos to form plants. The culture media will generally 
contain various amino acids and hormones, such as auxin and cytokinins. It is also 
advantageous to add glutamic acid and proline to the medium, especially for such species as 
corn and alfalfa. Shoots and roots normally develop simultaneously. Efficient regeneration 
will depend on the medium, on the genotype, and on the history of the culture. If these three 
variables are controlled, then regeneration is fully reproducible and repeatable. 

In some plant cell culture systems, the desired protein of the invention may be 
excreted or alternatively, the protein may be extracted from the whole plant. Where the 
desired protein of the invention is secreted into the medium, it may be collected. 
Alternatively, the embryos and embryoless-half seeds or other plant tissue may be 
mechanically disrupted to release any secreted protein between cells and tissues. The mixture 
may be suspended in a buffer solution to retrieve soluble proteins. Conventional protein 
isolation and purification methods will be then used to purify the recombinant protein. 
Parameters of time, temperature pH, oxygen, and volumes will be adjusted through routine 
methods to optimize expression and recovery of heterologous protein. 

iii. Baculovirus Systems 

The polynucleotide encoding the protein can also be inserted into a suitable insect 
expression vector, and is operably linked to the control elements within that vector. Vector 
construction employs techniques which are known in the art. Generally, the components of 
the expression system include a transfer vector, usually a bacterial plasmid, which contains 
both a fragment of the baculovirus genome, and a convenient restriction site for insertion of 
the heterologous gene or genes to be expressed; a wild type baculovirus with a sequence 
homologous to the baculovirus-specific fragment in the transfer vector (this allows for the 
homologous recombination of the heterologous gene in to the baculovirus genome); and 
appropriate insect host cells and growth media. 

After inserting the DNA sequence encoding the protein into the transfer vector, the 
vector and the wild type viral genome are transfected into an insect host cell where the vector 
and viral genome are allowed to recombine. The packaged recombinant virus is expressed 
and recombinant plaques are identified and purified. Materials and methods for 
baculovirus/insect cell expression systems are commercially available in kit form from, inter 
alia, Invitrogen, San Diego CA ("MaxBac" kit). These techniques are generally known to 



WO 99/57280 



16 



PCT/US99/09346 



those skilled in the art and fully described in Summers and Smith, Texas Agricultural 
Experiment Station Bulletin No. 1555 (1987) (hereinafter "Summers and Smith"). 

Prior to inserting the DNA sequence encoding the protein into the baculovirus 
genome, the above described components, comprising a promoter, leader (if desired), coding 
sequence of interest, and transcription termination sequence, are usually assembled into an 
intermediate transplacement construct (transfer vector). This construct may contain a single 
gene and operably linked regulatory elements; multiple genes, each with its owned set of 
operably linked regulatory elements; or multiple genes, regulated by the same set of 
regulatory elements. Intermediate transplacement constructs are often maintained in a 
replicon, such as an extrachromosomal element (e.g., plasmids) capable of stable maintenance 
in a host, such as a bacterium. The replicon will have a replication system, thus allowing it to 
be maintained in a suitable host for cloning and amplification. 

Currently, the most commonly used transfer vector for introducing foreign genes into 
AcNPV is pAc373. Many other vectors, known to those of skill in the art, have also been 
designed. These include, for example, pVL985 (which alters the polyhedrin start codon from 
ATG to ATT, and which introduces a BamHI cloning site 32 basepairs downstream from the 
ATT; see Luckow and Summers, Virology (1989) 1 7:31 . 

The plasmid usually also contains the polyhedrin polyadenylation signal (Miller et al. 
(1988) Ann. Rev. Microbiol, 42A11) and a prokaryotic ampicillin-resistance (amp) gene and 
origin of replication for selection and propagation in £. coli. 

Baculovirus transfer vectors usually contain a baculovirus promoter. A baculovirus 
promoter is any DNA sequence capable of binding a baculovirus RNA polymerase and 
initiating the downstream (5' to 3') transcription of a coding sequence (e.g., structural gene) 
into mRNA. A promoter will have a transcription initiation region which is usually placed 
proximal to the 5' end of the coding sequence. This transcription initiation region usually 
includes an RNA polymerase binding site and a transcription initiation site. A baculovirus 
transfer vector may also have a second domain called an enhancer, which, if present, is 
usually distal to the structural gene. Expression may be either regulated or constitutive. 

Structural genes, abundantly transcribed at late times in a viral infection cycle, provide 
particularly useful promoter sequences. Examp; " include sequences derived from the gene 
encoding the viral polyhedron protein, Friesen et ah, (1986) "The Regulation of Baculovirus 
Gene Expression," in: The Molecular Biology of Baculoviruses (ed. Walter Doerfler); EPO 
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Publ. Nos. 127 839 and 155 476; and the gene encoding the pi 0 protein, Vlak et al., (1988), J. 
Gen. Virol. 69:765. 

DNA encoding suitable signal sequences can be derived from genes for secreted insect 
or baculovirus proteins, such as the baculovirus polyhedrin gene (Carbonell et al. (1988) 
Gene, 73:409). Alternatively, since the signals for mammalian cell posttranslational 
modifications (such as signal peptide cleavage, proteolytic cleavage, and phosphorylation) 
appear to be recognized by insect cells, and the signals required for secretion and nuclear 
accumulation also appear to be conserved between the invertebrate cells and vertebrate cells, 
leaders of non-insect origin, such as those derived from genes encoding human (alpha) a- 
interferon, Maeda et al, (1985), Nature 375:592; human gastrin-releasing peptide, Lebacq- 
Verheyden et al., (1988), Molec. Cell Biol. 5:3129; human IL-2, Smith et al., (1985) Proc. 
Nat'lAcad. ScL USA, 52:8404; mouse IL-3, (Miyajima et al, (1987) Gene 55:273; and human 
glucocerebrosidase, Martin et al. (1988) DNA, 7:99, can also be used to provide for secretion 
in insects. 

A recombinant polypeptide or polyprotein may be expressed intracellular^ or, if it is 
expressed with the proper regulatory sequences, it can be secreted. Good intracellular 
expression of nonfused foreign proteins usually requires heterologous genes that ideally have 
a short leader sequence containing suitable translation initiation signals preceding an ATG 
start signal. If desired, methionine at the N-terminus may be cleaved from the mature protein 
by in vitro incubation with cyanogen bromide. 

Alternatively, recombinant polyproteins or proteins which are not naturally secreted 
can be secreted from the insect cell by creating chimeric DNA molecules that encode a fusion 
protein comprised of a leader sequence fragment that provides for secretion of the foreign 
protein in insects. The leader sequence fragment usually encodes a signal peptide comprised 
of hydrophobic amino acids which direct the translocation of the protein into the endoplasmic 
reticulum. 

After insertion of the DNA sequence and/or the gene encoding the expression product 
precursor of the protein, an insect cell host is co-transformed with the heterologous DNA of 
the transfer vector and the genomic DNA of wild type baculovirus - usually by co- 
transfection. The promoter and transcnption termination sequence of the construct will 
usually comprise a 2-5kb section of the baculovirus genome. Methods for introducing 
heterologous DNA into the desired site in the baculovirus virus are known in the art. (See 
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Summers and Smith supra; Ju et al. (1987); Smith et al., Mol Cell Biol. (1983) 5:2156; and 
Luckovv and Summers (1989)). For example, the insertion can be into a gene such as the 
polyhedrin gene, by homologous double crossover recombination; insertion can also be into a 
restriction enzyme site engineered into the desired baculovirus gene. Miller et al., (1989), 
Bioessays 4:91. The DNA sequence, when cloned in place of the polyhedrin gene in the 
expression vector, is flanked both 5' and 3' by polyhednn-specific sequences and is positioned 
downstream of the polyhedrin promoter. 

The newly formed baculovirus expression vector is subsequently packaged into an 
infectious recombinant baculovirus. Homologous recombination occurs at low frequency 
(between about 1% and about 5%); thus, the majority of the virus produced after 
cotransfection is still wild-type virus. Therefore, a method is necessary to identify 
recombinant viruses. An advantage of the expression system is a visual screen allowing 
recombinant viruses to be distinguished. The polyhedrin protein, which is produced by the 
native virus, is produced at very high levels in the nuclei of infected cells at late times after 
viral infection. Accumulated polyhedrin protein forms occlusion bodies that also contain 
embedded particles. These occlusion bodies, up to 15 jim in size, are highly refractile, giving 
them a bright shiny appearance that is readily visualized under the light microscope. Cells 
infected with recombinant viruses lack occlusion bodies. To distinguish recombinant virus 
from wild-type virus, the transfection supernatant is plaqued onto a monolayer of insect cells 
by techniques known to those skilled in the art. Namely, the plaques are screened under the 
light microscope for the presence (indicative of wild-type virus) or absence (indicative of 
recombinant virus) of occlusion bodies. Current Protocols in Microbiology Vol. 2 (Ausubel 
et al. eds) at 16.8 (Supp. 10, 1990); Summers and Smith, supra- Miller et al. (1989). 

Recombinant baculovirus expression vectors have been developed for infection into 
several insect cells. For example, recombinant baculoviruses have been developed for, inter 
alia: Aedes aegypti , Autographa calif ornica, Bombyx mori, Drosophila melanogaster, 
Spodoptera frugiperda, and Trichoplusia ni (PCT Pub. No. WO 89/046699; Carbonell et al., 
(1985) J, Virol 56:153; Wright (1986) Nature 527:718; Smith et al., (1983) Mol Cell Biol 
3:2156; and see generally, Fraser, et al (1989) In Vitro Cell Dev. Biol 25:225). 

Cells and cell culture media are commercially available for both direct and fusion 
expression of heterologous polypeptides in a baculovirus/expression system; cell culture 
technology is generally known to those skilled in the art. See, e.g.. Summers and Smith supra. 
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The modified insect cells may then be grown in an appropriate nutrient medium, 
which allows for stable maintenance of the plasmid(s) present in the modified insect host. 
Where the expression product gene is under inducible control, the host may be grown to high 
density, and expression induced. Alternatively, where expression is constitutive, the product 
will be continuously expressed into the medium and the nutrient medium must be 
continuously circulated, while removing the product of interest and augmenting depleted 
nutrients- The product may be purified by such techniques as chromatography, e.g., HPLC, 
affinity chromatography, ion exchange chromatography, etc.; electrophoresis; density gradient 
centrifugation; solvent extraction, or the like. As appropriate, the product may be further 
purified, as required, so as to remove substantially any insect proteins which are also secreted 
in the medium or result from lysis of insect cells, so as to provide a product which is at least 
substantially free of host debris, e.g., proteins, lipids and polysaccharides. 

In order to obtain protein expression, recombinant host cells derived from the 
transformants are incubated under conditions which allow expression of the recombinant 
protein encoding sequence. These conditions will vary, dependent upon the host cell selected. 
However, the conditions are readily ascertainable to those of ordinary skill in the art, based 
upon what is known in the art. 

iv. Bacterial Systems 

Bacterial expression techniques are known in the art. A bacterial promoter is any DNA 
sequence capable of binding bacterial RNA polymerase and initiating the downstream (3') 
transcription of a coding sequence (e.g. structural gene) into mRNA. A promoter will have a 
transcription initiation region which is usually placed proximal to the 5' end of the coding 
sequence. This transcription initiation region usually includes an RNA polymerase binding 
site and a transcription initiation site. A bacterial promoter may also have a second domain 
called an operator, that may overlap an adjacent RNA polymerase binding site at which RNA 
synthesis begins. The operator permits negative regulated (inducible) transcription, as a gene 
repressor protein may bind the operator and thereby inhibit transcription of a specific gene. 
Constitutive expression may occur in the absence of negative regulatory elements, such as the 
operator. In addition, positive regulation may be achieved by a gene activator protein binding 
sequence, which, if present is usually proximal (5') to the RNA polymerase binding sequence. 
An example of a gene activator protein is the catabolite activator protein (CAP), which helps 
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initiate transcription of the lac operon in Escherichia coli (E. coli) (Raibaud et aL (1984) 
Annu. Rev. Genet. 75:173). Regulated expression may therefore be either positive or negative, 
thereby either enhancing or reducing transcription. 

Sequences encoding metabolic pathway enzymes provide particularly useful promoter 
sequences. Examples include promoter sequences derived from sugar metabolizing enzymes, 
such as galactose, lactose (lac) (Chang et al. (1977) Nature J 98: 1056), and maltose. 
Additional examples include promoter sequences derived from biosynthetic enzymes such as 
tryptophan (trp) (Goeddel et aL (1980) Nuc. Acids Res. 5:4057; Yelverton et aL (1981) Nucl. 
Acids Res. 9:731; U.S. Patent 4,738,921; EPO Publ. Nos. 036 776 and 121 775). The beta- 
lactamase (bid) promoter system (Weissmann (1981) "The cloning of interferon and other 
mistakes." In Interferon 3 (ed. I. Gresser)), bacteriophage lambda PL (Shimatake et aL (1981) 
Nature 292:128) and T5 (U.S. Patent 4,689,406) promoter systems also provide useful 
promoter sequences. 

In addition, synthetic promoters which do not occur in nature also function as bacterial 
promoters. For example, transcription activation sequences of one bacterial or bacteriophage 
promoter may be joined with the operon sequences of another bacterial or bacteriophage 
promoter, creating a synthetic hybrid promoter (U.S. Patent 4,551,433). For example, the tac 
promoter is a hybrid trp-lac promoter comprised of both trp promoter and lac operon 
sequences that is regulated by the lac repressor (Amann et aL (1 983) Gene 25: 1 67; de Boer et 
aL (1983) Proc. Natl. Acad, Sci. 50:21). Furthermore, a bacterial promoter can include 
naturally occurring promoters of non-bacterial origin that have the ability to bind bacterial 
RNA polymerase and initiate transcription. A naturally occurring promoter of non-bacterial 
origin can also be coupled with a compatible RNA polymerase to produce high levels of 
expression of some genes in prokaryotes. The bacteriophage T7 RNA polymerase/promoter 
system is an example of a coupled promoter system (Studier et aL (1986) J. Mol. Biol. 
759:1 13; Tabor et aL (1985) Proc Natl. Acad. Sci. 52:1074). In addition, a hybrid promoter 
can also be comprised of a bacteriophage promoter and an E. coli operator region (EPO Publ. 
No. 267 851). 

In addition to a functioning promoter sequence, an efficient ribosome binding site is 
also useful for the expression of foreign genes in r vokaryotes. In E. coli. the ribosome binding 
site is called the Shine-Dalgamo (SD) sequence and includes an initiation codon (ATG) and a 
sequence 3-9 nucleotides in length located 3-1 1 nucleotides upstream of the initiation codon 
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(Shine et al (1975) Nature 254:34). The SD sequence is thought to promote binding of 
mRNA to the ribosome by the pairing of bases between the SD sequence and the 3' end of 
E. coli 16S rRNA (Steitz et al (1979) "Genetic signals and nucleotide sequences in 
messenger RNA." In Biological Regulation and Development: Gene Expression (ed. R.F. 
Goldberger)). To express eukaryotic genes and prokaryotic genes with weak ribosome- 
binding site, it is often necessary to optimize the distance between the SD sequence and the 
ATG of the eukaryotic gene (Sambrook et al (1989) "Expression of cloned genes in 
Escherichia coli." In Molecular Cloning: A Laboratory Manual). 

A DNA molecule may be expressed intracellularly. A promoter sequence may be 
directly linked with the DNA molecule, in which case the first amino acid at the N-terminus 
will always be a methionine, which is encoded by the ATG start codon. If desired, methionine 
at the N-terminus may be cleaved from the protein by in vitro incubation with cyanogen 
bromide or by either in vivo or in vitro incubation with a bacterial methionine N-terminal 
peptidase (EPO Publ. No. 219 237). 

Fusion proteins provide an alternative to direct expression. Usually, a DNA sequence 
encoding the N-terminal portion of an endogenous bacterial protein, or other stable protein is 
fused to the 5' end of heterologous coding sequences. Upon expression, this construct will 
provide a fusion of the two amino acid sequences. For example, the bacteriophage lambda cell 
gene can be linked at the 5' terminus of a foreign gene and expressed in bacteria. The resulting 
fusion protein preferably retains a site for a processing enzyme (factor Xa) to cleave the 
bacteriophage protein from the foreign gene (Nagai et al. (1984) Nature 309:810). Fusion 
proteins can also be made with sequences from the lacZ (Jia et al. (1 987) Gene 60: 197), trpE 
(Allen et al. (1987) J. Biotechnol. 5:93; Makoff et al (1989) J. Gen. Microbiol 135:1 1), and 
Chey (EPO Publ. No. 324 647) genes. The DNA sequence at the junction of the two amino 
acid sequences may or may not encode a cleavable site. Another example is a ubiquitin fusion 
protein. Such a fusion protein is made with the ubiquitin region that preferably retains a site 
for a processing enzyme (e.g. ubiquitin specific processing-protease) to cleave the ubiquitin 
from the foreign protein. Through this method, native foreign protein can be isolated (Miller 
et al (1989) Bio/Technology 7:698). 

Alternatively, foreign proteins can also be secreted from the cell by creating chimeric 
DNA molecules that encode a fusion protein comprised of a signal peptide sequence fragment 
that provides for secretion of the foreign protein in bacteria (U.S. Patent 4,336,336). The 
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signal sequence fragment usually encodes a signal peptide compnsed of hydrophobic amino 
acids which direct the secretion of the protein from the cell. The protein is either secreted into 
the growth media (gram-positive bacteria) or into the periplasmic space, located between the 
inner and outer membrane of the cell (gram-negative bacteria). Preferably there are processing 
sites, which can be cleaved either in vivo or in vitro encoded between the signal peptide 
fragment and the foreign gene. 

DNA encoding suitable signal sequences can be derived from genes for secreted 
bacterial proteins, such as the E. coli outer membrane protein gene (ompA) (Masui et al. 
(1983), in: Experimental Manipulation of Gene Expression', Ghrayeb et al. (1984) EMBO 7. 
3:2437) and the E. coli alkaline phosphatase signal sequence (phoA) (Oka et al. (1985) Proc. 
Natl Acad. Sci. 52:7212). As an additional example, the signal sequence of the alpha-amylase 
gene from various Bacillus strains can be used to secrete heterologous proteins from B. 
subtilis (Palva et al. (1982) Proc. Natl. Acad. Sci. USA 79:5582; EPO Publ. No. 244 042). 

Usually, transcription termination sequences recognized by bacteria are regulatory 
regions located 3' to the translation stop codon, and thus together with the promoter flank the 
coding sequence. These sequences direct the transcription of an mRNA which can be 
translated into the polypeptide encoded by the DNA. Transcription termination sequences 
frequently include DNA sequences of about 50 nucleotides capable of forming stem loop 
structures that aid in terminating transcription. Examples include transcription termination 
sequences derived from genes with strong promoters, such as the trp gene in E. coli as well as 
other biosynthetic genes. 

Usually, the above described components, comprising a promoter, signal sequence (if 
desired), coding sequence of interest, and transcription termination sequence, are put together 
into expression constructs. Expression constructs are often maintained in a replicon, such as 
an extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as 
bacteria. The replicon will have a replication system, thus allowing it to be maintained in a 
prokaryotic host either for expression or for cloning and amplification. In addition, a replicon 
may be either a high or low copy number plasmid. A high copy number plasmid will 
generally have a copy number ranging from about 5 to about 200, and usually about 10 to 
about 150. A host containing a high copy number plasmid will preferably contain at least 
about 10, and more preferably at least about 20 plasmids. Either a high or low copy number 
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vector may be selected, depending upon the effect of the vector and the foreign protein on the 
host. 

Alternatively, the expression constructs can be integrated into the bacterial genome 
with an integrating vector. Integrating vectors usually contain at least one sequence 
homologous to the bacterial chromosome that allows the vector to integrate. Integrations 
appear to result from recombinations between homologous DNA in the vector and the 
bacterial chromosome. For example, integrating vectors constructed with DNA from various 
Bacillus strains integrate into the Bacillus chromosome (EPO Publ. No. 127 328). Integrating 
vectors may also be comprised of bacteriophage or transposon sequences. 

Usually, extrachromosomal and integrating expression constructs may contain 
selectable markers to allow for the selection of bacterial strains that have been transformed. 
Selectable markers can be expressed in the bacterial host and may include genes which render 
bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin, kanarnycin 
(neomycin), and tetracycline (Davies et al (1978) Annu. Rev. Microbiol 32:469). Selectable 
markers may also include biosynthetic genes, such as those in the histidine, tryptophan, and 
leucine biosynthetic pathways. 

Alternatively, some of the above described components can be put together in 
transformation vectors. Transformation vectors are usually comprised of a selectable market 
that is either maintained in a replicon or developed into an integrating vector, as described 
above. 

Expression and transformation vectors, either extra-chromosomal replicons or 
integrating vectors, have been developed for transformation into many bacteria. For example, 
expression vectors have been developed for, inter alia, the following bacteria: Bacillus 
subtilis (Palva et al (1982) Proc. Natl Acad. ScL USA 79:5582; EPO Publ. Nos. 036 259 and 
063 953; PCT Publ. No. WO 84/04541), Escherichia coli (Shimatake et al (1981) Nature 
292:128; Amann et al (1985) Gene 40:183; Studier et al (1986) J. Mol Biol 759:1 13; EPO 
Publ. Nos. 036 776, 136 829 and 136 907), Streptococcus cremons (Powell et al (1988) Appl 
Environ. Microbiol 54.655); Streptococcus lividans (Powell et al (1988) Appl Environ. 
Microbiol 54:655), Streptomyces lividans (U.S. Patent 4,745,056). 

Methods of introducing exogenous DNA into bacterial hosts are well-known in the art, 
and usually include either the transformation of bacteria treated with CaCl 2 or other agents, 
such as divalent cations and DMSO. DNA can also be introduced into bacterial cells by 
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electroporation. Transformation procedures usually vary with the bacterial species to be 
transformed. (See e.g., use of Bacillus: Masson et al. (1989) FEMS Microbiol. Lett. 60:273; 
Palva et al. (1982) Proa Natl. Acad. Sci. USA 79:5582; EPO Publ. Nos. 036 259 and 063 953; 
PCT Publ. No. WO 84/04541; use of Campylobacter: Miller et al. (1988) Proc. Natl Acad. 
Sci. &5:856; and Wang et al. (1990) J. Bacterioi 1 72:949; use of Escherichia coli: Cohen et 
al. (1973) Proc. Natl. Acad. Sci. 69:21 10; Dower et al. (1988) Nucleic Acids Res. 7(5:6127; 
Kushner (1978) "An improved method for transformation of Escherichia coli with ColEl- 
derived plasmids. In Genetic Engineering; Proceedings of the International Symposium on 
Genetic Engineering (eds. H.W. Boyer and S. Nicosia); Mandel et al. (1970) J. Mol. Biol 
<>3:159; Taketo (1988) Biochim. Biophys. Acta 949:318; use of Lactobacillus: Chassy et al. 
(1987) FEMS Microbiol Lett. 44:\11>\ use of Pseudomonas: Fiedler e/ al. (1988) Anal 
Biochem 770:38; use of Staphylococcus: Augustin et al (1990) FEMS Microbiol Lett. 
66:203; use of Streptococcus: Barany et al. (1980) J. Bacterioi 744:698; Harlander (1987) 
"Transformation of Streptococcus lactis by electroporation, in: Streptococcal Genetics (ed. J. 
Ferretti and R. Curtiss III); Perry et al (1981) Infect. Immun. 32:1295; Powell et al (1988) 
Appl Environ. Microbiol 54:655; Somkuti et al. (1987) Proc. 4th Evr. Cong. Biotechnology? 
7:412. 

v. Yeast Expression 

Yeast expression systems are also known to one of ordinary' skill in the art. A yeast 
promoter is any DNA sequence capable of binding yeast RNA polymerase and initiating the 
downstream (3') transcription of a coding sequence (e.g. structural gene) into mRNA. A 
promoter will have a transcription initiation region which is usually placed proximal to the 5* 
end of the coding sequence. This transcription initiation region usually includes an RNA 
polymerase binding site (the "TATA Box") and a transcription initiation site. A yeast 
promoter may also have a second domain called an upstream activator sequence (UAS), 
which, if present, is usually distal to the structural gene. The UAS permits regulated 
(inducible) expression. Constitutive expression occurs in the absence of a UAS. Regulated 
expression may be either positive or negative, thereby either enhancing or reducing 
transcription. 

Yeast is a fermenting organism with an active metabolic pathway, therefore sequences 
encoding enzymes in the metabolic pathway provide particularly useful promoter sequences. 
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Examples include alcohol dehydrogenase (ADH) (EPO Publ. No. 284 044). enolase, 
glucokinase, glucose-6-phosphate isomerase, glyceraIdehyde-3-phosphate-dehydrogenase 
(GAP or GAPDH), hexokinase, phosphofructokinase, 3-phosphoglycerate mutase, and 
pyruvate kinase (PyK) (EPO Publ. No. 329 203). The yeast PH05 gene, encoding acid 
phosphatase, also provides useful promoter sequences (Myanohara et al (1983) Proc. Natl 
Acad. Set USA 80:1). 

In addition, synthetic promoters which do not occur in nature also function as yeast 
promoters. For example, UAS sequences of one yeast promoter may be joined with the 
transcription activation region of another yeast promoter, creating a synthetic hybrid 
promoter. Examples of such hybrid promoters include the ADH regulatory sequence linked to 
the GAP transcription activation region (U.S. Patent Nos. 4,876,197 and 4,880,734). Other 
examples of hybrid promoters include promoters which consist of the regulatory sequences of 
either \hzADH2, GALA, GAL10, OR PH05 genes, combined with the transcriptional 
activation region of a glycolytic enzyme gene such as GAP or PyK (EPO Publ. No. 164 556). 
Furthermore, a yeast promoter can include naturally occurring promoters of non-yeast origin 
that have the ability to bind yeast RNA polymerase and initiate transcription. Examples of 
such promoters include, inter alia, (Cohen et al. (1980) Proc. Natl Acad. Sci. USA 77:1078; 
Henikoff et al. (1981) Nature 253:835; Hollenberg et al. (1981) Curr. Topics Microbiol. 
Immunol. 96:1 19; Hollenberg et al. (1979) "The Expression of Bacterial Antibiotic Resistance 
Genes in the Yeast Saccharomyces cerevisiae," in: Plasmids of Medical Environmental and 
Commercial Importance (eds, K.N. Timmis and A. Puhler); Mercerau-Puigalon et al. (1980) 
Gene 77:163; Panthier et al. (1980) Curr. Genet. 2:109;). 

A DNA molecule may be expressed intracellularly in yeast. A promoter sequence may 
be directly linked with the DNA molecule, in which case the first amino acid at the N- 
terminus of the recombinant protein will always be a methionine, which is encoded by the 
ATG start codon. If desired, methionine at the N-terminus may be cleaved from the protein by 
in vitro incubation with cyanogen bromide. 

Fusion proteins provide an alternative for yeast expression systems, as well as in 
mammalian, plant, baculovirus, and bacterial expression systems. Usually, a DNA sequence 
encoding the N-terminal portion of an endogenous yeast protein, or other stable protein, is 
fused to the 5' end of heterologous coding sequences. Upon expression, this construct will 
provide a fusion of the two amino acid sequences. For example, the yeast or human 
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superoxide dismutase (SOD) gene, can be linked at the 5* terminus of a foreign gene and 
expressed in yeast. The DNA sequence at the junction of the two amino acid sequences may 
or may not encode a cleavable site. See e.g., EPO Publ. No. 196056. Another example is a 
ubiquitin fusion protein. Such a fusion protein is made with the ubiquitin region that 
preferably retains a site for a processing enzyme (e.g. ubiquitin-specific processing protease) 
to cleave the ubiquitin from the foreign protein. Through this method, therefore, native 
foreign protein can be isolated (e.g., WO88/024066). 

Alternatively, foreign proteins can also be secreted from the cell into the growth media 
by creating chimeric DNA molecules that encode a fusion protein comprised of a leader 
sequence fragment that provide for secretion in yeast of the foreign protein. Preferably, there 
are processing sites encoded between the leader fragment and the foreign gene that can be 
cleaved either in vivo or in vitro. The leader sequence fragment usually encodes a signal 
peptide comprised of hydrophobic amino acids which direct the secretion of the protein from 
the cell. 

DNA encoding suitable signal sequences can be derived from genes for secreted yeast 
proteins, such as the yeast invertase gene (EPO Publ. No. 012 873; JPO Publ. No. 
62:096,086) and the A-factor gene (U.S. Patent 4,588,684). Alternatively, leaders of non- 
yeast origin, such as an interferon leader, exist that also provide for secretion in yeast (EPO 
Publ. No. 060 057). 

A preferred class of secretion leaders are those that employ a fragment of the yeast 
alpha-factor gene, which contains both a "pre" signal sequence, and a "pro" region. The types 
of alpha- factor fragments that can be employed include the full-length pre-pro alpha factor 
leader (about 83 amino acid residues) as well as truncated alpha-factor leaders (usually about 
25 to about 50 amino acid residues) (U.S. Patent Nos. 4,546,083 and 4,870,008; EPO Publ. 
No. 324 274). Additional leaders employing an alpha-factor leader fragment that provides for 
secretion include hybrid alpha-factor leaders made with a presequence of a first yeast, but a 
pro-region from a second yeast alphafactor. (See e.g., PCT Publ. No. WO 89/02463.) 

Usually, transcnption termination sequences recognized by yeast are regulatory 
regions located 3' to the translation stop codon, and thus together with the promoter flank the 
coding sequence. These sequences direct the transcription of an mRNA which can be 
translated into the polypeptide encoded by the DNA. Examples of transcription terminator 
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sequence and other yeast-recognized termination sequences, such as those coding for 
glycolytic enzymes. 

Usually, the above described components, comprising a promoter, leader (if desired), 
coding sequence of interest, and transcription termination sequence, are put together into 
expression constructs. Expression constructs are often maintained in a replicon, such as an 
extrachromosomal element (e.g., plasmids) capable of stable maintenance in a host, such as 
yeast or bacteria. The replicon may have two replication systems, thus allowing it to be 
maintained, for example, in yeast for expression and in a prokaryotic host for cloning and 
amplification. Examples of such yeast-bacteria shuttle vectors include YEp24 (Botstein et al 
(1979) Gene 5:17-24), pCl/1 (Brake et nl (1984) Proc. Natl Acad. Sci USA 57:4642-4646), 
and YRpl7 (Stinchcomb et al. (1982) J. Mol. Biol 158:151), In addition, a replicon may be 
either a high or low copy number plasmid. A high copy number plasmid will generally have a 
copy number ranging from about 5 to about 200, and usually about 10 to about 1 50. A host 
containing a high copy number plasmid will preferably have at least about 10, and more 
preferably at least about 20. Enter a high or low copy number vector may be selected, 
depending upon the effect of the vector and the foreign protein on the host. See e.g., Brake et 
al, supra. 

Alternatively, the expression constructs can be integrated into the yeast genome with 
an integrating vector. Integrating vectors usually contain at least one sequence homologous to 
a yeast chromosome that allows the vector to integrate, and preferably contain two 
homologous sequences flanking the expression construct. Integrations appear to result from 
recombinations between homologous DNA in the vector and the yeast chromosome (Orr- 
Weaver et al (1983) Methods in Enzymol 707:228-245). An integrating vector may be 
directed to a specific locus in yeast by selecting the appropriate homologous sequence for 
inclusion in the vector. See Orr- Weaver et al, supra. One or more expression construct may 
integrate, possibly affecting levels of recombinant protein produced (Rine et al (1983) Proc. 
Natl Acad. ScL USA 80:6150). The chromosomal sequences included in the vector can occur 
either as a single segment in the vector, which results in the integration of the entire vector, or 
two segments homologous to adjacent segments in the chromosome and flanking the 
expression construct in the vector, which can result in the stable integration of only the 
expression construct. 



WO 99/57280 



28 



PCT7US99/09346 



Usually, extrachromosomal and integrating expression constructs may contain 
selectable markers to allow for the selection of yeast strains that have been transformed. 
Selectable markers may include biosynthetic genes that can be expressed in the yeast host, 
such as ADE2, HIS4, LEU 2, TRPJ, and ALG7, and the G41 8 resistance gene, which confer 
resistance in yeast cells to tunicamycin and G41 8, respectively. In addition, a suitable 
selectable marker may also provide yeast with the ability to g/ow in the presence of toxic 
compounds, such as metal. For example, the presence of CUP1 allows yeast to grow in the 
presence of copper ions (Butt et al. (1987) Microbiol, Rev. 51:351). 

Alternatively, some of the above described components can be put together into 
transformation vectors. Transformation vectors are usually comprised of a selectable marker 
that is either maintained in a replicon or developed into an integrating vector, as described 
above. 

Expression and transformation vectors, either extrachromosomal replicons or 
integrating vectors, have been developed for transformation into many yeasts. For example, 
expression vectors and methods of introducing exogenous DNA into yeast hosts have been 
developed for, inter alia, the following yeasts: Candida albicans (Kurtz, et al. (1986) Mol 
Cell Biol. 6:142); Candida maltosa (Kunze, et ai (1985) J. Basic Microbiol 25:141); 
Hansenula polymorpha (Gleeson, et al (1986)7. Gen. Microbiol 1 32:3459; Roggenkamp et 
al (1986) Mol Gen. Genet. 202:302); Kluyveromyces fragilis (Das, et al (1984) 7 Bacteriol 
158:1 165); Kluyveromyces lactis (De Louvencourt et al. (1983) J. Bacteriol 154:737; Van 
den Berg et al (1990) Bio/Technology 5:135); Pichia guillerimondii (Kunze et al (1985) 7 
Basic Microbiol 25:141); Pichia pastoris (Cregg, et al (1985) Mol Cell Biol 5:3376; U.S. 
Patent Nos. 4,837,148 and 4,929,555); Sacchuromyces cerevisiae (Hinnen et al (1978) Proc, 
Natl Acad. Sci. USA 75:1929; Ito et al (1983)7. Bacteriol 753:163); Schizosaccharomyces 
pombe (Beach and Nurse (1981) Nature 300:706); and Yarrowia lipolytica (Davidow, et al 
(1985) Curr. Genet. 70:380471 Gaillardin, et al. (1985) Curr. Genet 10:49). 

Methods of introducing exogenous DNA into yeast hosts are well-known in the art, 
and usually include either the transformation of spheroplasts or of intact yeast cells treated 
with alkali cations. Transformation procedures usually vary with the yeast species to be 
transformed. See e.g., [Kurtz et al (1986) Mol Cc 1} Biol 6: 142; Kunze et al (1985) 7. Basic 
Microbiol 25:141; Candida]; [Gleeson et al (1986)7 Gen. Microbiol 732:3459; 
Roggenkamp et al. (1986) Mol Gen. Genet. 202:302; Hansenula]; [Das et al (1984) 7 
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Bactehoi 755: 1 1 65; De Louvencourt et al (1983) J. Bactenoi 154:\ 165; Van den Berg et al. 
(1990) Bio/Technology 5:135; Kluyveromyces]; [Cregg et al (1985) Mol Cell Biol 5:3376; 
Kunze et al. (1985)7. Basic Microbiol 25:141; U.S. Patent Nos. 4,837,148 and 4,929,555; 
Pichia]; [Hmnen et al (1978) Proa Natl Acad. Sci. USA 75;1929; Ito et al (1983) J. 
Bacteriol 755:163 Saccharomyces]; [Beach and Nurse (1981) Nature 300:106; 
Schizosaccharomyces]; [Davidow et al (1985) Curr. Genet. 70:39; Gaillardin et al (1985) 
Curr. Genet. 70:49; Yarrowia]. 

Definitions 

A composition containing X is "substantially free of Y when at least 85% by weight 
of the total X+Y in the composition is X. Preferably, X comprises at least about 90% by 
weight of the total of X+Y in the composition, more preferably at least about 95% or even 
99% by weight. 

A "conserved" Neisseria amino acid fragment or protein is one that is present in a 
particular Neisserial protein in at least x% of Neisseria. The value of x may be 50% or more, 
e.g., 66%, 75%o, 80%, 90%, 95% or even 100% (i.e. the amino acid is found in the protein in 
question in all Neisseria). In order to determine whether an animo acid is "conserved" in a 
particular Neisserial protein, it is necessary to compare that amino acid residue in the 
sequences of the protein in question from a plurality of different Neisseria (a reference 
population). The reference population may include a number of different Neisseria species 
or may include a single species. The reference population may include a number of different 
serogroups of a particular species or a single serogroup. A preferred reference population 
consists of the 5 most common Neisseria strains. 

The term "heterologous" refers to two biological components that are not found 
together in nature. The components may be host cells, genes, or regulatory regions, such as 
promoters. Although the heterologous components are not found together in nature, they can 
function together, as when a promoter heterologous to a gene is operably linked to the gene. 
Another example is where a Neisserial sequence is heterologous to a mouse host cell. 

"Epitope" means antigenic determinant, and may elicit a cellular and/or humoral 
response. 

Conditions for "high stringency" are 65 degrees C in 0.1 xSSC 0.5% SDS solution. 
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An "origin of replication" is a polynucleotide sequence that initiates and regulates 
replication of polynucleotides, such as an expression vector. The origin of replication behaves 
as an autonomous unit of polynucleotide replication within a cell, capable of replication under 
its own control. An origin of replication may be needed for a vector to replicate in a particular 
host cell. With certain origins of replication, an expression vector can be reproduced at a high 
copy number in the presence of the appropriate proteins within the cell. Examples of origins 
are the autonomously replicating sequences, which are effective in yeast; and the viral 
T-antigen, effective in COS-7 cells. 

A "mutant" sequence is defined as a DNA, RNA or amino acid sequence differing 
from but having homology with the native or disclosed sequence. Depending on the 
particular sequence, the degree of homology (sequence identity) between the native or 
disclosed sequence and the mutant sequence is preferably greater than 50% (e.g., 60%, 70%, 
80%, 90%, 95%, 99% or more) which is calculated as described above. As used herein, an 
"allelic variant' 1 of a nucleic acid molecule, or region, for which nucleic acid sequence is 
provided herein is a nucleic acid molecule, or region, that occurs at essentially the same locus 
in the genome of another or second isolate, and that, due to natural variation caused by, for 
example, mutation or recombination, has a similar but not identical nucleic acid sequence. A 
coding region allelic variant typically encodes a protein having similar activity to that of the 
protein encoded by the gene to which it is being compared. An allelic variant can also 
comprise an alteration in the 5* or 3' untranslated regions of the gene, such as in regulatory 
control regions, (see, for example, U.S. Patent 5,753,235). 

Antibodies 

As used herein, the term "antibody" refers to a polypeptide or group of polypeptides 
composed of at least one antibody combining site. An "antibody combining site" is the 
three-dimensional binding space with an internal surface shape and charge distribution 
complementary to the features of an epitope of an antigen, which allows a binding of the 
antibody with the antigen. "Antibody" includes, for example, vertebrate antibodies, hybrid 
antibodies, chimeric antibodies, humanized antibodies, altered antibodies, univalent 
antibodies, Fab proteins, and single domain antibodies. 

Antibodies against the proteins of the invention are useful for affinity 
chromatography, immunoassays, and distinguishing/identifying Neisseria menB proteins. 
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Antibodies elicited against the proteins of the present invention bind to antigenic polypeptides 
or proteins or protein fragments that are present and specifically associated with strains of 
Neisseria meningitidis menB. In some instances, these antigens may be associated with 
specific strains, such as those antigens specific for the menB strains. The antibodies of the 
invention may be immobilized to a matrix and utilized in an immunoassay or on an affinity 
chromatography column, to enable the detection and/or separation of polypeptides, proteins or 
protein fragments or cells comprising such polypeptides, proteins or protein fragments. 
Alternatively, such polypeptides, proteins or protein fragments may be immobilized so as to 
detect antibodies bindably specific thereto. 

Antibodies to the proteins of the invention, both polyclonal and monoclonal, may be 
prepared by conventional methods. In general, the protein is first used to immunize a suitable 
animal, preferably a mouse, rat, rabbit or goat. Rabbits and goats are preferred for the 
preparation of polyclonal sera due to the volume of serum obtainable, and the availability of 
labeled anti-rabbit and anti-goat antibodies. Immunization is generally performed by mixing 
or emulsifying the protein in saline, preferably in an adjuvant such as Freund's complete 
adjuvant, and injecting the mixture or emulsion parenterally (generally subcutaneously or 
intramuscularly). A dose of 50-200 ^g/injection is typically sufficient. Immunization is 
generally boosted 2-6 weeks later with one or more injections of the protein in saline, 
preferably using Freund's incomplete adjuvant. One may alternatively generate antibodies by 
in vitro immunization using methods known in the art, which for the purposes of this 
invention is considered equivalent to in vivo immunization. Polyclonal antisera is obtained by 
bleeding the immunized animal into a glass or plastic container, incubating the blood at 25°C 
for one hour, followed by incubating at 4°C for 2-18 hours. The serum is recovered by 
centrifugation (e.g., l,000g for 10 minutes). About 20-50 ml per bleed may be obtained from 
rabbits. 

Monoclonal antibodies are prepared using the standard method of Kohler & Milstein 
(Nature (1975) 256:495-96), or a modification thereof. Typically, a mouse or rat is 
immunized as described above. However, rather than bleeding the animal to extract serum, 
the spleen (and optionally several large lymph nodes) is removed and dissociated into single 
cells. If desired, the spleen cells may be screened (after removal of nonspecifically adherent 
cells) by applying a cell suspension to a plate or well coated with the protein antigen. B-cells 
that express membrane-bound immunoglobulin specific for the antigen bind to the plate, and 
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are not rinsed away with the rest of the suspension. Resulting B-cells, or all dissociated spleen 
cells, are then induced to fuse with myeloma cells to form hybridomas, and are cultured in a 
selective medium (e.g., hypoxanthine, aminopterin, thymidine medium, "HAT"). The 
resulting hybridomas are plated by limiting dilution, and are assayed for the production of 
antibodies which bind specifically to the immunizing antigen (and which do not bind to 
unrelated antigens). The selected MAb-secreting hybridomas are then cultured either in vitro 
(e.g., in tissue culture bottles or hollow fiber reactors), or in vivo (as ascites in mice). 

If desired, the antibodies (whether polyclonal or monoclonal) may be labeled using 
conventional techniques. Suitable labels include fluorophores, chromophores, radioactive 
atoms (particularly 32 P and 125 I), electron-dense reagents, enzymes, and ligands having 
specific binding partners. Enzymes are typically detected by their activity. For example, 
horseradish peroxidase is usually detected by its ability to convert 
3,3',5,5'-tetramethylbenzidine (TMB) to a blue pigment, quantifiable with a 
spectrophotometer. "Specific binding partner" refers to a protein capable of binding a ligand 
molecule with high specificity, as for example in the case of an antigen and a monoclonal 
antibody specific therefor. Other specific binding partners include biotin and avidin or 
streptavidin, IgG and protein A, and the numerous receptor-ligand couples known in the art. It 
should be understood that the above description is not meant to categorize the various labels 
into distinct classes, as the same label may serve in several different modes. For example, 12;i l 
may serve as a radioactive label or as an electron-dense reagent. HRP may serve as enzyme or 
as antigen for a MAb. Further, one may combine various labels for desired effect. For 
example, MAbs and avidin also require labels in the practice of this invention: thus, one 
might label a MAb with biotin, and detect its presence with avidin labeled with !25 I, or with 
an anti-biotin MAb labeled with HRP. Other permutations and possibilities will be readily 
apparent to those of ordinary skill in the art, and are considered as equivalents within the 
scope of the instant invention. 

Antigens, immunogens, polypeptides, proteins or protein fragments of the present 
invention elicit formation of specific binding partner antibodies. These antigens, 
immunogens, polypeptides, proteins or protein fragments of the present invention comprise 
immunogenic compositions of the present invention. Such immunogenic compositions may 
further comprise or include adjuvants, carriers, or other compositions that promote or enhance 
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or stabilize the antigens, polypeptides, proteins or protein fragments of the present invention. 
Such adjuvants and carriers will be readily apparent to those of ordinary skill in the art. 

Pharmaceutical Compositions 

Pharmaceutical compositions can comprise (include) either polypeptides, antibodies, 
or nucleic acid of the invention. The pharmaceutical compositions will comprise a 
therapeutically effective amount of either polypeptides, antibodies, or polynucleotides of the 
claimed invention. 

The term "therapeutically effective amount" as used herein refers to an amount of a 
therapeutic agent to treat, ameliorate, or prevent a desired disease or condition, or to exhibit a 
detectable therapeutic or preventative effect. The effect can be detected by, for example, 
chemical markers or antigen levels. Therapeutic effects also include reduction in physical 
symptoms, such as decreased body temperature, when given to a patient that is febrile. The 
precise effective amount for a subject will depend upon the subject's size and health, the 
nature and extent of the condition, and the therapeutics or combination of therapeutics 
selected for administration. Thus, it is not useful to specify an exact effective amount in 
advance. However, the effective amount for a given situation can be determined by routine 
experimentation and is within the judgment of the clinician. 

For purposes of the present invention, an effective dose will be from about 0.01 mg/ 
kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg of the DNA constructs in the individual to 
which it is administered. 

A pharmaceutical composition can also contain a pharmaceutically acceptable carrier. 
The term "pharmaceutically acceptable carrier" refers to a carrier for administration of a 
therapeutic agent, such as antibodies or a polypeptide, genes, and other therapeutic agents. 
The term refers to any pharmaceutical carrier that does not itself induce the production of 
antibodies harmful to the individual receiving the composition, and which may be 
administered without undue toxicity. Suitable carriers may be large, slowly metabolized 
macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, 
polymeric amino acids, amino acid copolymers, and inactive virus particles. Such carriers are 
well known to those of ordinary skill in the art. 

Pharmaceutically acceptable salts can be used therein, for example, mineral acid salts 
such as hydrochlorides, hydrobromides, phosphates, sulfates, and the like; and the salts of 
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organic acids such as acetates, propionates, malonates, benzoates, and the like. A thorough 
discussion of pharmaceutical^ acceptable excipients is available in Remington's 
Pharmaceutical Sciences (Mack Pub. Co., N.J. 1991). 

Pharmaceutical ly acceptable carriers in therapeutic compositions may contain liquids 
such as water, saline, glycerol and ethanol. Additionally, auxiliary substances, such as wetting 
or emulsifying agents, pH buffering substances, and the like, may be present in such vehicles. 
Typically, the therapeutic compositions are prepared as injectables, either as liquid solutions 
or suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles prior to 
injection may also be prepared. Liposomes are included within the definition of a 
pharmaceutical^ acceptable carrier. 

Delivery Methods 

Once formulated, the compositions of the invention can be administered directly to the 
subject. The subjects to be treated can be animals; in particular, human subjects can be 
treated. 

Direct delivery of the compositions will generally be accomplished by injection, either 
subcutaneously, intraperitoneally, intravenously or intramuscularly or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a lesion. Other 
modes of administration include oral and pulmonary administration, suppositories, and 
transdermal and transcutaneous applications, needles, and gene guns or hyposprays. Dosage 
treatment may be a single dose schedule or a multiple dose schedule. 

Vaccines 

Vaccines according to the invention may either be prophylactic (i.e., to prevent 
infection) or therapeutic (i.e., to treat disease after infection). 

Such vaccines comprise immunizing antigen(s) or immunogen(s), immunogenic 
polypeptide, protein(s) or protein fragments, or nucleic acids (e.g., ribonucleic acid or 
deoxyribonucleic acid), usually in combination with "pharmaceutically acceptable carriers," 
which include any carrier that does not itself induce the production of antibodies harmful to 
the individual receiving the composition. Suitable carriers are typically large, slowly 
metabolized macromolecules such as proteins, polysaccharides, polylactic acids, polyglycolic 
acids, polymeric amino acids, amino acid copolymers, lipid aggregates (such as oil droplets or 
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liposomes), and inactive virus particles. Such carriers are well known to those of ordinary 
skill in the art. Additionally, these carriers may function as immunostimulating agents 
("adjuvants"). Furthermore, the immunogen or antigen may be conjugated to a bacterial 
toxoid, such as a toxoid from diphtheria, tetanus, cholera, H. pylori, etc. pathogens. 

Preferred adjuvants to enhance effectiveness of the composition include, but are not 
limited to: (1) aluminum salts (alum), such as aluminum hydroxide, aluminum phosphate, 
aluminum sulfate, etc; (2) oil-in-water emulsion formulations (with or without other specific 
immunostimulating agents such as muramyl peptides (see below) or bacterial cell wall 
components), such as for example (a) MF59 (PCT PubL No. WO 90/14837), containing 5% 
Squalene, 0.5% Tween 80, and 0.5% Span 85 (optionally containing various amounts of 
MTP-PE (see below), although not required) formulated into submicron particles using a 
micro fluidizer such as Model 1 10Y micro fluidizer (Microfluidics, Newton, MA), (b) SAF, 
containing 10% Squalane, 0.4% Tween 80, 5% pluronic-blocked polymer L121, and thr-MDP 
(see below) either microfluidized into a submicron emulsion or vortexed to generate a larger 
particle size emulsion, and (c) Ribi™ adjuvant system (RAS), (Ribi Immunochem, Hamilton, 
MT) containing 2% Squalene, 0.2% Tween 80, and one or more bacterial cell wall 
components from the group consisting of monophosphorylipid A (MPL), trehalose 
dimycolate (TDM), and cell wall skeleton (CWS), preferably MPL + CWS (Detox™); 

(3) saponin adjuvants, such as Stimulon™ (Cambridge Bioscience, Worcester, MA) may be 
used or particles generated therefrom such as ISCOMs (immunostimulating complexes); 

(4) Complete Freund's Adjuvant (CFA) and Incomplete Freund's Adjuvant (IF A); 

(5) cytokines, such as interleukins (e.g., IL-1, IL-2, IL-4, IL-5, IL-6, IL-7, IL-12, etc.), 
interferons (e.g., gamma interferon), macrophage colony stimulating factor (M-CSF), tumor 
necrosis factor (TNF), etc; (6) detoxified mutants of a bacterial ADP-ribosylating toxin such 
as a cholera toxin (CT), a pertussis toxin (PT), or an E. coli heat-labile toxin (LT), particularly 
LT-K63, LT-R72, CT-S109, PT-K9/G129; see, e.g., WO 93/13302 and WO 92/19265; and 
(7) other substances that act as immunostimulating agents to enhance the effectiveness of the 
composition. Alum and MF59 are preferred. 

As mentioned above, muramyl peptides include, but are not limited to, N-acetyl- 
muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-normuramyl-L-alanyl-D- 
isoglutamine (nor-MDP), N-acetylmuramyl-L-alanyl-D-isoglutammyl-L-alanine-2-( 1 '-2*- 
dipalmitoyl-^-glycero-3-huydroxyphosphoryloxy)-ethylamine (MTP-PE), etc. 
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The vaccine compositions comprising immunogenic compositions (e.g., which may 
include the antigen, pharmaceutically acceptable carrier, and adjuvant) typically will contain 
diluents, such as water, saline, glycerol, ethanol, etc. Additionally, auxiliary substances, such 
as wetting or emulsifying agents, pH buffering substances, and the like, may be present in 
such vehicles. Alternatively, vaccine compositions comprising immunogenic compositions 
may comprise an antigen, polypeptide, protein, protein fragment or nucleic acid in a 
pharmaceutically acceptable carrier. 

More specifically, vaccines comprising immunogenic compositions comprise an 
immunologically effective amount of the immunogenic polypeptides, as well as any other of 
the above-mentioned components, as needed. By "immunologically effective amount", it is 
meant that the administration of that amount to an individual, either in a single dose or as part 
of a series, is effective for treatment or prevention. This amount varies depending upon the 
health and physical condition of the individual to be treated, the taxonomic group of 
individual to be treated (e.g., nonhuman primate, primate, etc.), the capacity of the 
individual's immune system to synthesize antibodies, the degree of protection desired, the 
formulation of the vaccine, the treating doctor's assessment of the medical situation, and other 
relevant factors. It is expected that the amount will fall in a relatively broad range that can be 
determined through routine trials. 

Typically, the vaccine compositions or immunogenic compositions are prepared as 
injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or 
suspension in, liquid vehicles prior to injection may also be prepared. The preparation also 
may be emulsified or encapsulated in liposomes for enhanced adjuvant effect, as discussed 
above under pharmaceutically acceptable carriers. 

The immunogenic compositions are conventionally administered parenterally, e.g., by 
injection, either subcutaneously or intramuscularly. Additional formulations suitable for other 
modes of administration include oral and pulmonary formulations, suppositories, and 
transdermal and transcutaneous applications. Dosage treatment may be a single dose schedule 
or a multiple dose schedule. The vaccine may be administered in conjunction with other 
immunoregulatory agents. 

As an alternative to protein-based vaccines, DNA vaccination may be employed (e.g., 
Robinson & Torres (1997) Seminars in Immunology 9:271-283; Donnelly et al (1997) Annu 
Rev Immunol 15:617-648). 
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Gene Delivery Vehicles 

Gene therapy vehicles for delivery of constructs, including a coding sequence of a 
therapeutic of the invention, to be delivered to the mammal for expression in the mammal can 
be administered either locally or systemically. These constructs can utilize viral or non-viral 
vector approaches in in vivo or ex vivo modality. Expression of such coding sequence can be 
induced using endogenous mammalian or heterologous promoters. Expression of the coding 
sequence in vivo can be either constitutive or regulated. 

The invention includes gene delivery vehicles capable of expressing the contemplated 
nucleic acid sequences. The gene delivery vehicle is preferably a viral vector and, more 
preferably, a retroviral, adenoviral, adeno-associated viral (AAV), herpes viral, or alphavirus 
vector. The viral vector can also be an astrovirus, coronavirus, orthomyxovirus, papovavirus, 
paramyxovirus, parvovirus, picomavirus, poxvirus, or togavirus viral vector. See generally, 
Jolly (1994) Cancer Gene Therapy 1:51-64; Kimura (1994) Human Gene Therapy 5:845-852; 
Connelly (1995) Human Gene Therapy 6:185-193; and Kaplitt (1994) Nature Genetics 
6:148-153. 

Retroviral vectors are well known in the art, including B, C and D type retroviruses, 
xenotropic retroviruses (for example, NZB-X1, NZB-X2 and NZB9-1 (see O'Neill (1985) J. 
Virol 53:160) polytropic retroviruses e.g., MCF and MCF-MLV (see Kelly (1983) J. Virol 
45:291), spumaviruses and lentiviruses. See RNA Tumor Viruses, Second Edition, Cold 
Spring Harbor Laboratory, 1985. 

Portions of the retroviral gene therapy vector may be derived from different 
retroviruses. For example, retrovector LTRs may be derived from a Murine Sarcoma Virus, a 
tRNA binding site from a Rous Sarcoma Virus, a packaging signal from a Murine Leukemia 
Virus, and an origin of second strand synthesis from an Avian Leukosis Virus. 

These recombinant retroviral vectors may be used to generate transduction competent 
retroviral vector particles by introducing them into appropriate packaging cell lines (see US 
patent 5,591,624). Retrovirus vectors can be constructed for site-specific integration into host 
cell DNA by incorporation of a chimeric integrase enzyme into the retroviral particle (see 
W096/37626). It is preferable that the recombinant viral vector is a replication defective 
recombinant virus. 
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Packaging cell lines suitable for use with the above-described retrovirus vectors are 
well known in the art, are readily prepared (see WO95/30763 and WO92/05266), and can be 
used to create producer cell lines (also termed vector cell lines or "VCLs") for the production 
of recombinant vector particles. Preferably, the packaging cell lines are made from human 
parent cells (e.g., HT1080 cells) or mink parent cell lines, which eliminates inactivation in 
human serum. 

Preferred retroviruses for the construction of retroviral gene therapy vectors include 
Avian Leukosis Virus, Bovine Leukemia, Virus, Murine Leukemia Virus, Mink-Cell 
Focus-Inducing Virus, Murine Sarcoma Virus, Reticuloendotheliosis Virus and Rous 
Sarcoma Virus. Particularly preferred Murine Leukemia Viruses include 4070A and 1504A 
(Hartley and Rowe (1976) J Virol 19:19-25), Abelson (ATCC No. VR-999), Friend (ATCC 
No. VR-245), Graffi, Gross (ATCC Nol VR-590), Kirsten, Harvey Sarcoma Virus and 
Rauscher (ATCC No. VR-998) and Moloney Murine Leukemia Virus (ATCC No. VR-190). 
Such retroviruses may be obtained from depositories or collections such as the American 
Type Culture Collection ("ATCC") in Rockville, Maryland or isolated from known sources 
using commonly available techniques. 

Exemplary known retroviral gene therapy vectors employable in this invention include 
those described in patent applications GB2200651, EP0415731, EP0345242, EP0334301, 
WO89/02468; WO89/05349, WO89/09271, WO90/02806, WO90/07936, WO94/03622, 
W093/25698, W093/25234, W093/1 1230, WO93/10218, WO91/02805, WO91/02825, 
WO95/07994, US 5,219,740, US 4,405,712, US 4,861,719, US 4,980,289, US 4,777,127, US 
5,591,624. See also Vile (1993) Cancer Res 53:3860-3864; Vile (1993) Cancer Res 
53:962-967; Ram (1993) Cancer Res 53 (1993) 83-88; Takamiya (1992) J Neurosci Res 
33:493-503; Baba (1993) JNeurosurg 79:729-735; Mann (1983) Cell 33:153; Cane (1984) 
Proc Natl Acad Sci 81:6349; and Miller (1990) Human Gene Therapy L 

Human adenoviral gene therapy vectors are also known in the art and employable in 
this invention. See, for example, Berkner (1988) Biotechniques 6:616 and Rosenfeld (1991) 
Science 252:431, and WO93/07283, WO93/06223, and WO93/07282. Exemplary known 
adenoviral gene therapy vectors employable in this invention include those described in the 
above referenced documents and in W094/12649, WO93/03769, W093/19191, 
W094/28938, W095/11984, WO95/00655, WO95/27071, W095/29993, W095/34671, 
WO96/05320, WO94/08026, W094/1 1506, WO93/06223, W094/24299, WO95/14102, 
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W095/24297, WO95/02697, W094/2S152, W094/24299, WO95/09241, WO95/25807, 
WO95/05S35, W094/18922 and WO95/09654. Alternatively, administration of DNA linked 
to killed adenovirus as described in Curiel (1992) Hum. Gene Then 3:147-154 may be 
employed. The gene delivery vehicles of the invention also include adenovirus associated 
virus (AAV) vectors. Leading and preferred examples of such vectors for use in this invention 
are the AAV-2 based vectors disclosed in Srivastava, WO93/09239. Most preferred AAV 
vectors comprise the two AAV inverted terminal repeats in which the native D-sequences are 
modified by substitution of nucleotides, such that at least 5 native nucleotides and up to 18 
native nucleotides, preferably at least 10 native nucleotides up to 18 native nucleotides, most 
preferably 10 native nucleotides are retained and the remaining nucleotides of the D-sequence 
are deleted or replaced with non-native nucleotides. The native D-sequences of the AAV 
inverted terminal repeats are sequences of 20 consecutive nucleotides in each AAV inverted 
terminal repeat (i.e., there is one sequence at each end) which are not involved in HP 
formation. The non-native replacement nucleotide may be any nucleotide other than the 
nucleotide found in the native D-sequence in the same position. Other employable exemplary 
AAV vectors are pWP-19, pWN-1, both of which are disclosed in Nahreini (1993) Gene 
124:257-262. Another example of such an AAV vector is psub201 (see Samulski (1987)7. 
Virol 61:3096). Another exemplary AAV vector is the Double-D ITR vector. Construction of 
the Double-D ITR vector is disclosed in US Patent 5,478,745. Still other vectors are those 
disclosed in Carter US Patent 4,797,368 and Muzyczka US Patent 5,139,941, Chartejee US 
Patent 5,474,935, and Kotin W094/288157. Yet a further example of an AAV vector 
employable in this invention is SSV9AFABTKneo, which contains the AFP enhancer and 
albumin promoter and directs expression predominantly in the liver. Its structure and 
construction are disclosed in Su (1996) Human Gene Therapy 7:463-470. Additional AAV 
gene therapy vectors are described in US 5,354,678, US 5,173,414, US 5,139,941, and US 
5,252,479. 

The gene therapy vectors comprising sequences of the invention also include herpes 
vectors. Leading and preferred examples are herpes simplex virus vectors containing a 
sequence encoding a thymidine kinase polypeptide such as those disclosed in US 5,288,641 
and EP0176170 (Roizman). Additional exemplary herpes simplex virus vectors include 
HFEM/ICP6-LacZ disclosed in WO95/04139 (Wistar Institute), pHSVlac described in Geller 
(1988) Science 241:1667-1669 and in WO90/09441 and WO92/07945, HSV Us3::pgC-lacZ 
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described in Fink (1992) Human Gene Therapy 3:1 1-19 and HSV 7134, 2 RH 105 and GAL4 
described in EP 0453242 (Breakefield), and those deposited with the ATCC as accession 
numbers ATCC VR-977 and ATCC VR-260. 

Also contemplated are alpha virus gene therapy vectors that can be employed in this 
invention. Preferred alpha virus vectors are Sindbis viruses vectors. Togaviruses, Semliki 
Forest virus (ATCC VR-67; ATCC VR-1 247), Middleberg virus (ATCC VR-370), Ross 
River virus (ATCC VR-373; ATCC VR-1246), Venezuelan equine encephalitis virus (ATCC 
VR923; ATCC VR-1250; ATCC VR-1249; ATCC VR-532), and those described in US 
patents 5,091,309, 5,217,879, and WO92/10578. More particularly, those alpha virus vectors 
described in U.S. Serial No. 08/405,627, filed March 15, 1995,W094/21792, WO92/10578, 
WO95/07994, US 5,091,309 and U£ 5,217,879 are employable. Such alpha viruses may be 
obtained from depositories or collections such as the ATCC in Rockville, Maryland or 
isolated from known sources using commonly available techniques. Preferably, alphavirus 
vectors with reduced cytotoxicity are used (see USSN 08/679640). 

DNA vector systems such as eukarytic layered expression systems are also useful for 
expressing the nucleic acids of the invention. SeeWO95/07994 for a detailed description of 
eukaryotic layered expression systems. Preferably, the eukaryotic layered expression systems 
of the invention are derived from alphavirus vectors and most preferably from Sindbis viral 
vectors. 

Other viral vectors suitable for use in the present invention include those derived from 
poliovirus, for example ATCC VR-58 and those described in Evans, Nature 339 (1989) 385 
and Sabin (1973) J. Biol Standardization 1:115; rhinovirus, for example ATCC VR-1 1 10 and 
those described in Arnold (1990) J Cell Bicchem L401; pox viruses such as canary pox virus 
or vaccinia virus, for example ATCC VR-1 1 1 and ATCC VR-2010 and those described in 
Fisher-Hoch(1989) Proc Natl Acad Sci 86:317; Flexner (1989) Ann NY Acad Sci 569:86, 
Flexner (1990) Vaccine 8:17; in US 4,603,112 and US 4,769,330 and WO89/01973; SV40 
virus, for example ATCC VR-305 and those described in Mulligan (1979) Nature 277:108 
and Madzak (1992) J Gen Virol 73:1533; influenza virus, for example ATCC VR-797 and 
recombinant influenza viruses made employing reverse genetics techniques as described in 
US 5,166,057 and in Enami (1990) Proc Natl Acad Sci 87:3802-3805; Enami & Palese (1991) 
J Virol 65:271 1-2713 and Luytjes (1989) Cell 59:1 10, (see also McMichael (1983) NEJ Med 
309:13, and Yap (1978) Nature 273:238 and Nature (1979) 277:108); human 
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immunodeficiency virus as described in EP-03868S2 and in Buchschacher (1992) J. Virol 
66:2731; measles virus, for example ATCC VR-67 and VR-1247 and those described in EP- 
0440219; Aura virus, for example ATCC VR-368; Bebaru virus, for example ATCC VR-600 
and ATCC VR-1240; Cabassou virus, for example ATCC VR-922; Chikungunya virus, for 
example ATCC VR-64 and ATCC VR-1241 ; Fort Morgan Virus, for example ATCC 
VR-924; Getah virus, for example ATCC VR-369 and ATCC VR-1243; Kyzylagach virus, 
for example ATCC VR-927; Mayaro virus, for example ATCC VR-66; Mucambo virus, for 
example ATCC VR-580 and ATCC VR-1244; Ndumu virus, for example ATCC VR-371; 
Pixuna virus, for example ATCC VR-372 and ATCC VR-1245; Tonate virus, for example 
ATCC VR-925; Triniti virus, for example ATCC VR-469; Una virus, for example ATCC 
VR-374; Whataroa virus, for example ATCC VR-926; Y-62-33 virus, for example ATCC 
VR-375: O'Nyong virus, Eastern encephalitis virus, for example ATCC VR-65 and ATCC 
VR-1242; Western encephalitis virus, for example ATCC VR-70, ATCC VR-1251, ATCC 
VR-622 and ATCC VR-1252; and coronavirus, for example ATCC VR-740 and those 
described in Hamre (1 966) Proc Soc Exp Biol Med 1 2 1 : 1 90. 

Delivery of the compositions of this invention into cells is not limited to the above 
mentioned viral vectors. Other delivery methods and media may be employed such as, for 
example, nucleic acid expression vectors, polycationic condensed DNA linked or unlinked to 
killed adenovirus alone, for example see US Serial No. 08/366,787, filed December 30, 1994 
and Curiel (1992) Hum Gene Ther 3:147-154 ligand linked DNA, for example see Wu (1989) 
j Biol Chem 264:16985-16987, eucaryotic cell delivery vehicles cells, for example see US 
Serial No.08/240,030, filed May 9, 1994, and US Serial No. 08/404,796, deposition of 
photopolymerized hydrogel materials, hand-held gene transfer particle gun, as described in 
US Patent 5,149,655, ionizing radiation as described in US5, 206,152 and in W092/1 1033, 
nucleic charge neutralization or fusion with cell membranes. Additional approaches are 
described in Philip (1994) Mol Cell Biol 14:2411-2418 and in Woffendin (1994) Proc Natl 
Acad Sci 91:1581-1585. 

Particle mediated gene transfer may be employed, for example see US Serial No. 
60/023,867. Briefly, the sequence can be inserted into conventional vectors that contain 
conventional control sequences for high level expression, and then incubated with synthetic 
gene transfer molecules such as polymeric DNA-binding cations like polylysine, protamine, 
and albumin, linked to cell targeting ligands such as asialoorosomucoid, as described in Wu & 
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Wu (1987)7. Biol. Chem. 262:4429-4432, insulin as descnbed in Hucked (1990) Biochem 
Pharmacol 40:253-263, galactose as described in Plank (1992) Bioconjugate Chem 
3:533-539, lactose or transferrin. 

Naked DNA may also be employed to transform a host cell. Exemplary naked DNA 
introduction methods are described in WO 90/1 1092 and US 5,580,859. Uptake efficiency 
may be improved using biodegradable latex beads. DNA coated latex beads are efficiently 
transported into cells after endocytosis initiation by the beads. The method may be improved 
further by treatment of the beads to increase hydrophobicity and thereby facilitate disruption 
of the endosome and release of the DNA into the cytoplasm. 

Liposomes that can act as gene delivery vehicles are described in U.S. 5,422,120, 
W095/13796, W094/23697, W091/14445 and EP-524,968. As described in USSN. 
60/023,867, on non-viral delivery, the nucleic acid sequences encoding a polypeptide can be 
inserted into conventional vectors that contain conventional control sequences for high level 
expression, and then be incubated with synthetic gene transfer molecules such as polymeric 
DNA-binding cations like polylysine, protamine, and albumin, linked to cell targeting ligands 
such as asialoorosomucoid, insulin, galactose, lactose, or transferrin. Other delivery systems 
include the use of liposomes to encapsulate DNA comprising the gene under the control of a 
variety of tissue-specific or ubiquitously-active promoters. Further non- viral delivery suitable 
for use includes mechanical delivery systems such as the approach described in Woffendin et 
al (1994) Proc. Nail. Acad, ScL USA 91(24):1 1581-1 1585. Moreover, the coding sequence 
and the product of expression of such can be delivered through deposition of 
photopolymerized hydrogel materials. Other conventional methods for gene delivery that can 
be used for delivery of the coding sequence include, for example, use of hand-held gene 
transfer particle gun, as described in U.S. 5,149,655; use of ionizing radiation for activating 
transferred gene, as described in U.S. 5,206,152 and W092/1 1033. 

Exemplary liposome and polycationic gene delivery vehicles are those described in 
US 5,422,120 and 4,762,915; in WO 95/13796; W094/23697; and W091/14445; in EP- 
0524968; and in Stryer, Biochemistry, pages 236-240 (1975) W.H. Freeman, San Francisco; 
Szoka (1980) Biochem Biophys Acta 600:1; Bayer (1979) Biochem Biophys Acta 550:464; 
Rivnay (1987) Meth Enzymol 149:1 19; Wang (1987) Proc Natl Acad Sci 84:7851 ; Plant 
(\9S9) Anal Biochem 176:420. 
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A polynucleotide composition can comprises therapeutically effective amount of a 
gene therapy vehicle, as the term is defined above. For purposes of the present invention, an 
effective dose will be from about 0.01 mg/ kg to 50 mg/kg or 0.05 mg/kg to about 10 mg/kg 
of the DNA constructs in the individual to which it is administered. 

Delivery Methods 

Once formulated, the polynucleotide compositions of the invention can be 
administered (1) directly to the subject; (2) delivered ex vivo, to cells derived from the 
subject; or (3) in vitro for expression of recombinant proteins. The subjects to be treated can 
be mammals or birds. Also, human subjects can be treated. 

Direct delivery of the compositions will generally be accomplished by injection, either 
subcutaneously, intraperitoneal ly, intravenously or intramuscularly or delivered to the 
interstitial space of a tissue. The compositions can also be administered into a tumor or lesion. 
Other modes of administration include oral and pulmonary administration, suppositories, and 
transdermal applications, needles, and gene guns or hyposprays. Dosage treatment may be a 
single dose schedule or a multiple dose schedule. 

Methods for the ex vivo delivery and reimplantation of transformed cells into a subject 
are known in the art and described in eg. W093/14778. Examples of cells useful in ex vivo 
applications include, for example, stem cells, particularly hematopoetic, lymph cells, 
macrophages, dendritic cells, or tumor cells. 

Generally, delivery of nucleic acids for both ex vivo and in vitro applications can be 
accomplished by the following procedures, for example, dextran-mediated transfection, 
calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion, 
electroporation, encapsulation of the polynucleotide(s) in liposomes, and direct microinjection 
of the DNA into nuclei, all well known in the art. 

Polynucleotide and polypeptide pharmaceutical compositions 

In addition to the pharmaceutically acceptable carriers and salts described above, the 
following additional agents can be used with polynucleotide and/or polypeptide compositions. 
A.Polypeptides 

One example are polypeptides which include, without limitation: asioloorosomucoid 
(ASOR); transferrin; asialoglycoproteins; antibodies; antibody fragments; ferritin; 
interleukins; interferons, granulocyte, macrophage colony stimulating factor (GM-CSF), 
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granulocyte colony stimulating factor (G-CSF), macrophage colony stimulating factor 
(M-CSF), stem cell factor and erythropoietin. Viral antigens, such as envelope proteins, can 
also be used. Also, proteins from other invasive organisms, such as the 17 amino acid peptide 
from the circumsporozoite protein of Plasmodium falciparum known as RII. 

B. Hormones, Vitamins, Etc. 

Other groups that can be included are, for example: hormones, steroids, androgens, 
estrogens, thyroid hormone, or vitamins, folic acid. 

C. Polyalkylenes, Polysaccharides, etc. 

Also, polyalkylene glycol can be included with the desired polynucleotides or 
polypeptides. In a preferred embodiment, the polyalkylene glycol is polyethlylene glycol. In 
addition, mono-, di-, or polysaccariues can be included. In a preferred embodiment of this 
aspect, the polysaccharide is dextran or DEAE-dextran. Also, chitosan and 
poly(lactide-co-glycolide) 

D. Lipids, and Liposomes 

The desired polynucleotide or polypeptide can also be encapsulated in lipids or 
packaged in liposomes prior to delivery to the subject or to cells derived therefrom. 

Lipid encapsulation is generally accomplished using liposomes which are able to 
stably bind or entrap and retain nucleic acid. The ratio of condensed polynucleotide or 
polypeptide to lipid preparation can vary but will generally be around 1:1 (mg 
DNA:micromoles lipid), or more of lipid. For a review of the use of liposomes as carriers for 
delivery of nucleic acids, see, Hug and Sleight (1991) Biochim. Biophys. Acta. 1097:1-17; 
Straubinger(1983) Afer/i. EnzymoL 101:512-527. 

Liposomal preparations for use in the present invention include cationic (positively 
charged), anionic (negatively charged) and neutral preparations. Cationic liposomes have 
been shown to mediate intracellular delivery of plasmid DNA (Feigner (1987) Proc. Natl. 
Acad. Sci. USA 84:7413-7416); mRNA (Malone (1989) Proc. Natl. Acad. ScL USA 
86:6077-6081); and purified transcription factors (Debs (1990) J. Biol. Chem. 
265:10189-10192), in functional form. 

Cationic liposomes are readily available. For example, 
N[l-2,3-dioleyloxy)propyl]-N,N,N-triethylamn^nium (DOTMA) liposomes are available 
under the trademark Lipofectin, from GIBCO BRL, Grand Island, NY. (See, also, Feigner 
supra). Other commercially available liposomes include transfectace (DDAB/DOPE) and 
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DOTAP/DOPE (Boerhinger). Other cationic liposomes can be prepared from readily available 
materials using techniques well known in the art. See, eg. Szoka (1978) Proc. Natl. Acad. Sci. 
USA 75:4194-4198; WO90/11092 for a description of the synthesis of DOTAP 
(l,2-bis(oleoyloxy)-3-(trimethylammonio)propane) liposomes. 

Similarly, anionic and neutral liposomes are readily available, such as from Avanti 
Polar Lipids (Birmingham, AL), or can be easily prepared using readily available materials. 
Such materials include phosphatidyl choline, cholesterol, phosphatidyl ethanolamine, 
dioleoylphosphatidyl choline (DOPC), dioleoylphosphatidyl glycerol (DOPG), 
dioleoylphoshatidyl ethanolamine (DOPE; : among others. These materials can also be mixed 
with the DOTMA and DOTAP starting materials in appropriate ratios. Methods for making 
liposomes using these materials are well known in the art. 

The liposomes can comprise multilammeiar vesicles (MLVs), small unilamellar 
vesicles (SUVs), or large unilamellar vesicles (LUVs). The various liposome-nucleic acid 
complexes are prepared using methods known in the art. See eg. Straubinger (1983) Meth. 
Immunol 101:512-527; Szoka (1978) Proc. Natl Acad. Sci. USA 75:4194-4198; 
Papahadjopoulos (1975) Biochim. Biophys. Acta 394:483; Wilson (1979) Cell 17:77); Deamer 
& Bangham (1976) Biochim. Biophys. Acta 443:629; Ostro (1977) Biochem. Biophys. Res. 
Commun. 76:836; Fraley (1979) Proc. Natl Acad. Sci USA 76:3348); Enoch & Strittmatter 
(1979) Proc. Natl. Acad Sci. USA 76:145; Fraley (1980)7. Biol Chem. (1980) 255:10431; 
Szoka & Papahadjopoulos (1978) Proc. Natl. Acad, Sci USA 75:145; and Schaefer-Ridder 
(1982) Science 215:166. 
E.Lipoproteins 

In addition, lipoproteins can be included with the polynucleotide or polypeptide to be 
delivered. Examples of lipoproteins to be utilized include: chylomicrons, HDL, IDL, LDL, 
and VLDL. Mutants, fragments, or fusions of these proteins can also be used. Also, 
modifications of naturally occurring lipoproteins can be used, such as acetylated LDL. These 
lipoproteins can target the delivery of polynucleotides to cells expressing lipoprotein 
receptors. Preferably, if lipoproteins are including with the polynucleotide to be delivered, no 
other targeting ligand is included in the composition. 

Naturally occurring lipoproteins compose a lipid and a protein portion. The protein 
portion are known as apoproteins. At the present, apoproteins A, B, C, D, and E have been 
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isolated and identified. At least two of these contain several proteins, designated by Roman 
numerals, Al, All, A1V; CI, CII, CIIL 

A lipoprotein can comprise more than one apoprotein. For example, naturally 
occurring chylomicrons comprises of A, B, C, and E, over time these lipoproteins lose A and 
acquire C and E apoproteins. VLDL comprises A, B, C, and E apoproteins, LDL comprises 
apoprotein B; and HDL comprises apoproteins A, C, and E. 

The amino acid of these apoproteins are known and are described in, for example, 
Breslow(1985)^/?m/7?f?v. Biochem 54:699; Law (1986) Adv. Exp Med. Biol. 151:162; Chen 
(1 986) J Biol Chem 261 : 1291 8; Kane ( 1 980) Proc Natl Acad Sci USA 77:2465; and Utermann 
( 1 984) Hum Genet 65:232. 

Lipoproteins contain a variety of lipids including, triglycerides, cholesterol (free and 
esters), and phopholipids. The composition of the lipids varies in naturally occurring 
lipoproteins. For example, chylomicrons comprise mainly triglycerides. A more detailed 
description of the lipid content of naturally occurring lipoproteins can be found, for example, 
in Meth. Enzymoi 128 (1986). The composition of the lipids are chosen to aid in 
conformation of the apoprotein for receptor binding activity. The composition of lipids can 
also be chosen to facilitate hydrophobic interaction and association with the polynucleotide 
binding molecule. 

Naturally occurring lipoproteins can be isolated from serum by ultracentrifugation, for 
instance. Such methods are described in Meth. Enzymoi (supra); Pitas (1980) J. Biochem. 
255:5454-5460 and Mahey (1979) J Clin. Invest 64:743-750. 

Lipoproteins can also be produced by in vitro or recombinant methods by expression 
of the apoprotein genes in a desired host cell. See, for example, Atkinson (1986) Annu Rev 
Biophys Chem 15:403 and Radding (1958) Biochim Biophys Acta 30: 443. 

Lipoproteins can also be purchased from commercial suppliers, such as Biomedical 
Techniologies, Inc., Stoughton, Massachusetts, USA. 

Further description of lipoproteins can be found in Zuckermann et al, PCT. Appln. 
No. US97/14465. 
F.Polycationic Agents 

Polycationic agents can be included, with or without lipoprotein, in a composition 
with the desired polynucleotide or polypeptide to be delivered. 
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Polycationic agents, typically, exhibit a net positive charge at physiological relevant 
pH and are capable of neutralizing the electrical charge of nucleic acids to facilitate delivery 
to a desired location. These agents have both in vitro, ex vivo, and in vivo applications. 
Polycationic agents can be used to deliver nucleic acids to a living subject either 
intramuscularly, subcutaneously, etc. 

The following are examples of useful polypeptides as polycationic agents: polylysine, 
polyargimne, polyornithine, and protamine. Other examples include histones, protamines, 
human serum albumin, DNA binding proteins, non-histone chromosomal proteins, coat 
proteins from DNA viruses, such as (XI 74, transcriptional factors also contain domains that 
bind DNA and therefore may be useful as nucleic aid condensing agents. Briefly, 
transcriptional factors such as C/CEBP, c-jun, c-fos, AP-1, AP-2, AP-3, CPF, Prot-1, Sp-1, 
Oct-1, Oct-2, CREP, and TFIID contain basic domains that bind DNA sequences. 

Organic polycationic agents include: spermine, spermidine, and purtrescine. 

The dimensions and of the physical properties of a polycationic agent can be 
extrapolated from the list above, to construct other polypeptide polycationic agents or to 
produce synthetic polycationic agents. 

Synthetic Polycationic Agents 

Synthetic polycationic agents which are useful include, for example, DEAE-dextran, 
polybrene. Lipofectind, and lipofectAMINE □ are monomers that form polycationic 
complexes when combined with polynucleotides or polypeptides. 

Immunodiagnostic Assays 

Neisserial antigens of the invention can be used in immunoassays to detect antibody 
levels (or, conversely, anti-Neisserial antibodies can be used to detect antigen levels). 
Immunoassays based on well defined, recombinant antigens can be developed to replace 
invasive diagnostics methods. Antibodies to Neisserial proteins within biological samples, 
including for example, blood or serum samples, can be detected. Design of the immunoassays 
is subject to a great deal of variation, and a variety of these are known in the art. Protocols for 
the immunoassay may be based, for example, upon competition, or direct reaction, or 
sandwich type assays. Protocols may also, for example, use solid supports, or may be by 
immunoprecipitation. Most assays involve the use of labeled antibody or polypeptide; the 
labels may be, for example, fluorescent, chemiluminescent, radioactive, or dye molecules. 
Assays which amplify the signals from the probe are also known; examples of which are 
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assays which utilize biotin and avidin, and enzyme-labeled and mediated immunoassays, such 
as ELISA assays. 

Kits suitable for immunodiagnosis and containing the appropriate labeled reagents are 
constructed by packaging the appropriate materials, including the compositions of the 
invention, in suitable containers, along with the remaining reagents and materials (for 
example, suitable buffers, salt solutions, etc.) required for the conduct of the assay, as well as 
suitable set of assay instructions. 

Nucleic Acid Hybridisation 

"Hybridization'* refers to the association of two nucleic acid sequences to one another 
by hydrogen bonding. Typically, one sequence will be fixed to a solid support and the other 
will be free in solution. Then, the two sequences will be placed in contact with one another 
under conditions that favor hydrogen bonding. Factors that affect this bonding include: the 
type and volume of solvent; reaction temperature; time of hybridization; agitation; agents to 
block the non-specific attachment of the liquid phase sequence to the solid support 
(Denhardt's reagent or BLOTTO); concentration of the sequences; use of compounds to 
increase the rate of association of sequences (dextran sulfate or polyethylene glycol); and the 
stringency of the washing conditions following hybridization. See Sambrook et al. [supra] 
Volume 2, chapter 9, pages 9.47 to 9.57. 

"Stringency" refers to conditions in a hybridization reaction that favor association of 
very similar sequences over sequences that differ. For example, the combination of 
temperature and salt concentration should be chosen that is approximately 120 to 200 DC 
below the calculated Tm of the hybrid under study. The temperature and salt conditions can 
often be determined empirically in preliminary experiments in which samples of genomic 
DNA immobilized on filters are hybridized to the sequence of interest and then washed under 
conditions of different stringencies. See Sambrook et al. at page 9.50. 

Variables to consider when performing, for example, a Southern blot are (1 ) the 
complexity of the DNA being blotted and (2) the homology between the probe and the 
sequences being detected. The total amount of the fragment(s) to be studied can vary a 
magnitude of 1 0, from 0. 1 to 1 |ig for a plasmid or phage digest to 1 0" 9 to 1 0" 8 g for a single 
copy gene in a highly complex eukaryotic geno.ae. For lower complexity polynucleotides, 
substantially shorter blotting, hybridization, and exposure times, a smaller amount of starting 
polynucleotides, and lower specific activity of probes can be used. For example, a single-copy 
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yeast gene can be detected with an exposure time of only 1 hour starting with 1 jig of yeast 
DNA, blotting for two hours, and hybridizing for 4-8 hours with a probe of 10 8 cpm/|ig. For a 
single-copy mammalian gene a conservative approach would start with 10 jig of DNA, blot 
overnight, and hybridize overnight in the presence of 10% dextran sulfate using a probe of 
greater than 10 s cpm/jug, resulting in an exposure time of -24 hours. 

Several factors can affect the melting temperature (Tm) of a DNA-DNA hybrid 
between the probe and the fragment of interest, and consequently, the appropriate conditions 
for hybridization and washing. In many cases the probe is not 100% homologous to the 
fragment. Other commonly encountered variables include the length and total G+C content of 
the hybridizing sequences and the ionic strength and formamide content of the hybridization 
buffer. The effects of all of these factors can be approximated by a single equation: 

Tm= 81 + 16.6(log, 0 Ci) + 0.4[%(G + C)]-0.6(%formamide) - 600/«-1.5(%mismatch). 

where Ci is the salt concentration (monovalent ions) and n is the length of the hybrid 
in base pairs (slightly modified from Meinkoth & Wahl (1984) Anal Biochem. 138: 267-284). 

In designing a hybridization experiment, some factors affecting nucleic acid 
hybridization can be conveniently altered. The temperature of the hybridization and washes 
and the salt concentration during the washes are the simplest to adjust. As the temperature of 
the hybridization increases (ie. stringency), it becomes less likely for hybridization to occur 
between strands that are nonhomologous, and as a result, background decreases. If the 
radiolabeled probe is not completely homologous with the immobilized fragment (as is 
frequently the case in gene family and interspecies hybridization experiments), the 
hybridization temperature must be reduced, and background will increase. The temperature of 
the washes affects the intensity of the hybridizing band and the degree of background in a 
similar manner. The stringency of the washes is also increased with decreasing salt 
concentrations. 

In general, convenient hybridization temperatures in the presence of 50% formamide 
are 42 DC for a probe with is 95% to 100% homologous to the target fragment, 37 DC for 90% 
to 95% homology, and 32 DC for 85% to 90% homology. For lower homologies, formamide 
content should be lowered and temperature adjusted accordingly, using the equation above. If 
the homology between the probe and the target fragment are not known, the simplest 
approach is to start with both hybridization and wash conditions which are nonstringent. If 
non-specific bands or high background are observed after autoradiography, the filter can be 
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washed at high stringency and reexposed. If the time required for exposure makes this 
approach impractical, several hybridization and/or washing stringencies should be tested in 
parallel. 

Nucleic Acid Probe Assays 

Methods such as PCR, branched DNA probe assays, or blotting techniques utilizing 
nucleic acid probes according to the invention can determine the presence of cDNA or 
mRNA. A probe is said to "hybridize" with a sequence of the invention if it can form a duplex 
or double stranded complex, which is stable enough to be detected. 

The nucleic acid probes will hybridize to the Neisserial nucleotide sequences of the 
invention (including both sense and antisense strands). Though many different nucleotide 
sequences will encode the amino acid sequence, the native Neisserial sequence is preferred 
because it is the actual sequence present in cells. mRNA represents a coding sequence and so 
a probe should be complementary to the coding sequence; single-stranded cDNA is 
complementary to mRNA, and so a cDNA probe should be complementary to the non-coding 
sequence. 

The orobe sequence need not be identical to the Neisserial sequence (or its 
complement) — some variation in the sequence and length can lead to increased assay 
sensitivity if the nucleic acid probe can form a duplex with target nucleotides, which can be 
detected. Also, the nucleic acid probe can include additional nucleotides to stabilize the 
formed duplex. Additional Neisserial sequence may also be helpful as a label to detect the 
formed duplex. For example, a non-complementary nucleotide sequence may be attached to 
the 5' end of the probe, with the remainder of the probe sequence being complementary to a 
Neisserial sequence. Alternatively, non-complementary bases or longer sequences can be 
interspersed into the probe, provided that the probe sequence has sufficient complementarity 
with the a Neisserial sequence in order to hybridize therewith and thereby form a duplex 
which can be detected. 

The exact length and sequence of the probe will depend on the hybridization 
conditions, such as temperature, salt condition and the like. For example, for diagnostic 
applications, depending on the complexity of the analyte sequence, the nucleic acid probe 
typically contains at least 10-20 nucleotides, preferably 15-25, and more preferably at least 30 
nucleotides, although it may be shorter than this. Short primers generally require cooler 
temperatures to form sufficiently stable hybrid complexes with the template. 
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Probes may be produced by synthetic procedures, such as the triester method of 
Matteucci et al [J. Am. Chem. Soc. (1981) 103:3185], or according to Urdea et al [Proa 
Natl Acad. ScL USA (1983) SO: 7461], or using commercially available automated 
oligonucleotide synthesizers. 

The chemical nature of the probe can be selected according to preference. For certain 
applications, DNA or RNA are appropriate. For other applications, modifications may be 
incorporated eg. backbone modifications, such as phosphorothioates or methylphosphonates, 
can be used to increase in vivo half-life, alter RNA affinity, increase nuclease resistance etc. 
[eg. see Agrawal & Iyer (1995) Curr Opin Biotechnol 6:12-19; Agrawal (1996) TIBTECH 
14:376-387]; analogues such as peptide nucleic acids may also be used [eg. see Corey (1997) 
TIBTECH 15:224-229; Buchardt et al (1993) TIBTECH 1 1 :384-386]. 

One example of a nucleotide hybridization assay is described by Urdea et al in 
international patent application WO92/02526 [see also US patent 5,124,246]. 

Alternatively, the polymerase chain reaction (PCR) is another well-known means for 
detecting small amounts of target nucleic acids. The assay is descnbed in: Mullis et al [Meth. 
Enzymol (1987) 155: 335-350]; US patent 4,683,195; and US patent 4,683,202. Two 
"primer" nucleotides hybridize with the target nucleic acids and are used to prime the 
reaction. The primers can comprise sequence that does not hybridize to the sequence of the 
amplification target (or its complement) to aid with duplex stability or, for example, to 
incorporate a convenient restriction site. Typically, such sequence will flank the desired 
Neisserial sequence. 

A thermostable polymerase creates copies of target nucleic acids from the primers 
using the original target nucleic acids as a template. After a threshold amount of target nucleic 
acids are generated by the polymerase, they can be detected by more traditional methods, such 
as Southern blots. When using the Southern blot method, the labelled probe will hybridize to 
the Neisserial sequence (or its complement). 

Also, mRNA or cDNA can be detected by traditional blotting techniques described in 
Sambrook et al [supra]. mRNA, or cDNA generated from mRNA using a polymerase 
enzyme, can be purified and separated using gel electrophoresis. The nucleic acids on the gel 
are then blotted onto a solid support, such as nitrocellulose. The solid support is exposed to a 
labelled probe and then washed to remove any unhybridized probe. Next, the duplexes 
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containing the labeled probe are detected. Typically, the probe is labelled with a radioactive 
moiety. 

EXAMPLES 

The examples describe nucleic acid sequences which have been identified in 
N. meningitidis, and N. gonorrhoeae along with their respective and putative translation 
products. Not all of the nucleic acid sequences are complete ie. they encode less than the full- 
length wild-type protein. 

The examples are generally in the following format: 

• a nucleotide sequence which has been identified in N. meningitidis 

• the putative translation product of said N. meningitidis sequence 

• a computer analysis of said translation product based on database comparisons 

• a corresponding nucleotide sequence identified from N. gonorrhoeae 

• the putative translation product of said N. gonorrhoeae sequence 

• a comparision of the percentage of identity between the translation product of the 
N. meningitidis sequence and the N. gonorrhoeae sequence. 

• a corresponding nucleotide sequence identified from strain A of N. meningitidis 

• the putative translation product of said N. meningitidis strain A sequence 

• a comparision of the percentage of identity between the translation product of the 
N. meningitidis sequence and the N. gonorrhoeae sequence. 

• a description of the characteristics of the protein which indicates that it might be 
suitably antigenic or immunogenic. 

Sequence comparisons were performed at NCBI (http://www.ncbi.nlm.nih.gov) using 
the algorithms BLAST, BLAST2, BLASTn, BLASTp, tBLASTn, BLASTx, & tBLASTx [eg 
see also Altschul et al (1997) Gapped BLAST and PSI-BLAST: a new generation of protein 
database search programs. Nucleic Acids Research 25:2289-3402]. Searches were performed 
against the following databases: non-redundant GenBank+EMBL+DDBJ+PDB sequences 
and non-redundant GenBank CDS translations+PDB+SwissProt+SPupdate+PER sequences. 

Dots within nucleotide sequences represent nucleotides which have been arbitrarily 
introduced in order to maintain a reading frame. In the same way, double-underlined 
nucleotides were removed . Lower case letters represent ambiguities which arose during 
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alignment of independent sequencing reactions (some of the nucleotide sequences in the 
examples are derived from combining the results of two or more experiments). 

Nucleotide sequences were scanned in all six reading frames to predict the presence of 
hydrophobic domains using an algorithm based on the statistical studies of Esposti et ai 
[Critical evaluation of the hydropathy of membrane proteins (1990) Eur J Biochem 190:207- 
219]. These domains represent potential transmembrane regions or hydrophobic leader 
sequences. 

Open reading frames were predicted from fragmented nucleotide sequences using the 
program ORFFINDER (NCBI). 

Underlined amino acid sequences indicate possible transmembrane domains or leader 
sequences in the ORFs, as predicted by the PSORT algorithm (http://www.psort.nibb.ac.jp). 
Functional domains were also predicted using the MOTIFS program (GCG Wisconsin & 
PROSITE). 

For each of the following examples: based on the presence of a putative leader 
sequence and/or several putative transmembrane domains (single-underlined) in the 
gonococcal protein, it is predicted that the proteins from N. meningitidis and N. gonorrhoeae, 
and their respective epitopes, could be useful antigens or immunogenic compositions for 
vaccines or diagnostics. 

The standard techniques and procedures which may be employed in order to perform 
the invention (e.g. to utilize the disclosed sequences for vaccination or diagnostic purposes) 
were summarized above. This summary is not a limitation on the invention but, rather, gives 
examples that may be used, but are not required. 

In particular, the following methods were used to express, purify and biochemically 
characterize the proteins of the invention. 
Chromosomal DNA Preparation 

N. meningitidis strain 2996 was grown to exponential phase in 100ml of GC medium, 
harvested by centrifugation, and resuspended in 5ml buffer (20%(w/v) Sucrose, 50mM Tris- 
HC1, 50mM EDTA, pH8). After 10 minutes incubation on ice, the bacteria were lysed by 
adding 10ml of lysis solution (50mM NaCl, 1% Na-Sarkosyl, 50jag/ml Proteinase K), and the 
suspension incubated at 37°C for 2 hours. Two phenol extractions (equilibrated to pH 8) and 
one CHCb/isoamylalcohol (24:1) extraction were performed. DNA was precipitated by 
addition of 0.3M sodium acetate and 2 volumes of ethanol. and collected by centrifugation. 
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The pellet was washed once with 70%(v/v) ethanol and redissolved in 4.0ml TE buffer 
(lOmM Tris-HCl, ImM EDTA, pH 8.0). The DNA concentration was measured by reading 
the OD at 260 nm. 

Oligonucleotide design 

Synthetic oligonucleotide primers were designed on the basis of the coding sequence 
of each ORF, using (a) the meningococcus B sequence when available, or (b) the 
gonococcus/meningococcus A sequence, adapted to the codon preference usage of 
meningococcus as necessary. Any predicted signal peptides were omitted, by designing the 5' 
primers to sequence immediately downstream from the predicted leader sequence. 

For most ORFs, the 5' primers included two restriction enzyme recognition sites 
{BamHl-Ndel, BamHl-Nhel, EcoRl-Ndel or EcoRl-Nhel), depending on the restriction pattern 
of the gene of interest. The 3' primers included a Xhol or a Hindlll restriction site (table 1). 
This procedure was established in order to direct the cloning of each amplification product 
(corresponding to each ORF) into two different expression systems: pGEX-KG (using 
BamHl-Xhol, Bamffl-Hmdlll, EcoRl-Xhol or EcoRl-Hindlll), and pET2 1 b+ (using Ndel- 
^ol, Nhel Xhol, Ndel-Hindai or Nhel-HindUl). 



5 '-end primer tail: 



3' -end primer tail: 



CGC GGATCCCATATG 
CGC GGATCCGCTAGC 
CCG GAATTC TA CATATG 
CCGGAATTCTAGCTAGC 



CCCGCTCGAG 
CCCGCTCGAG 



(BamHl-Ndel ) 
{BamHl-Nhel) 
(EcoRl-Ndel) 
(EcoRl-Nhel) 

(Xhol) 
(Hindlll) 



For cloning ORFs into the pGEX-His vector, the 5' and 3' primers contained only one 
restriction enzyme site (EcoRl, Kpnl or Sail for the 5' primers and Pstl, Xbal, Sphl or Sail for 
the 3' primers). Again restriction sites were chosen according to the particular restriction 
pattern of the gene (table 1). 



5'-end primer tail: 



3'-end primer tail: 



(AAA) AAA GAATTC 
( AAA ) AAA GGTACC 
(AAA) AAACTGCAG 
(AAA) AAA TC TAG A 



(EcoRl ) 
(Kpnl) 
(Pstl) 
(Xbal) 
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AAAGCATGC 



(Sph\) 



5' or 3'-end primer tail: 



AAAAAAGTCGAC 



(Sail) 



As well as containing the restriction enzyme recognition sequences, the primers 
included nucleotides which hybridized to the sequence to be amplified. The melting 
temperature depended on the number and type of hybridising nucleotides in the whole primer, 
and was determined for each primer using the formulae: 



X m = 64.9 + 0.41 (% GC) - 600/N (whole primer) 

The melting temperatures of the selected oligonucleotides were usually 65-70°C for 
the whole oligo and 50-55°C for the hybridising region alone. 

Table 1 shows the forward and reverse primers used for each amplification. In certain 
cases, the sequence of the primer does not exactly match the sequence of the predicted ORF. 
This is because when initial amplifications were performed, the complete 5 1 and/or 3' 
sequences for some meningococcal B ORFs were not known. However the corresponding 
sequences had been identified in Gonococcus or in Meningoccus A. Hence, when the 
Meningoccus B sequence was incomplete or uncertain, Gonococcal or Meningococcal A 
sequences were used as the basis for primer design. These sequences were altered to take 
account of codon preference. It can be appreciated that, once the complete sequence is 
identified, this approach will no longer be necessary. 

Oligonucleotides were synthesized using a Perkin Elmer 394 DNA/RNA Synthesizer, 
eluted from the columns in 2.0ml NH 4 OH, and deprotected by 5 hours incubation at 56°C. 
The oligos were precipitated by addition of 0.3M Na- Acetate and 2 volumes ethanol. The 
samples were centrifuged and the pellets resuspended in either 100|il or 1.0ml of water. The 
OD26O was determined using a Perkin Elmer Lambda Bio spectophotometer and the 
concentration adjusted to 2-10pmol/jal. 

Amplification 

The standard PCR protocol was as follows: 50-200ng of genomic DNA was used as a 
template in the presence of 20-40uM of each oligonucletide primer, 400-800|iM dNTPs 
solution, lx PCR buffer (including 1.5mM MgCl 2 ), 2.5 units TaqlDNA polymerase (using 



T m =4(G+C)+2(A+T) 



(tail excluded) 
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Perkin-Elmer AmpliTaQ, GIBCO Platinum, Pwo DNA polymerase, or Tahara Shuzo Taq 
polymerase). In some cases, PCR was optimsed by the addition of lOjal DMSO or 50|il 2M 
Betaine. 

After a hot start (adding the polymerase during a preliminary 3 minute incubation of 
the whole mix at 95°C), each sample underwent a two-step amplification. The first 5 cycles 
were performed using the hybridization temperature that excluded the restriction enzyme tail 
of the primer (see above). This was followed by 30 cycles using the hybridization temperature 
calculated for the whole length oligos. The cycles were completed with a 10 minute extension 
step at 72°C. The standard cycles were as follows: 





Denaturation 


Hybridisation 


Elongation 


First 5 cycles 


30 seconds 
95°C 


30 seconds 
50-55°C 


30-60 seconds 
72°C 


Last 30 cycles 


30 seconds 
95°C 


30 seconds 
65-70°C 


30-60 seconds 
72°C 



Elongation times varied according to the length of the ORF to be amplified. 
Amplifications were performed using either a 9600 or a 2400 Perkin Elmer GeneAmp PCR 
System. To check the results, 1/10 of the amplification volume was loaded onto a 1-1 .5% 
(w/v) agarose gel and the size of each amplified fragment compared with a DNA molecular 
weight marker. 

The amplified DNA was either loaded directly on a 1% agarose gel or first 
precipitated with ethanoland resuspended in a volume suitable to be loaded on a 1.0% 
agarose gel. The DNA fragment corresponding to the band of correct size was purified using 
the Qiagen Gel Extraction Kit, following the manufacturer's protocol. DNA fragments were 
eluted in a volume of 30\x\ or 50|il with either H20 or lOmM Tris, pH 8.5. 

Digestion of PCR fragments 

The purified DNA corresponding to the amplified fragment was doubly-digested with 
the appropriate restriction enzymes for; cloning into pET-21b+ and expressing the protein as a 
C-terminus His-tagged fusion, for cloning into pGEX-KG and expressing the protein as a N- 
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terminus GST-fusion, and for cloning into pGEX-His and expressing the protein as a 
N-terminus GST-His tagged fusion. 

Each purified DNA fragment was incubated at 37°C for 3 hours to overnight with 20 
units of appropriate restriction enzyme (New England Biolabs) in a volume of either 30 or 
40|il in the presence of suitable digestion buffer. Digested fragments were purified using the 
QIAquick PCR purification kit (following the manufacturer's instructions) and eluted in a 
volume of 30ul or 50(il with either H20 or lOmM Tris, ph 8.5. The DNA concentration was 
determined by quantitative agarose gel electrophoresis (1 .0% gel) in the presence of a titrated 
molecular weight marker. 

Digestion of the cloning vectors (pET22B, pGEX-KG, pTRC-His A, pET21b+, pGEX- 
KG, and pGEX-His) 

The vector pGEX-His is a modified pGEX-2T vector carrying a region encoding six 
histidine residues upstream of the thrombin cleavage site and containing the multiple cloning 
site of the vector pTRC99 (Pharmacia). 10 jig plasmid was double-digested with 50 units of 
each restriction enzyme in 200 \i\ reaciion voiume in the presence of appropnate buffer by 
overnight incubation at 37°C. After loading the whole digestion on a 1% agarose gel, the 
band corresponding to the digested vector was purified from the gel using the Qiagen 
QIAquick Gel Extraction Kit and the DNA was eluted in 50 [xl of 10 mM Tris-HCl, pH 8.5. 
The DNA concentration was evaluated by measuring OD260 of the sample, and adjusted to 50 
p.g/|il. 1 (il of plasmid was used for each cloning procedure. 

10|ig of plasmid vector was doubly-digested with 50 units of each restriction enzyme 
in a volume of 200|il with the appropnate buffer overnight at 37°C. The digest was loaded 
onto a 1.0% agarose gel and the band corresponding to the digested vector purified using the 
Qiagen QIAquick Gel Extraction Kit. DNA was eluted in 50|il of lOmM Tris-HCl, pH 8.5. 
The DNA concentration was evaluated by measuring OD 2 60nm and the concentration adjusted 
to 50|ig/jal. lpl of plasmid was used for each cloning procedure. 

Cloning 

For some ORFs, the fragments corresponding to each ORF, previously digested and 
purified, were ligated in both pET22b and pGEX-KG. In a final volume of 20 ul, a molar 



WO 99/57280 



58 



PCT/US99/09346 



ratio of 3:1 fragment/ vector was ligated using 0.5 yi\ of NEB T4 DNA ligase (400 units/pl), in 
the presence of the buffer supplied by the manufacturer. The reaction was incubated at room 
temperature for 3 hours. In some experiments, ligation was performed using the Boheringer 
"Rapid Ligation Kit", following the manufacturer's instructions. 

In order to introduce the recombinant plasmid in a suitable strain, 100 |il E. coli DH5 
competent cells were incubated with the ligase reaction solution for 40 minutes on ice, then at 
37°C for 3 minutes, then, after adding 800 ul LB broth, again at 37°C for 20 minutes. The 
cells were then centrifuged at maximum speed in an Eppendorf microfuge and resuspended in 
approximately 200 |il of the supernatant. The suspension was then plated on LB ampicillin 
(100 mg/ml ). 

The screening of the recombinant clones was performed by growing 5 
randomly-chosen colonies overnight at 37 °C in either 2 ml (pGEX or pTC clones) or 5ml 
(pET clones) LB broth + 100 |ag/ml ampicillin. The cells were then pelletted and the DNA 
extracted using the Qiagen QIAprep Spin Miniprep Kit, following the manufacturer's 
instructions, to a final volume of 30 pi 5 \x\ of each individual miniprep (approximately lg ) 
were digested with either NdeVXhol or BamHIIXhol and the whole digestion loaded onto a 1 - 
1.5% agarose gel (depending on the expected insert size), in parallel with the molecular 
weight marker (1Kb DNA Ladder, GIBCO). The screening of the positive clones was made 
on the base of the correct insert size. 

For other ORFs, the fragments corresponding to each ORF, previously digested and 
purified, were ligated into both pET21b+ and pGEX-KG. A molar ratio of of 3:1 
fragment/ vector was used in a final volume of 20ul, that included 0.5jil T4 DNA ligase (400 
units/|il, NEB) and ligation buffer supplied by the manufacturer. The reaction was performed 
at room temperature for 3 hours. In some experiments, ligation was performed using the 
Boheringer "Rapid Ligation Kit" and the manufacturer's protocol. 

Recombinant plasmid was transformed into IOOjjlI of competent E. coli DH5 or 
HB101 by incubating the ligase reaction solution and bacteria for 40 minutes on ice then at 
37°C for 3 minutes. This was followed by the addition of 800jil LB broth and incubation at 
37°C for 20 minutes. The cells were centrifuged at maximum speed in an Eppendorf 
microfuge, resuspended in approximately 200ul of the supernatant and plated onto LB 
ampicillin (lOOmg/ml ) agar. 
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Screening for recombinant clones was performed by growing 5 randomly selected 
colonies overnight at 37°C in either 2.0ml (pGEX-KG clones) or 5.0ml (pET clones) LB 
broth + 100jig/ml ampicillin. Cells were pelleted and plasmid DNA extracted using the 
Qiagen QIAprep Spin Miniprep Kit, following the manufacturer's instructions. 
Approximately l|ag of each individual miniprep was digested with the appropriate restriction 
enzymes and the digest loaded onto a 1-1.5% agarose gel (depending on the expected insert 
size), in parallel with the molecular weight marker (lkb DNA Ladder, GIBCO). Positive 
clones were selected on the basis of the size of insert. 

ORFs were cloned into PGEX-His, by doubly-digesting the PCR product and ligating 
into similarly digested vector. After cloning, recombinant plasmids were transformed into the 
E.coli host W31 10. Individual clones were grown overnight at 37°C in LB broth with 50^g/ml 
ampicillin. 

Certain ORFs may be cloned into the pGEX-HIS vector using EcoRl-Pstl cloning 
sites, or EcoRl-Satl, or Sall-Pstl. After cloning, the recombinant plasmids may be introduced 
in the£.coli host W3110. 

Expression 

Each ORF cloned into the expression vector may then be transformed into the strain 
suitable for expression of the recombinant protein product. 1 jil of each construct was used to 
transform 30 (il of E.coli BL21 (pGEX vector), E.coli TOP 10 (pTRC vector) or E.coli BL21- 
DE3 (pET vector), as described above. In the case of the pGEX-His vector, the same E.coli 
strain (W31 10) was used for initial cloning and expression. Single recombinant colonies were 
inoculated into 2ml LB+Amp (100 |Ag/ml), incubated at 37°C overnight, then diluted 1:30 in 
20 ml of LB+Amp (100 ^ig/ml) in 100 ml flasks, making sure that the OD600 ranged between 
0.1 and 0.15. The flasks were incubated at 30°C into gyratory water bath shakers until OD 
indicated exponential growth suitable for induction of expression (0.4-0.8 OD for pET and 
pTRC vectors; 0.8-1 OD for pGEX and pGEX-His vectors). For the pET, pTRC and pGEX- 
His vectors, the protein expression was induced by addiction of lrnM IPTG, whereas in the 
case of pGEX system the final concentration of IPTG was 0.2 mM. After 3 hours incubation 
at 30°C, the final concentration of the sample was checked by OD. In order to check 
expression, 1ml of each sample was removed, centrifuged in a micro fuge, the pellet 
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resuspended in PBS, and analysed by 12% SDS-PAGE with Coomassie Blue staining. The 
whole sample was centrifuged at 6000g and the pellet resuspended in PBS for further use. 

GST-fusion proteins large-scale purification. 

For some ORFs, a single colony was grown overnight at 37°C on LB+Amp agar plate. 
The bacteria were inoculated into 20 ml of LB+Amp liquid colture in a water bath shaker and 
grown overnight. Bacteria were diluted 1 :30 into 600 ml of fresh medium and allowed to 
grow at the optimal temperature (20-3 7°C) to OD 55 o 0.8-1 . Protein expression was induced 
with 0.2mM IPTG followed by three hours incubation. The culture was centrifuged at 8000 
rpm at 4°C. The supernatant was discarded and the bacterial pellet was resuspended in 7.5 ml 
cold PBS. The cells were disrupted by sonication on ice for 30 sec at 40W using a Branson 
sonifier B-15, frozen and thawed two times and centrifuged again. The supernatant was 
collected and mixed with 150ul Glutatione-Sepharose 4B resin (Pharmacia) (previously 
washed with PBS) and incubated at room temperature for 30 minutes. The sample was 
centrifuged at 700g for 5 minutes at 4C. The resin was washed twice with 10 ml cold PBS for 
10 minutes, resuspended in 1ml cold PBS, and loaded on a disposable column. The resin was 
washed twice with 2ml cold PBS until the flow-through reached OD 28 o of 0.02-0.06. The 
GST-fusion protein was eluted by addition of 700ul cold Glutathione elution buffer lOmM 
reduced glutathione, 50mM Tris-HCl) and fractions collected until the OD 28 o was 0. 1 . 21 ul of 
each fraction were loaded on a 12% SDS gel using either Biorad SDS-PAGE Molecular 
weight standard broad range (Ml) (200, 1 16.25, 97.4, 66.2, 45, 31, 21.5, 14.4, 6.5 kDa) or . 
Amersham Rainbow Marker (M") (220, 66, 46, 30, 21 .5, 14.3 kDa) as standards. As the MW 
of GST is 26kDa, this value must be added to the MW of each GST-fusion protein. 

For other ORFs, for each clone to be purified as a GST-fusion, a single colony was 
streaked out and grown overnight at 37°C on a LB/Amp. (100(ig/ml) agar plate. An isolated 
colony from this plate was inoculated into 20ml of LB/Amp (100 ug/ml) liquid medium and 
grown overnight at 37°C with shaking. The overnight culture was diluted 1 :30 into 600ml 
LB/Amp (100ng/ml) liquid medium and allowed to grow at the optimal temperature (20- 
37°C) until the OD 5 50nm reached 0.6-0.8. Recombinant protein expression was induced by 
addition of IPTG (final concentration 0.2mM) and the culture incubated for a further 3 hours. 
Bacteria were harvested by centnfugation at 8000xg for 15 min at 4°C. 
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The bacterial pellet was resuspended in 7.5ml cold PBS. Cells were disrupted by 
sonication on ice four times for 30 sec at 40W using a Branson sonifier 450 and centrifuged at 
13 OOOxg for 30 min at 4°C. The supernatant was collected and mixed with 150|il Glutatione- 
Sepharose 4B resin (Pharmacia), previously equilibrated with PBS, and incubated at room 
temperature with gentle agitation for 30 min. The batch-wise preparation was centrifuged at 
700x£ for 5 min at 4°C and the supernatant discarded. The resin was washed twice 
(batchwise) with 10ml cold PBS for 10 min, resuspended in 1ml cold PBS, and loaded onto a 
disposable column. The resin continued to be washed with cold PBS, until the OD 2 80nm of the 
flow-through reached 0.02-0.01. The CST-ftision protein was eluted by addition of 700(^1 cold 
glutathione elution buffer (lOmM reduced glutathione, 50mM Tris-HCl pH 8.0) and fractions 
collected, until the OD 28 onm of the eluate indicated all the recombinant protein was obtained. 
20^il aliquots of each elution fraction were analyzed by SDS-PAGE using a 12% gel. The 
molecular mass of the purified proteins was determined using either the Bio-Rad broad range 
molecular weight standard (Ml) (200, 116, 97.4, 66.2, 45.0, 31.0, 21.5, 14.4, 6.5 kDa) or the 
Amersham Rainbow Marker (M2) (220, 66.2, 46.0, 30.0, 21.5, 14.3 kDa). The molecular 
weights of GST-fusion proteins are a combination of the 26 kDa GST protein and its fusion 
partner. Protein concentrations were estimated using the Bradford assay. 

His-fusion soluble proteins large-scale purification. 

For some ORFs, a single colony was grown overnight at 37°C on a LB + Amp agar 
plate. The bacteria were inoculated into 20ml of LB+Amp liquid culture and incubated 
overnight in a water bath shaker. Bacteria were diluted 1 :30 into 600ml fresh medium and 
allowed to grow at the optimal temperature (20-37°C) to OD 550 0.6-0.8. Protein expression 
was induced by addition of 1 mM IPTG and the culture further incubated for three hours. The 
culture was centrifuged at 8000 rpm at 4°C, the supernatant was discarded and the bacterial 
pellet was resuspended in 7.5ml cold lOmM imidazole buffer (300 mM NaCl, 50 mM 
phosphate buffer, 10 mM imidazole, pH 8). The cells were disrupted by sonication on ice for 
30 sec at 40W using a Branson sonifier B-15, frozen and thawed two times and centrifuged 
again. The supernatant was collected and mixed with \ 50\x\ Ni 2+ -resin (Pharmacia) 
(previously washed with lOmM imidazole buffer) and incubated at room temperature with 
gentle agitation for 30 minutes. The sample was centrifuged at IQOg for 5 minutes at 4°C. 
The resin was washed twice with 10 ml cold lOmM imidazole buffer for 10 minutes, 
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resuspended in 1ml cold lOmM imidazole buffer and loaded on a disposable column. The 
resin was washed at 4°C with 2ml cold lOmM imidazole buffer until the flow-through 
reached the O.D 28 o of 0.02-0.06. The resin was washed with 2ml cold 20mM imidazole 
buffer (300 miM NaCl, 50 mM phosphate buffer, 20 mM imidazole, pH 8) until the flow- 
through reached the O.D 2 yo of 0.02-0.06. The His-fusion protein was eluted by addition of 
700ul cold 250mM imidazole buffer (300 mM NaCI, 50 mM phosphate buffer, 250 mM 
imidazole, pH 8) and fractions collected until the O.D 28 o was 0.1. 21u.l of each fraction were 
loaded on a 12%SDS gel. 

His-fusion insoluble proteins large-scale purification. 

A single colony was grown overnight at 37 °C on a LB + Amp agar plate. The 
bacteria were inoculated into 20 ml of LB+Amp liquid culture in a water bath shaker and 
grown overnight. Bacteria were diluted 1:30 into 600ml fresh medium and let to grow at the 
optimal temperature (37°C) to O.D550 0.6-0.8. Protein expression was induced by addition 
of 1 mM IPTG and the culture further incubated for three hours. The culture was centrifuged 
at 8000rpm at 4°C. The supernatant was discarded and the bacterial pellet was resuspended in 
T5 ml luffer B (urea SM, lOmM Tris-HCl, lOOmM phosphate buffer, pH 8.8). The cells 
were disrupted by sonication on ice for 30 sec at 40W using a Branson sonifier B-15, frozen 
and thawed twice and centrifuged again. The supernatant was stored at -20°C, while the 
pellets were resuspended in 2 ml guanidine buffer (6M guanidine hydrochloride, lOOmM 
phosphate buffer, 10 mM Tris-HCl, pH 7.5) and treated in a homogenizer for 10 cycles. The 
product was centrifuged at 13000 rpm for 40 minutes. The supernatant was mixed with 1 50^1 
Ni 2+ -resin (Pharmacia) (previously washed with buffer B) and incubated at room temperature 
with gentle agitation for 30 minutes. The sample was centrifuged at 700 g for 5 minutes at 
4°C. The resin was washed twice with 10 ml buffer B for 10 minutes, resuspended in 1ml 
buffer B, and loaded on a disposable column. The resin was washed at room temperature 
with 2ml buffer B until the flow-through reached the OD 2 8o of 0.02-0.06. The resin was 
washed with 2ml buffer C (urea 8M, lOmM Tris-HCl, lOOmM phosphate buffer, pH 6.3) until 
the flow-through reached the O.D 280 of 0.02-0.06. The His-fusion protein was eluted by 
addition of 700ul elution buffer (urea 8M, lOmM Tris-HCl, lOOmM phosphate buffer, pH 
4.5) and fractions collected until the OD 280 was 0.1 . 21 ul of each fraction were loaded on a 
12%SDS gel. 
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Purification of His-fusion proteins. 

For each clone to be purified as a His-fusion, a single colony was streaked out and 
grown overnight at 37°C on a LB/ Amp (100 ug/ml) agar plate. An isolated colony from this 
plate was inoculated into 20ml of LB/Amp (100 ug/ml) liquid medium and grown overnight 
at 37°C with shaking. The overnight culture was diluted 1:30 into 600ml LB/Amp (100 
fag/ml) liquid medium and allowed to grow at the optimal temperature (20-3 7°C) until the 
OD 55 onm reached 0.6-0.8. Expression of recombinant protein was induced by addition of IPTG 
(final concentration 1 .OmM) and the culture incubated for a further 3 hours. Bacteria were 
harvested by centrifugation at 8000xg for 15 min at 4°C. 

The bacterial pellet was resuspended in 7.5ml of either (i) cold buffer A (300mM 
NaCl, 50mM phosphate buffer, lOmM imidazole, pH 8.0) for soluble proteins or (ii) buffer B 
(8M urea, lOmM Tris-HCl, lOOmM phosphate buffer, pH 8.8) for insoluble proteins. Cells 
were disrupted by sonication on ice four times for 30 sec at 40W using a Branson sonifier 450 
and centrifuged at 13 OOOxg for 30 min at 4°C. For insoluble proteins, pellets were 
resuspended in 2.0 ml buffer C (6M guanidine hydrochloride, lOOmM phosphate buffer, 
lOmM Tris-HCl, pH 7.5) and treated with a Dounce homogenizer for 10 cycles. The 
homogenate was centrifuged at 13 OOOxg for 40 min and the supernatant retained. 

Supernatants for both soluble and insoluble preparations were mixed with 150|il Ni 2+ - 
resin (previously equilibrated with either buffer A or buffer B, as appropriate) and incubated 
at room temperature with gentle agitation for 30 min. The resin was Chelating Sepharose Fast 
Flow (Pharmacia), prepared according to manufacturers protocol. The batch-wise preparation 
was centrifuged at 700xg for 5 min at 4°C and the supernatant discarded. The resin was 
washed twice (batch-wise) with 1 0ml buffer A or B for 1 0 min, resuspended in 1 .0 ml buffer 
A or B and loaded onto a disposable column. The resin continued to be washed with either (i) 
buffer A at 4°C or (ii) buffer B at room temperature, until the OD 28 onm of the flow-through 
reached 0.02-0.01 . The resin was further washed with either (i) cold buffer C (300mM NaCl, 
50mM phosphate buffer, 20mM imidazole, pH 8.0) or (ii) buffer D (8M urea, lOmM Tris- 
HCl, lOOmM phosphate buffer, pH 6.3) until the the OD 28 onm of the flow-through reached 
0.02-0.01. The His-fusion protein was eluted by addition of 700jil of either (i) cold elution 
buffer A (300mM NaCl, 50mM phosphate buffer, 250mM imidazole, pH 8.0) or (ii) elution 
buffer B (8 M urea, lOmM Tris-HCl, lOOmM phosphate buffer, pH 4.5) and fractions 
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collected until the O.D 2 80nm indicated all the recombinant protein was obtained. 20|il aliquots 
of each elution fraction were analyzed by SDS-PAGE using a 12% gel. Protein concentrations 
were estimated using the Bradford assay. 

His-fusion proteins renaturation 

In the cases where denaturation was required to solubilize proteins, a renaturation step 
was employed prior to immunization. Glycerol was added to the denatured fractions obtained 
above to give a final concentration of 10%(v/v). The proteins were diluted to 200|ig/ml using 
dialysis buffer I (10% (v/v) glycerol, 0.5M arginine, 50mM phosphate buffer, 5.0mM reduced 
glutathione, 0.5mM oxidised glutathione, 2.0M urea, pH 8.8) and dialysed against the same 
buffer for 12-14 hours at 4°C. Further dialysis was performed with buffer II (10% (v/v) 
glycerol, 0.5M arginine, 50mM phosphate buffer, 5.0mM reduced glutathione, 0.5mM 
oxidised glutathione, pH 8.8) for 12-14 hours at 4°C. 

Alternatively, 10% glycerol was added to the denatured proteins. The proteins were 
then diluted to 20^ig/ml using dialysis buffer I (10% glycerol, 0.5M arginine, 50mM 
phosphate buffer, 5mM reduced glutathione, 0.5mM oxidised glutathione, 2M urea, pH 8.8) 
and dialysed against the same buffer at 4°C for 12-14 hours. The protein was further dialysed 
against dialysis buffer II (10% glycerol, 0.5M arginine, 50mM phosphate buffer, 5mM 
reduced glutathione, 0,5mM oxidised glutathione, pH 8.8) for 12-14 hours at 4°C. 

Protein concentration was evaluated using the formula: 

Protein (mg/ml) = (1.55 x OD 280 ) - (0.76 x OD 26 o) 

Purification of proteins 

To analyse the solubility, pellets obtained from 3.0ml cultures were resuspended in 
500^1 buffer Ml (PBS pH 7.2). 25^1 of lysozyme (lOmg/ml) was added and the bacteria 
incubated for 15 min at 4°C. Cells were disrupted by sonication on ice four times for 30 sec at 
40W using a Branson sonifier 450 and centrifuged at 13 OOOxg for 30 min at 4°C. The 
supernatant was collected and the pellet resuspended in buffer M2 [8M urea, 0.5M NaCl, 
20mM imidazole and 0.1M NaH 2 P0 4 ] and incubated for 3 to 4 hours at 4°C. After 
centrifugation, the supernatant was collected and the pellet resuspended in buffer M3 [6M 
guanidinium-HCl, 0.5M NaCl, 20miM imidazole and 0.1M NaH 2 P0 4 ] overnight at 4°C. The 
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supernatants from all steps were analysed by SDS-PAGE. Some proteins were found to be 
soluble in PBS, others need urea or guanidium-HCl for solubilization. 

For preparative scale purifications, 500ml cultures were induced and fusion proteins 
solubilized in either buffer Ml, M2 or M3 using the procedure described above. Crude 
extracts were loaded onto a Ni-NTA superflow column (Quiagen) equilibrated with buffer 
Ml, M2 or M3 depending on the solubilization buffer employed. Unbound material was 
eluted by washing the column with the same buffer. The recombinant fusion protein was 
eluted with the corresponding buffer containing 500mM imidazole then dialysed against the 
same buffer in the absence of imidazole. 

Mice immunisations 

20|xg of each purified protein are used to immunise mice intraperitoneally. In the case 
of some ORFs, Balb-C mice were immunised with Al(OH) 3 as adjuvant on days 1,21 and 42, 
and immune response was monitored in samples taken on day 56. For other ORFs, CD1 mice 
could be immunised using the same protocol. For ORFs 25 and 40, CD1 mice were 
immunised using Freund's adjuvant, and the same immunisation protocol was used, except 
that the immune response was mea^red on <'<iy 42, rather than 56. Similarly, for sliii other 
ORFs, CD1 mice were immunised with Freund's adjuvant, but the immune response was 
measured on day 49. Alternatively, 20(ig of each purified protein was mixed with Freund's 
adjuvant and used to immunise CD1 mice intraperitoneally. For many of the proteins, the 
immunization was performed on days 1,21 and 35, and immune response was monitored in 
samples taken on days 34 and 49. For some proteins, the third immunization was performed 
on day 28, rather than 35, and the immune response was measured on days 20 and 42, rather 
than 34 and 49. 

ELISA assay (sera analysis) 

The acapsulated MenB M7 strain was plated on chocolate agar plates and incubated 
overnight at 37°C. Bacterial colonies were collected from the agar plates using a sterile 
dracon swab and inoculated into 7ml of Mueller-Hinton Broth (Difco) containing 0.25% 
Glucose. Bacterial growth was monitored every 30 minutes by following OD620- The 
bacteria were let to grow until the OD reached the value of 0.3-0.4. The culture was 
centrifiiged for 10 minutes at 10000 rpm. The supernatant was discarded and bacteria were 
washed once with PBS, resuspended in PBS containing 0.025% formaldehyde, and incubated 
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for 2 hours at room temperature and then overnight at 4°C with stirring. lOOfil bacterial cells 
were added to each well of a 96 well Greiner plate and incubated overnight at 4°C. The wells 
were then washed three times with PBT washing buffer (0.1% Tween-20 in PBS). 200 [i\ of 
saturation buffer (2.7% Polyvinylpyrrolidone 10 in water) was added to each well and the 
plates incubated for 2 hours at 37°C. Wells were washed three times with PBT. 200 |al of 
diluted sera (Dilution buffer: 1% BSA, 0.1% Tween-20. 0. 1% NaN 3 in PBS) were added to 
each well and the plates incubated for 90 minutes at 37°C. Wells were washed three times 
with PBT. 100 ul of HRP-conjugated rabbit anti -mouse (Dako) serum diluted 1:2000 in 
dilution buffer were added to each well and the plates were incubated for 90 minutes at 37°C. 
Wells were washed three times with PBT buffer. 100 jal of substrate buffer for HRP (25 ml of 
citrate buffer pH5, 10 mg of O-phenildiamine and 10 |al of H 2 0) were added to each well and 
the plates were left at room temperature for 20 minutes. 100 ul H 2 S0 4 was added to each 
well and OD 490 was followed. The ELISA was considered positive when OD490 was 2.5 
times the respective pre-immune sera. 

Alternatively, The acapsulated MenB M7 strain was plated on chocolate agar plates 
and incubated overnight at 37°C. Bacterial colonies were collected from the agar plates using 
a sterile dracon swab and inoculated into Mueller-Hinton Broth (Difco) containing 0.25% 
Glucose. Bacterial growth was monitored every 30 minutes by following OD 6 2o- The bacteria 
were let to grow until the OD reached the value of 0.3-0.4. The culture was centrifuged for 10 
minutes at 10 OOOrpm. The supernatant was discarded and bacteria were washed once with 
PBS, resuspended in PBS containing 0.025% formaldehyde, and incubated for 1 hour at 37°C 
and then overnight at 4°C with stirring. lOOjal bacterial cells were added to each well of a 96 
well Greiner plate and incubated overnight at 4°C. The wells were then washed three times 
with PBT washing buffer (0.1% Tween-20 in PBS). 200(al of saturation buffer (2.7% 
Polyvinylpyrrolidone 10 in water) was added to each well and the plates incubated for 2 hours 
at 37°C. Wells were washed three times with PBT. 200|il of diluted sera (Dilution buffer: 1% 
BSA, 0.1%» Tween-20, 0.1% NaN 3 in PBS) were added to each well and the plates incubated 
for 2 hours at 37°C. Wells were washed three times with PBT. 100|xl of HRP-conjugated 
rabbit anti -mouse (Dako) serum diluted 1 :2000 in dilution buffer were added to each well and 
the plates were incubated for 90 minutes at 37°C. Wells were washed three times with PBT 
buffer. 100^1 of substrate buffer for HRP (25ml of citrate buffer pH5, lOmg of O- 
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phenildiamine and lOjal of H2O2) were added to each well and the plates were left at room 
temperature for 20 minutes. IOOjaI of 12.5% H 2 SOj was added to each well and OD490 was 
followed. The ELISA titers were calculated abitrarely as the dilution of sera which gave an 
OD490 value of 0.4 above the level of preimmune sera. The ELISA was considered positive 
when the dilution of sera with OD490 of 0.4 was higher than 1 :400. 

FACScan bacteria Binding Assay procedure. 

The acapsulated MenB M7 strain was plated on chocolate agar plates and incubated 
overnight at 37°C. Bacterial colonies were collected from the agar plates using a sterile 
dracon swab and inoculated into 4 tubes containing 8ml each Mueller-Hinton Broth (Difco) 
containing 0.25% glucose. Bacterial growth was monitored every 30 minutes by following 
OD 6 20- The bacteria were let to grow until the OD reached the value of 0.35-0.5. The culture 
was centrifuged for 10 minutes at 4000rpm. The supernatant was discarded and the pellet was 
resuspended in blocking buffer (1% BSA in PBS, 0.4% NaN 3 ) and centrifuged for 5 minutes 
at 4000rpm. Cells were resuspended in blocking buffer to reach OD620 of 0,07. lOOjal bacterial 
cells were added to each well ofaCostar96 well plate. lOOjal of diluted (1:100, 1:200, 1:400) 
sera (in blocking buffer) were added to each well and plates incubated for 2 hours at 4°C. 
Cells were centrifuged for 5 minutes at 4000rpm, the supernatant aspirated and cells washed 
by addition of 200jil/well of blocking buffer in each well. 100|il of R-Phicoerytrin conjugated 
F(ab)2 goat anti-mouse, diluted 1 TOO, was added to each well and plates incubated for 1 hour 
at 4°C. Cells were spun down by centrifugation at 4000rpm for 5 minutes and washed by 
addition of 200jalAvell of blocking buffer. The supernatant was aspirated and cells 
resuspended in 200)xl/well of PBS, 0.25% formaldehyde. Samples were transferred to 
FACScan tubes and read. The condition for FACScan (Laser Power 15mW) setting were: FL2 
on; FSC-H threshold:92; FSC PMT Voltage: E 01; SSC PMT: 474; Amp. Gains 6.1; FL-2 
PMT: 586; compensation values: 0. 

OMV preparations 

Bacteria were grown overnight on 5 GC plates, harvested with a loop and resuspended 
in 10 ml 20mM Tris-HCl. Heat inactivation was performed at 56°C for 30 minutes and the 
bacteria disrupted by sonication for 10 1 on ice ( 50% duty cycle, 50% output ). Unbroken 
cells were removed by centrifugation at 5000g for 10 minutes and the total cell envelope 
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fraction recovered by centrifugation at SOOOOg at 4°C for 75 minutes. To extract cytoplasmic 
membrane proteins from the crude outer membranes, the whole fraction was resuspended in 
2% sarkosyl (Sigma) and incubated at room temperature for 20 minutes. The suspension was 
centrifuged at lOOOOg for 10 minutes to remove aggregates, and the supernatant further 
ultracentriftiged at 50000g for 75 minutes to pellet the outer membranes. The outer 
membranes were resuspended in lOmM Tris-HCl, pH8 and the protein concentration 
measured by the Bio-Rad Protein assay, using BSA as a standard. 

Whole Extracts preparation 

Bacteria were grown overnight on a GC plate, harvested with a loop and resuspended 
in 1ml of 20mM Tris-HCl. Heat inactivation was performed at 56°C for 30' minutes. 

Western blotting 

Purified proteins (500ng/lane), outer membrane vesicles (Syig) and total cell extracts 
(25 jig) derived from MenB strain 2996 were loaded onto a 12% SDS-polyacrylamide gel and 
transferred to a nitrocellulose membrane. The transfer was performed for 2 hours at 150mA at 
4°C, using transfer buffer (0.3 % Tris base, 1 .44 % glycine, 20% (v/v) methanol). The 
membrane was saturated by overnight incubation at 4°C in saturation buffer (10% skimmed 
milk, 0.1% Triton XI 00 in PBS). The membrane was washed twice with washing buffer (3% 
skimmed milk, 0.1% Triton XI 00 in PBS) and incubated for 2 hours at 37°C with mice sera 
diluted 1 :200 in washing buffer. The membrane was washed twice and incubated for 90 
minutes with a 1 :2000 dilution of horseradish peroxidase labelled anti-mouse Ig. The 
membrane was washed twice with 0.1% Triton X100 in PBS and developed with the Opti- 
4CN Substrate Kit (Bio-Rad). The reaction was stopped by adding water. 

Bactericidal assay 

MC58 and 2996 strains were grown overnight at 37°C on chocolate agar plates. 5-7 
colonies were collected and used to inoculate 7ml Mueller-Hinton broth. The suspension was 
incubated at 37°C on a nutator and let to grow until OD 6 2o was in between 0.5-0.8. The 
culture was aliquoted into sterile 1.5ml Eppendorf tubes and centrifliged for 20 minutes at 
maximum speed in a microfuge. The pellet was washed once in Gey's buffer (Gibco) and 
resuspended in the same buffer to an OD 620 of 0.5, diluted 1 :20000 in Gey's buffer and stored 
at 25°C. 
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50fil of Gey's buffer/1% BSA was added to each well of a 96-well tissue culture plate. 
25(^1 of diluted (1:100) mice sera (dilution buffer: Gey's buffer/0.2% BSA) were added to 
each well and the plate incubated at 4°C. 25ul of the previously described bacterial 
suspension were added to each well. 25ul of either heat-inactivated (56°C waterbath for 30 
minutes) or normal baby rabbit complement were added to each well. Immediately after the 
addition of the baby rabbit complement, 22jil of each sample/well were plated on Mueller- 
Hinton agar plates (time 0). The 96-well plate was incubated for 1 hour at 37°C with rotation 
and then 22|il of each sample/well were plated on Mueller-Hinton agar plates (time 1). After 
overnight incubation the colonies corresponding to time 0 and time Ih were counted. 

Gene Variability 

The ORF4 and 919 genes were amplified by PCR on chromosomal DNA extracted 
from various Neisseria strains (see list of strains). The following oligonucleotides used as 
PCR primers were designed in the upstream and downstream regions of the genes: 

orf4.1 (forward) CGAATCCGGACGGCAGGACTC 
orf 4.3 (reverse) GGCAGGGAATGGCGGATTAAAG 

919.1 (forward) AAAATGCCTCTCCACGGCTG or 

CTGCGCCCTGTGTTAAAATCCCCT 
919.6 (reverse) CAAATAAGAAAGGAATTTTG or 

GGTATCGCAAAACTTCGCCTTAATGCG 

The PCR cycling conditions were: 

1 cycle 2 min. at 94° 

30 cycles 30 sec. at 94° 

30 sec. at ~ 54° or ~ 60° ( in according to Tm of the primers) 

40 sec. at 72° 
1 cycle 7 min. at 72° 

The PCR products were purified from 1 % agarose gel and sequenced using the 
following primers: 

orf4.1 (forward) CGAATCCGGACGGCAGGACTC 
orf 4.2 (forward) CGACCGCGCCTTTGGGACTG 
orf 4.3 (reverse) GGCAGGGAATGGCGGATTAAAG 
orf 4.4 (reverse) TCTTTGAGTTTGATCCAACC 
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919.1 (forward) AAAATGCCTCTCCACGGCTG or 

CTGCGCCCTGTGTTAAAATCCCCT 

919.2 (forward) ATCCTTCCGCCTCGGCTGCG 

919.3 (forward) AAAACAGCGGCACAATCGAC 

919.4 (forward) ATAAGGGCTACCTCAAACTC 

919.5 (forward) GCGCGTGGATTATTTTTGGG 

919.6 (reverse) CAAATAAGAAAGGAATTTTG or 

GGTATCGCAAAACTTCGCCTTAATGCG 

919.7 (reverse) CCCAAGGTAATGTAGTGCCG 

919.8 (reverse) TAAAAAAAAGTTCGACAGGG 

919.9 (reverse) CCGTCCGCCTGTCGTCGCCC 

919.10 (reverse) TCGTTCCGGCGGGGTCGGGG 

All documents cited herein are incorporated by reference in their entireties. 
The following Examples are presented to illustrate, not limit, the invention 



EXAMPLE 1 

Using the above-described procedures, the following oligonucleotide primers were 
employed in the polymerase chain reaction (PCR) assay in order to clone the ORFs as 
indicated: 



Table 1: Oligonucleotides used for PCR for Examples 2-10 



ORF 


Primer 


Sequence 


Restriction sites 


279 


Forward 
Reverse 


CGCGGATCCCATATG-TTGCCTGCAATCACGATT 
<SEQ ID 3021 > 

CCCGCTCGAG-TTTAGAAGCGGGCGGCAA <SEQ 
ID 3022> 


BamHI-Ndel 
Xhol 


519 


Forward 
Reverse 


CGCGGATCCCATATG-TTCAAATCCTTTGTCGTCA 
<SEQ ID 3023> 

CCCGCTCGAG-TTTGGCGG I I I I GCTGC <SEQ ID 
3024> 


BamHI-Ndel 
Xhol 


576 


Forward 
Reverse 


CGCGGATCCCATATG-GCCGCCCCCGCATCT 
<SEQ ID 3025> 

CCCGCTCGAG-ATTTAC I I I I I I GATGTCGAC 
<SEQ ID 3026> 


BamHI-Ndel 
Xhol 


919 


Forward 
Reverse 


CGCGGATCCCATATG-TGCCAAAGCAAGAGCATC 
<SEQ ID 3027> 

CCCGCTCGAG-CGGGCGGTATTCGGG <SEQ ID 
3028> 


BamHI-Ndel 
Xhol 


121 


Forward 


CGCGGATCCCATATG-GAAACACAGCTTTACAT 
<SEQ ID 3029> 


BamHI-Ndel 
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Reverse 


CCCGCTCGAG-ATAATAATATCCCGCGCCC <SEQ 
ID 3030> 


Xhol 


128 


Forward 
Reverse 


CGCGGATCCCATATG-ACTGACAACGCACT <SEQ 
ID 3031 > 

CCCGCTCGAG-GACCGCGTTGTCGAAA <SEQ ID 
3032> 


BamHI-Ndel 
Xhol 


206 


Forward 
Reverse 


CGCGGATCCCATATG-AAACACCGCCAACCGA 
<SEQ ID 3033> 

C C C G C TC G AG -TTC TG T AAAAAAAGT ATG TG C 
<SEQ ID 3034> 


BamHI-Ndel 
Xhol 


287 


Forward 
Reverse 


CCGGAATTCTAGCTAGC-CTTTCAGCCTGCGGG 
<SEQ ID 3035> 

CCCGCTCGAG-ATCCTGCTC Mill ! GCC <SEQ ID 
3036> 


EcoRI-Nhel 
Xhol 


406 


Forward 
Reverse 


CGCGGATCCCATATG-TGCGGGACACTGACAG 
<SEQ ID 3037> 

CCCGCTCGAG-AGGTTGTCCTTGTCTATG <SEQ 
ID 3038> 


BamHI-Ndel 
Xhol 



Localization of the ORFs 

The following DNA and amino acid sequences are identified by titles of the following 
form: [g, m, or a] [#].[seq or pep], where "g" means a sequence from N. gonorrhoeae, "m" 
means a sequence from N. meningitidis B, and "a" means a sequence from N. meningitidis A\ 

means the number of the sequence; "seq" means a DNA sequence, and "pep" means an 
amino acid sequence. For example, "gOOl.seq" refers to an N. gonorrohoeae DNA sequence, 
number 1. The presence of the suffix to these sequences indicates an additional sequence 
found for the same ORF, thus, data for an ORF having both an unsuffixed and a suffixed 
sequence designation applies to both such designated sequences. Further, open reading 
frames are identified as ORF #, where "#" means the number of the ORF, corresponding to 
the number of the sequence which encodes the ORF, and the ORF designations may be 
suffixed with M .ng" or ".a", indicating that the ORF corresponds to a N. gonorrhoeae sequence 
or a N. meningitidis A sequence, respectively. The word "partial" before a sequence indicates 
that the sequence may be a partial or a complete ORF. Computer analysis was performed for 
the comparisons that follow between "g", "m", and "a" peptide sequences; and therein the 
"pep" suffix is implied where not expressly stated. Further, in the event of a conflict between 
the text immediately preceding and describing which sequences are being compared, and the 
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designated sequences being compared, the designated sequence controls and is the actual 
sequence being compared. 



ORF: contig: 
279 gnm4.seq 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3039>: 



m279.seq 














ATAACGCGGA 


TTTGCGGCTG 


CTTGATTTCA 


ACGGTTTTCA 


GGGCTTCGGC 


51 


AAGTTTGTCG 


GCGGCGGGTT 


TCATCAGGCT 


GCAATGGGAA 


GGTACGGACA 


101 


CGGGCAGCGG 


CAGGGCGCGT 


TTGGCACCGG 


CTTCTTTGGC 


GGCAGCCATG 


151 


GCGCGTCCGA 


CGGCGGCGGC 


GTTGCCTGCA 


ATCACGATTT 


GTCCGGGTGA 


201 


GTTGAAGTTG 


ACGGCTTCGA 


CCACTTCGCT 


TTGGGCGGCT 


TCGGCACAAA 


251 


TGGCTTTAAC 


CTGCTCATCT 


TCCAAGCCGA 


G? YTCGCCGC 


CATTGCGCCC 


301 


ACGCCTTGCG 


GTACGGCGGA 


CTGCATCAGT 


TCGGCGCGCA 


GGCGCACGAG 


351 


TTTGACCGCG 


TCGGCAAAAT 


TCAATGCGCC 


GGCGGCA/ ' ■ 


AGTGCGGTGT 


401 


ATTCGCCGAG 


GCTGTGTCCG 


GCAACGGCGG 


CAGGCGTT'i T 


GCCGCCCGCT 


451 


TCTAAATAG 










This corresponds to the amino acid 


sequence <SEQ ID 3040; ORF 279> 


m279.pep 












i 


ITRICGCLIS 


TVFRASASLS 


AAGFIRLQWE 


GTDTGSGRAR 


LAPASLAAAK 


51 


ARPTAAALPA 


ITICPGELKL 


TASTTSLWAA 


SAQMALTCSS 


SKPRIAAIAP 


101 


TPCGTADCIS 


SARRRTSLTA 


SAKFNAPAAT 


SAVYSPRLCP 


ATAAGVLPPA 


151 


SK* 











The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 304 1>: 
g279 . seq 

1 atgacgcgga tttgcggctg cttgatttca acggtttcga gtgtttcggc 

51 aagtttgtcg gcggcgggtt tcatcaggct gcaacgggaa ggaacggata 

101 ccggcagcgg cagggcgcgt ttggccccgg cttctttggc ggcagccacg 

151 gtgcgtccga cggcggcggc gttgcctgca atcacgactt gtccgggcga 

201 gctgaagttg acggcttcga ccacttcgcc ctgtgcggat tcggcacaaa 

251 tctgcctgac ctgttcacct tccaaaccca aaatggccgc cattgcgcct 

301 acgccttgcg gtacggcgga ccgcatcagt tcggcgcgca ggcggacgag 

351 tttgacggca ccggcaaaac ccaatgcttc ggcggcgaca agcgcggtgt 

401 attcgccgag gctgtgtccg gcaacggcgg caggcgtttt gccgcccact 

451 tccaaatag 

This corresponds to the amino acid sequence <SEQ ID 3042; ORF 279. ng>: 

g279 . pep 

1 MTRICGCLIS TVLSVSASLS AAGFIRLQWE GTDTGSGRAR LAPASLAAAM 

51 VRPTAAA LPA ITTCPGELKL TASTTSPCAD SAQICLTCSS 5KPKMAATAP 

101 TPCGTADCIS SARRRTSLTA SAKSNASAAT SAVYSPRLCP ATAAGVLPPT 

151 SK* 



ORF 279 shows 89.5% identity over a 152 aa overlap with a predicted ORF (ORF 279.ng) 
from N. gonorrhoeae: 

10 20 30 40 50 60 

m2 79 .pep ITRICGCLISWFRASA5LSAAGFIRLQWEGTDTGSGRARLAPASLAAAMARPTAAALPA 

: i: !M! i,. :IMIIIMIIIIIIIMMIIMIIIIIMMill:|M!MI|| 

92 79 MTRICGCLISTVLSVSASLSAAGFIRLQWEGTDTGSGRARIAPASLAAAMVRPTAAALPA 
10 20 30 40 50 60 
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70 80 90 100 110 120 

m279 .pep ITIC?GELKLTASTTSLWAAS:.^iMALTCSSSKPRIAAIAPTPCGTADCISSARRRTSLTA 
II MllliMIIIM I llh IllllllhHIIIIIIIIMMIIIIIIIIIIII 
g2 7S ITTCPGELKLTASTTSPCADSAOICLTCSSSKPKMAAIAPTPCGTADCISSARRRTSLTA 

70 80 90 100 110 120 



130 140 150 

m2 79 . Pep SAKFNAPAATS AVYS PRLCPATAAGVLPPAS KX 

III II IIIIIMIIIIIIIIIIIIIIhlll 

g 2 7 9 S AKSNAS AATS AV YS P RLC P AT AAGVLP PTS KX 

130 140 150 



The following partial DNA sequence was identified in N, meningitidis <SEQ ID 3043>: 

a279 . seq 

1 ATGACNCNGA TTTGCGGCTG CTTGATTTCA ACGGTTTNNA GGGCTTCGGC 

51 GAGTTTGTCG GCGGCGGGTT TCATGAGGCT GCAATGGGAA GGTACNGACA 

101 CNGGCAGCGG CAGGGCGCGT TTGGCGCCGG CTTCTTTGGC GGCAAGCATA 

151 GCGCGCTCGA CGGCGGCGGC ATTGCCTGCA ATCACGACTT GTCCGGGCGA 

201 GTTGAAGTTG ACGGCTTCAA CCACTTCATC CTGTGCGGAT TCGGCGCAAA 

2 51 TTTGTTTTAC CTGTTCATCT TCCAAGCCGA GAATCGCCGC CATTGCGCCC 
301 ACGCCTTGCG GTACGGCGGA CTGCATCAGT TCGGCGCGCA NGCGCACGAG 

3 51 TTTGACCGCG TCGGCAAAAT CCAATGCGCC GGCGGCAACN AGTGCGGTGT 

4 01 ATTCGCCGAN GCTGTGTCCG GCAACGGCGG CAGGCGTTTT GCCGCCCGCT 
4 51 TCCGAATAG 

This corresponds to the amino acid sequence <SEQ ID 3044; ORF 279.a>: 

a27 9 .pep 

1 MTXICGCLIS TVXRASASLS AAGFMRLQWE GTDTGSGRAR LAPASLAA5I 
51 ARSTAAALPA ITTCPGELKL TASTTSSCAD SAQICFTCSS SKPRIAAIA? 
101 TPCGTADCIS SARXRTSLTA SAKSNAPAAT SAVYSPXLCP ATAAGVLPPA 
151 SE* 

m279/a279 ORFs 279 and 279.a showed a 88.2% identity in 152 aa overlap 

10 20 30 40 50 60 

m27 9 . pep ITRICGCLISTVFRASASLSAAGFIRLQWEGTDTGSGRARLAPASLAAAMARPTAAALPA 

: I I I I I I I i i I I I I I I I I I I I I : I I I i ! I I I I f I I I I I II I II I I I : : ! I Mil 
a27 9 MTXICGCLISTVXRASASLSAAGFMRLQWEGTDTGSGRARLAPASLAASIARSTAAALPA 

10 20 30 40 50 60 



70 80 90 100 110 120 

m27 9 . pep ITICPGELKLTASTTSLWAASAQMALTCSSSKPRIAAIAPTPCGTADCISSARRRTSLTA 
II I I I I I I I I I I I I I I Mi: : I I I I 1 I I I I M I t I I ! i M I I ! II I i I I I I I I! 
a27 9 I TTCPGELKLTASTTS SCADS AQICFTCSSSKPR I AAI APT PCGTADC IS SARXRTSLTA 

70 80 90 100 110 120 



130 140 150 

m2 7 9.pep SAKFNAPAATSAVYSPRLCP AT AAGVLP PASKX 

ill I M I I I I I I I I I II I I I I I I I ! M I I : I 
a27 9 SAKSNAPAAT SAVYSPXLCPATAAGVLPPASEX 

130 140 150 



519 and 519-1 gnm7.seq 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3045>: 

m519.seq (partial) 

1 . . TCCGTTATCG GGCGTATGGA GTTGGACAAA ACGTTTGAAG AACGCGACGA 

51 AATCAACAGT ACTGTTGTTG CGGCTTTGGA CGAGGCGGCC GGGgCTTgGG 

101 GTGTGAAGGT TTTGCGTTAT GAGATTAAAG ACTTGGTTCC GCCGCAAGAA 

151 ATCCTTCGCT CAATGCAGGC GCAAATTACT GCCGAACGCG AAAAACGCGC 

201 CCGTATCGCC GAATCCGAAG GTCGTAAAAT CGAACAAATC AACCTTGCCA 
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2 51 GTGGTCAGCG CGAAGCCGAA ATCCAACAAT CCGAAGGCGA GGCTCAGGCT 

3 01 GCGGTCAATG CGTCAAATGC CGAGAAAATC GCCCGCATCA ACCGCGCCAA 

3 51 AGGTGAAGCG GAATCCTTGC GCCTTGTTGC CGAAGCCAAT GCCGAAGCCA 

4 01 TCCGTCAAAT TGCCGCCGCC CTTCAAACCC AAGGCGGTGC GGATGCGGTC 
4 51 AATCTGAAGA TTGCGGAACA ATACGTCGCT GCGTTCAACA ATCTTGCCAA 
501 AGAAAGCAAT ACGCTGATTA TGCCCGCCAA TGTTGCCGAC ATCGGCAGCC 
551 TGATTTCTGC CGGTATGAAA ATTATCGACA GCAGCAAAAC CGCCAAaTAA 

This corresponds to the amino acid sequence <SEQ ID 3046; ORF 519>: 

m519.pep (partial) 

1 . . SVIGRMELDK TFEERDEINS TWAALDEAA GAWGVKVLRY EIKDLVPPQE 

51 ILRSMQAQIT AEREKRARIA ESEGRKIEQI NLASGQREAE IQQSEGEAQA 

101 AVNASNAEKI ARINRAKGEA ESLRLVAEAN AEAIRQIAAA LQTQGGADAV 

151 NLKIAEQYVA AFNNLAKESN TLIMPANVAD IGSLISAGMK IIDSSKTAK* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3047>: 

g519 . seq 

1 atggaatttt tcattatctt gttggcagcc gt^gccgttt tcggcttcaa 

51 atcctttgtc gtcacccccc agcaggaagt ccacgttgtc gaaaggcncg 

101 ggcgtttcca tcgcgccctg acggccggtt tgaatatttt gattcccttt 

151 atcgaccgcg tcgcctaccg ccattcgctg aaagaaatcc ctttagacgt 

201 acccagccag gtctgcatca cgcgcgataa tacgcaaccg actgttgacg 

251 gcatcatcta tttccaagta accgatccca aactcgcctc atacggtccg 

301 agcaactaca ttatggcaat tacccagctt gcccaaacga cgctgcgttc 

3 51 cgttatcggg cgtatggagt tggacaaaac gttcgaagaa cgcgacgaaa 

401 tcaacagtac cgtcgtctcc gccctcgatg aagccgccgg ggcttggggt 

451 gtgaaagtcc tccgttacga aatcaaggat ttggctccgc cgcaagaaat 

501 ccttcgcgca atgcaggcac aaattaccgc cgaacgcgaa aaacgcgccc 

551 gtattgccga atccgaaggc cgtaaaatcg aacaaancaa ccttgccagt 

601 ggtcagcgtg aagccgaaat ccaacaatcc gaaggcgagg ctcaggccgc 

651 ggtcaatgcg tccaatgccg agaaaatcgc ccgcatcaac cgcgccaaag 

701 gcgaagcgga atccctgcgc cttgttgccg aagccaargc cgaagccaac 

751 cgtcaaattg ccgccgcccc tcaaacccaa agcggggcgg atgcggtcaa 

801 tctgaagatt gcgggacaat acgttaccgc gttcaaaaat cttgccaaag 

851 aagacaatac gcggattaag cccgccaagg ttgccgaaat cgggaaccct 

901 aattttcggc ggcatgaaaa attttcgcca gaagcaaaaa cggccaaata 

951 a 

This corresponds to the amino acid sequence <SEQ ID 3048; ORF 519.ng>: 

g519 .pep 

1 MEFFI ILLAA VAVFG FKSFV VIPQQEVHW ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGI I YFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTV^/3 ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRA MQAQITAERE KRARIAESEG RKIEQINLAS 

2 01 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAN 
251 RQ1AAALQTQ SGADAVNLKI AGQYVTAFKN LAKEDNTRIK PAKVAEIGNP 

3 01 NFRRHEKFSP EAKTAK* 

ORF 519 shows 87.5% identity over a 200 aa overlap with a predicted ORF (ORF 519.ng) 
from N. gonorrhoeae: 

m519/g519 

10 20 30 

m519.pep SVIGRMELDKTFEERBEINSTWAALDEAA 

iiminiiiiMiiiMiMhiiiiii 

g5 1 9 YFOVTDPKLASYGSSNYIMAITQLAQTTLRSVIGRMELDKTFEERDEINSTWSALDEAA 
90 100 110 120 130 140 

40 50 60 70 80 90 

m519.pep GAWGVKVLRYEIKDLVPFQEILRSMQAQITAEREKRARIAESEGRKIEQINLASGQREAE 
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g 519 GAWGVKVLRYEIKDLVPPQEILRAMQAQITAEREKRARIAESEGRKI EQINLASGQREAE 

150 160 170 180 190 200 . 



100 110 120 130 140 150 

m519 .pep IQQS EGEAQAAVNASNAEKI ARINRAKGEAES LRIjVAEANAEAI RQI AAALQTOGGADAV 

IIIIMIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIMIIIi MlllllllhlllM 

g 519 IQQSEGEAQAAVNASNAEKIARINRAKGEAESLRLVAEANAEANRQIAAALQTQSGADAV 
210 220 230 240 250 260 



160 170 180 190 200 

m519 .pep NLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL- ISAGMKI IDSSKTAK 

IINI iihlhllllMI I ! -i ■ ; : 

g519 NLKIAGQYVTAFKNLAKEDNTRI KPAKVAEIGNPNFRRHEKFSPEAKTAK 

270 280 290 300 310 



ihe following partial DNA sequence was identified in N. meningitidis <SEQ ID 3049>: 

a519 . seq 

1 ATGGAATTTT TCATTATCTT GCTGGCAGCC GTCGTTGTTT TCGGCTTCAA 

51 ATCCTTTGTT GTCATCCCAC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

251 GTATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAAT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

4 01 TCAACAGCAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG AGCTTGGGGT 

451 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC TGAACGCGAA AAACGCGCCC 

551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA CCTTGCCAGT 

601 GGTCAGCGCG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

701 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

7 51 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGGTCAA 

801 _ TCTGAAGATT GCGGAACAAT ACGTCGCCGC GTTCAACAAT CTTGCCAAAG 

851 ' AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 

901 ATTTCTGCCG GTATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3050; ORP 519.a>: 

a519 . pep 

1 MEFFIILLAA VWFG FKSFV VIPQQEVHVV ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGI I YFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKT FEE RDEINSTWS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 

m519/a519 ORFs 519 and 519. a showed a 99.5% identity in 199 aa overlap 



10 20 30 

m519 . pep SVIGRMELDKTFEERDEINSTWAALDEAA 

I I I i 1 I I I I I I 1 I ! I I I i I I I I I : I I I! ! I 
a519 YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIGRMELDKT FEERDEINSTVVSALDEAA 

90 100 110 120 130 140 

40 50 60 70 80 90 

m5 1 9 . pep GAWGVKVLRYE IKDLVPPQE I LRSMQAQITAEREKRARIAESEGRKI EQINLASGQREAE 

I I I I I I I f I I I ! I I i I I I i I I I I I i I 1 I I I I I t I I I M I I! ! I ! I I I ! ! I M I i I I I I I ! 
a519 GAWGVKVLRYE IKDLVPPQE I LRSMQAQITAEREKRARIAESEGRKI EQINLASGQREAE 

150 160 170 180 190 200 



100 110 120 130 140 150 
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m519.pep I0QSEGEAQAAVI3ASNAEKIARINRAKGEAESLRLVAEANAEAIRQIAAALQTQGGADAV 
I M i I i [ I I I I I 1 I ! ! I I I I I j I | | | I M I II I [ I I t I I II I I I I I I | ! i i M M M If I 
a519 IQQSEGEAQAAVNASNAEKIARINRAKGEAESLRLVAEANAEAIRQIAAALQTQGGADAV 
210 220 230 240 250 260 

160 170 180 190 200 

m519. pep NLKIAEQYVAAFNNLAKESNTLIMPANVADIGSLI SAGMKI I DSSKTAKX 

M M I I I I I I t I I M I I I i i I I I I I I I ! | M | | | I | | | | | | | | | j j | | | | 
a 51 9 N L K I AE Q Y V AAFN N LAKE S N T L I M P AN V A D I G S L I S AGMK 1 1 D S S KT AK v 

270 230 290 300 310 



Further work revealed the DNA sequence identified in A', meningitidis <SEQ ID 305 1>: 

m519-l . seq 

1 ATGGAATTTT TCATTATCTT GTTGGTAGCC GTCGCCGTTT TCGGTTTCAA 

51 ATCCTTTGTT GTCATCCCAC AACAGGAAGT CCACGTTGTC GAAAGGCTGG 

101 GGCGTTTCCA TCGCGCCCTG ACGGcCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

2 01 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

251 GCATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAGT TGGACAAAAC GT^GAAGAA CGCGACGAAA 

4 01 TCAACAGTAC TGTTGTTGCG GCTTTGGACG AG^C^CCCG GGCTTGGGGT 

4 51 GTGAAGGTTT TGCGTTATGA GATTAAAGAC TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCTCA ATGCAGGCGC AAATTACTGC CGAACGCGAA AAACGCGCCC 

551 GTATCGCCGA ATCCGAAGGT CGTAAAATCG AACAAATCAA GCTTGCCAGT 

601 GGTCAGCGCG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCAAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

701 GTGAAGCGGA ATCCTTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

7 51 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGTGCGG ATGCGGTCAA 

801 TCTGAAGATT GCGGAACAAT ACGTCGCTGC GTTCAACAAT CTTGCCAAAG 

851 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 

901 ATTTCTGCCG GTATGAAAAT TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3052; ORF 519-1>; 

m519-l. 

1 MEFFI ILLVA VAVFG FKSFV VIPQQEVHVV ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGI IYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEINSTVVA ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK* 

The following DNA sequence was identified in N. gonorrhoeae <SEQ ID 3053>: 

g519-l . seq 

1 ATGGAATTTT TCATTATCTT GTTGGCAGCC GTCGCCGTTT TCGGCTTCAA 

51 ATCCTTTGTC GTCATCCCCC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGATAA TACGCAATTG ACTGTTGACG 

2 51 GCATCATCTA TTTCCAAGTA ACCGATCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCAAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAGT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 

4 01 TCAACAGTAC CGTCGTCTCC GCCCTCGATG AAGCCGCCGG GGCTTGGGGT 

4 51 GTGAAAGTCC TCCGTTACGA AATCAAGGAT TTGGTTCCGC CGCAAGAAAT 

501 CCTTCGCGCA ATGCAGGCAC AAATTACCGC CGAACGCGAA AAACGCGCCC 

551 GTATTGCCGA ATCCGAAGGC CGTAAAATCG AACAAATCAA CCTTGCCAG^ 

601 GGTCAGCGTG AAGCCGAAAT CCAACAATCC GAAGGCGAGG CTCAGGCTGC 

651 GGTCAATGCG TCCAATGCCG AGAAAATCGC CCGCATCAAC CGCGCCAAAG 

701 GCGAAGCGGA ATCCCTGCGC CTTGTTGCCG AAGCCAATGC CGAAGCCATC 

7 51 CGTCAAATTG CCGCCGCCCT TCAAACCCAA GGCGGGGCGG ATGCGGTCAA 

801 TCTGAAGATT GCGGAACAAT ACGTAGCCGC GTTCAACAAT CTTGCCAAAG 
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8 51 AAAGCAATAC GCTGATTATG CCCGCCAATG TTGCCGACAT CGGCAGCCTG 
901 ATTTCTGCCG GCATGAA/_\T TATCGACAGC AGCAAAACCG CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 3054; ORF 519-l.ng>: 

g519-l .pep 

1 MEFFIILLAA VAVFG FKSFV VIPQQEVHVV ERLGRFHRAL TAG LN I LI PF 

51 IDRVAYRF3L KEI PLDVPSQ VCITRDNTQL TVDGI IYFQV TDPKLASYGS 

101 SNYIMAITQL AQTTLRSVIG RMELDKTFEE RDEIMSTVVS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRA MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN LAKESNTLIM PANVADIGSL 

301 ISAGMKIIDS SKTAK*" 



m519-l/g5l9-l ORFs 519-1 and 519-1. ng showed a 99.0% identity in 315 aa 
overlap 

10 20 30 40 50 60 

g519-l .pep MEFFIILLAAVAVFG FKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
I I I I M I i : I M I I I I M ! I I I I I I I! t I I I ! I i I I M I i I I I I I II I ! I I I I If M M I 
m519-l MEFFIILLVAVAVFGFKSFWIPQQEVHVVERLGRFHRALTAGLNILIPFIDRVAYRHSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

g519-l .pep KEIPLDVPSQVCITRDNTQLTVDGI I YFQVTDPK1ASYGSSNYIMAITQLAQTTLRSVIG 
I it I I 1 I I I i i I! 1 I I ! ! I I I I I I i t i M I I I I I I I I I I i i 1 I I II I I I I I I I I II I ! I t 
m519-l KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 

70 80 90 100 110 120 

130 140 150 160 170 180 

g519-i .pep RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRAMQAQITAERE 
I I I I I I (I I I I I I I I I I I I : j IS i II I I M I I I I M i II I I I I I i I I I I : I I I I I I I I I I 
m519-l RMELDKTFEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 

130 140 150 160 170 180 

190 200 210 220 230 240 

g519-l .pep KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
I I I i I I I i I I 1 I II I I I I i I ! I I I I II I I I I I I I I t I M II 1 I I I 1 I I I I II I i I M I I I 
m519-l KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
190 200 210 220 230 240 

250 260 270 280 290 300 

g519-l .pep LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIiMPANVADIGSL 
I I I I I I M I ! ! M I I I I i I II I I I I I I I I II I I I I 1 I I I I i I I I I i | I | I I | | ! i | | | | | 
m519-l LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
250 260 270 280 290 300 

310 

g5 19-1 .pep I SAGMKI I DSSKTAKX 
M II I I I I I I II i I I I 
m5 1 9- 1 I SAGMKI I DSSKTAKX 

310 

The following DNA sequence was identified in N. meningitidis <SEQ ID 3055>: 

a519-l . seq 

1 ATGGAATTTT TCATTATCTT GCTGGCAGCC GTCGTTGTTT TCGGCTTCAA 

51 ATCCTTTGTT GTCATCCCAC AGCAGGAAGT CCACGTTGTC GAAAGGCTCG 

101 GGCGTTTCCA TCGCGCCCTG ACGGCCGGTT TGAATATTTT GATTCCCTTT 

151 ATCGACCGCG TCGCCTACCG CCATTCGCTG AAAGAAATCC CTTTAGACGT 

201 ACCCAGCCAG GTCTGCATCA CGCGCGACAA TACGCAGCTG ACTGTTGACG 

251 GTATCATCTA TTTCCAAGTA ACCGACCCCA AACTCGCCTC ATACGGTTCG 

301 AGCAACTACA TTATGGCGAT TACCCAGCTT GCCCAAACGA CGCTGCGTTC 

351 CGTTATCGGG CGTATGGAAT TGGACAAAAC GTTTGAAGAA CGCGACGAAA 
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4 01 TCAACAGCAC 

4 51 GTGAAGGTTT 

501 CCTTCGCTCA 

551 GTATCGCCGA 

601 GGTCAGCGCG 

651 GGTCAATGCG 

701 GTGAAGCGGA 

7 51 CGTCAAATTG 

801 TCTGAAGATT 

6 51 AAAGCAATAC 

901 ATTTCTGCCG 



CGTCGTCTCC 
TGCGTTATGA 
ATGCAGGCGC 
ATCCGAAGGT 
AAGCCGAAAT 
TCAAATGCCG 
ATCCTTGCGC 
CCGCCGCCCT 
GCGGAACAAT 
GCTGATTATG 
GTATGAAAAT 



GCCCTCGATG 
GATTAAAGAC 
AAATTACTGC 
CGTAAAATCG 
CCAACAATCC 
AGAAAATCGC 
CTTGTTGCCG 
TCAAACCCAA 
ACGTCGCCGC 
CCCGCCAATG 
TATCGACAGC 



AAGCCGCCGG 
TTGGTTCCGC 
TGAACGCGAA 
AACAAATCAA 
GAAGGCGAGG 
CCGCATCAAC 
AAGCCAATGC 
GGCGGTGCGG 
GTTCAACAAT 
TTGCCGACAT 
AGCAAAACCG 



AGCTTGGGGT 
CGCAAGAAAT 
AAACGCGCCC 
CCTTGCCAGT 
CTCAGGCTGC 
CGCGCCAAAG 
CGAAGCCATC 
ATGCGGTCAA 
CTTGCCAAAG 
CGGCAGCCTG 
CCAAATAA 



This corresponds to the amino acid sequence <SEQ ID 3056; ORF 519-1 .a>: 

a519-l .pep. 

1 MEFFIILLAA WVFG FKSFV VIPQQEVHVV ERLGRFHRAL TAGLNILIPF 

51 IDRVAYRHSL KEIPLDVPSQ VCITRDNTQL TVDGI IYFQV TDPKLASYGS 

101 SMYIMAITQL AQTTLRSVIG RMELDKTFEE RDEIN3TVVS ALDEAAGAWG 

151 VKVLRYEIKD LVPPQEILRS MQAQITAERE KRARIAESEG RKIEQINLAS 

201 GQREAEIQQS EGEAQAAVNA SNAEKIARIN RAKGEAESLR LVAEANAEAI 

251 RQIAAALQTQ GGADAVNLKI AEQYVAAFNN 1..KESNTLIM PANVADIGSL 
301 ISAGMKIIDS SKTAK* 

m519-l/a519-l ORFs 519-1 and 519-1. a showed a 99.0% identity in 315 aa 

overlap 



10 20 30 40 50 60 

a519-l .pep MEFFIILLAAVVVFGFKSFVVI PQQEVKVVERLGRFKRALTAGLNILIPFIDRVAYRKSL 
I I M I t II : I 1 : I I I I t I II I i I I I I I I I I I i M I I I i I I 1 i I I i I I I I I ! M I M I I I I 
m5l9-l MEFFI ILLVAVAVFG FKSFWI PQQEVHVVERLGRFHRALTAGLNILIPFI DRVAYRHSL 

10 20 30 40 50 60 



70 80 90 100 110 120 

a519-l .pep KEI PLDVPSQVCITRDNTQLTVDGII YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
I I I i I t i I I I 1 f t I I I i I I I I I I ! I I I I M I I ! I I I I 1 M I I I 1 I I I i I i [ It I t I I I I ! 
m519-l KEI PLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVTG 

70 80 90 100 110 120 



130 140 150 160 170 180 

a519-l.pep RMELDKT FEERDEINSTVVSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
i I I I I t II I I I t II M I I I : i I I I I I ! I I I I I M I ! I l i I ! I I I I I i I i I I II I I I M I i 
m519-l RMELDKTFEERDEINSTVVAALDEAAGAWGVKVLRYE1KDLVPPQEILRSMQAQITAERE 

130 140 150 160 170 180 



190 200 210 220 230 240 

a519-l - pep KRARIAESEGRKIEOIHLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
I M I I t I II i I I I I I 11 I I I I I M M I I I I i I I I I I ! I I I I I II I I I 1 I I I I ! I I I N I | 
m519-l KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 

190 200 210 220 230 240 



250 260 270 280 290 300 

a519-l . pep LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I I I I I I I I I I I I I M i II I I I I I I I I I 1 M I I I I I I ! I 1 I I I M | M | I I | ! I | | | | | | i 
m519-l LVAEANAEAI RQIAAALQTQGGADAVNLK I AEQYVAAFNN LAKE SNTL I M PAN VAD I GSL 

250 260 270 280 290 300 



310 

a519-l .pep I SAGMKI IDSSKTAKX 
I I I I i M I I I I I II I I 
m5 1 9- 1 I SAGMKI I DSSKTAKX 

310 
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576 and 576-1 gnm22.seq 



The following partial DNA sequence was identified in A', meningitidis <SEQ ID 3057>: 

m576.seq.. (partial) 

I . .ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

51 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

101 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

151 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

201 AGAAAAACAC AAGGCGGACG CGAAGGCCAA T AAAG AAAAA GGCGAAGCCT 

251 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

301 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

351 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

4 01 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

451 GTGATTCCGG GTTGGACCGA AGgCGTACAG CTTCTGAAAG AAGGCGGCGA 

501 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

551 GCGACAAAAT CGGTCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

601 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

651 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3058; ORF 576>: 

ra576.pep. . {partial} 

1 . . MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 
51 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 
101 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

151 VIPGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 
201 KIGAPENAPA KQPAQVDIKK VN* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3059>: 

g576.seq. . (partial) 

1 . .atgggcgtgg acatcggacg ctccctgaaa caaatgaagg aacagggcgc 

51 ggaaatcgat ttgaaagtct ttaccgatgc catgcaggca gtgtatgacg 

101 gcaaagaaat caaaatgacc gaagagcagg cccaggaagt gatgatgaaa 

151 ttcctgcagg agcagcaggc taaagccgta gaaaaacaca aggcggatgc 

201 gaaggccaac aaagaaaaag gcgaagcctt cctgaaggaa aatgccgccg 

251 aagacggcgt gaagaccact gcttccggtc tgcagtacaa aatcaccaaa 

301 cagggtgaag gcaaacagcc gacaaaagac gacatcgtta ccgtggaata 

351 cgaaggccgc ctgattgacg gtaccgtatt cgacagcagc aaagccaacg 

401 gcggcccggc caccctcccc ttgagccaag tgattccggg ttqgaccgaa 

451 ggcgtacggc ttctgaaaga aggcggcgaa gccacgttct acatcccgtc 

501 caaccttgcc taccgcgaac agggtgcggg cgaaaaaatc ggtccgaacg 

551 ccactttggt atttgacgtg aaactggtca aaatcggcgc acccgaaaac 

601 gcgcccgcca agcagccgga tcaagtcgac atcaaaaaag taaattaa 

This corresponds to the amino acid sequence <SEQ ID 3060; ORF 576.ng>: 

g576.pep. . (partial) 

1 . . MGVDIGRSLK QMKEQGAEID LKV FTDAMQA VYDGKEIKMT EEQAQEVMMK 

51 FLQEQQAKAV EKHKADAKAN KEKGEAFLKE NAAEDGVKTT ASGLQYKITK 

101 QGEGKQPTKD DIVTVEYEGR LIDGTVFDSS KANGGPATFP LSQVIPGWTE 

151 GVRLLKEGGE AT FYIPSNLA YREQGAGEKI GPNAT LVFDV KLVKIGAPEN 
201 APAKQPDQVD IKKVN* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

m576/g576 ORFs 576 and 576. ng showed a 97.2% identity in 215 aa overlap 

10 20 30 40 50 60 

m57 6 . pep MQQASYAMGVDIGRSLKQMKEQGAEIDLKV FTEAMQAVYDGKEIKMTEEQAQEVMMKFLQ 

E M I I I I I I M I I I I | | I S M ! I I I : ! I I M I I ! [ i i ! i i | | j I | M 1 I 1 I I S 
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g 5 7 6 MGVDIGRSLKQMKE QG AE I D L KY FT DAMQAVY DGKE I KMTEEQAQE VMM KFLQ 

10 20 30 40 50 

70 80 90 100 110 120 

m57 6 . pep EQQAKAVEKHKADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIV 
I I I I I I I f I i I I I I i I I I I I I i 1 I 1 I i M i : I ! I I I I I I I I I I I I I I I I [ I I I I I It ! I I 
g57 6 EQQAKAVEKKKADAKANKEKGEAFLKENAAEDGVKTTASGLQYKITKQGEGKQPTKDDIV 
60 70 60 90 100 110 

130 140 150 160 170 180 

m57 6 . pep TVEYEGRLIDGTVFDSSKANGGPVTFPLSQVI PGWTEGVQLLKEGGEATFYI PSNLAYRE 

I I I f I I ! I I I I II i II I I I I I I I : I I I I I I I I I I I I i I I : I I I I I I i I I I I I I I f M t I I 
g 57 6 TVEYEGRLIDGTVFDSSKANGGPATFPLSQVI PGWTEGVRLLKEGGEATFYI PSNLAYRE 

120 130 140 150 160 170 

190 200 210 220 

m57 6 . pep QGAGDKIGPNATLVFDVKLVKIGAPENAPAKQPAOVDIKKVNX 
I I t 1 : I ! ( I I I t t I ! M 1 It I I I t I I I I I I I 1 I ! I I I I I M ! 
g57 6 QGAGEKIGPNATLVFDVKLVKIGAPENAPAKQPDQVDIKKVNX 
180 190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 306 1>: 

a576. seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACJCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

2 51 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

4 01 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

4 51 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

601 GTGATTCTGG GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

701 GCGACAAAAT CGGCCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

7 51 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3062; ORF 576. a>: 

a57 6 . pep 

1 MNTI FKISAL TLSAALALSA CGKKEAAPAS ASEPAAASSA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

2 01 VILGWTEGVQ LLKEGGEATF YI PSNLAYRE QGAGDKIGPN ATLVFDVKLV 

2 51 KIGAPENAPA KQPAQVDIKK VN + 

n\576/a576 ORFs 576 and 57 6. a showed a 99.5% identity in 222 aa overlap 

10 20 30 

m57 6 . pep MQQAS YAMGVDI GRSLKQMKEQGAE I DLKV 

1 1 1 1 I M I I t I I I I 1 1 j I I I I I I 1 1 1 1 1 1 I 
a57 6 CGKKEAAPASASEPAAASSAQGDTSSIGSTMQQASYAMGVDIGRSLKQMKEQGAEIDLKV 
30 40 50 60 70 80 

40 50 60 70 80 90 

m576 . pep FTEAMQAVYDGKE I KMTEEQAQE VMMKFLQEQQAKAVEKHKADAKANKEKGEAFLKENAA 

i I ! I I I I i I M II I I i I I I I I I I I ! i I I I I I I I I II I I I I I I I I [ I I I I I I I I II I I I I I 
a57 6 FTEAMQAVYDGKE I KMTEEQAQEVMMKFLQEQQAKAVEKHKADAKANKEKGEAFLKENAA 

90 100 110 120 130 140 
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100 110 120 130 140 150 

m57 6.pep KDGVKTTASGLQYL- 1 KQGEGKQPTKDDI VTVEYEGRLI DGTVFDS5KANGGPVTFPLSQ 

I ! I! I I I ! I I I I I I I I I I I I I ! I I I I I I I I I i f I I I I I I ! M I I I! I I I I I I I I I II I I I 
a 57 6 KDGVKTTASGLQYK I TKQGEGKQPTKDD I VTVEYEGRLI DGTVFDSSKANGG PVTFPLSQ 

150 160 170 130 190 200 



160 170 ISO 190 200 210 

m57 6 . pep VI PGWTEGVQLLKEGGEATFYI PSNLAYREQGAGDKIGPNATLVFDVKLVKIGAPENAPA 

II M I M I i I I I I M ! ! I I I 11 I I I I I I i I I I I f ! I ! I I I I I I I I I ! f I I I 1 ! ! ! M i I 
a 57 6 VILGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPNATLVFDVKLVKIGAPENAPA 

210 220 230 240 250 260 



220 

m57 6 . pep KQPAQVDIKKVNX 
I! I I I I I I I I I i I 
a 57 6 KQPAQVDIKKVNX 
270 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 3063>: 

m576-l . seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

401 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 

4 51 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 

601 GTGATTCCGG GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 

701 GCGACAAAAT CGGTCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 

7 51 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 

801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3064; ORF 576-l>: 

m576- 1 . pep 

1 MNTI FKISAL TLSAALALS A CGKKEAAPAS ASEPAAASSA QGDTSS IGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKE IKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

201 VIPGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATLVFDVKLV 

251 KIGAPENAPA KQPAQVDIKK VN* 

The following DNA sequence was identified in N. gonorrhoeae <SEQ ID 3065>: 

g576-l . seq 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTGCCGCG CAGGGCGACA CCTCTTCAAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC AATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 ACAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGATG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCCCAGGAAG TGATGATGAA ATTCCTGCAG GAGCAGCAGG CTAAAGCCGT 

351 AGAAAAACAC AAGGCGGATG CGAAGGCCAA CAAAGAAAAA GGCGAAGCCT 

4 01 TCCTGAAGGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGT 

4 51 CTGCAGTACA AAATCACCAA ACAGGGTGAA GGCAAACAGC CGACAAAAGA 

501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACCGTAT 

551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG CCACCTTCCC TTTGAGCCAA 

601 GTGATTCCGG GTTGGACCGA AGGCGTACGG CTTCTGAAAG AAGGCGGCGA 

651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 
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7 01 GCGAAAAAAT CGGTCCGAAC GCCACTTTGG TATTTGACGT GAAACTGGTC 
7 51 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG ATCAAGTCGA 
SOI CATC AAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3066; ORF 576-1 .ng>: 

g576-l . pep 

1 MNTI FKISAL TLSAALALS A CGKKEAAPAS ASEPAAASAA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTDAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKK KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPATFPLSQ 

201 VIPGWTEGVR L-LKEGGEATF YIP5NLAYRE QGAGEKIGPN ATLVFDVKLV 

251 KIGAFENAPA KQPDQVDIKK VN* 



g576-l/m576-l ORFs 576-1 and 576-1. ng showed a 97.8% identity in 272 

overlap 

10 20 30 40 50 60 

g576-l .pep MNTI FKI SALTLSAALALSACGKKEAAPAS/.^EPAAASAAQGDTSSIGSTMQQAS YAMGV 
! I I i ! I I I I I I ! I I I ! t I I t ! I I I i I I I I I I M M I I i : I I I I I I I I I I f I i I I I I I f I I 
m5 7 6- 1 MNT I FKI SALTLS AALALSACGKKEAAPAS ASE PAAASSAQGDTS S IG STMQQAS YAMGV 

10 20 30 40 50 60 

70 80 90 100 110 120 

g576-l .pep DIGRSLKQMKEQGAEIDLKVFTDAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH 
I i I I I I I I I I I I I! I I I II I ! i : I I I I I I I I i i I I I i If I i i I i I I i I! ! I I i I I ! I t I I 
m57 6-l DIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH 

70 80 90 100 110 120 

130 140 150 160 170 180 

g576-l . pep KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 
M i 1 I M I U I ! I 11 M \ \ I I II 1 t I ! I I 1 I I I U t I M M I I I ! I I I ! I I ! i I t ! t I U 
m57 6-l KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLI D 

130 140 150 160 170 180 

190 200 210 220 230 240 

g 57 6-1 . pep GTVFDSSKANGGPATFPLSQVIPGWTEGVRLLKEGGEATFYIPSNLAYREQGAGEKIGPN 
! I ! I I ! I I I I I I I : I I I I I I I I I I I I t I I : I i I II t I I I M II I I i I i I I I I I I : I I II t 
m57 6-l GTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEATFYIPSNLAYREQGAGDKIGPN 

190 200 210 220 230 240 

250 260 270 

g57 6-l .pep ATLVFDVKLVKIGAPENAPAKQPDQVDIKKVNX 
I i I I I I I I I I I I M I I I I II I I I i 1 I I i I I i I 
m57 6-l ATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX 

250 260 270 

The following DNA sequence was identified in N. meningitidis <SEQ ID 3067>: 

a576-l . sea 

1 ATGAACACCA TTTTCAAAAT CAGCGCACTG ACCCTTTCCG CCGCTTTGGC 

51 ACTTTCCGCC TGCGGCAAAA AAGAAGCCGC CCCCGCATCT GCATCCGAAC 

101 CTGCCGCCGC TTCTTCCGCG CAGGGCGACA CCTCTTCGAT CGGCAGCACG 

151 ATGCAGCAGG CAAGCTATGC GATGGGCGTG GACATCGGAC GCTCCCTGAA 

201 GCAAATGAAG GAACAGGGCG CGGAAATCGA TTTGAAAGTC TTTACCGAAG 

251 CCATGCAGGC AGTGTATGAC GGCAAAGAAA TCAAAATGAC CGAAGAGCAG 

301 GCTCAGGAAG TCATGATGAA ATTCCTTCAG GAACAACAGG CTAAAGCCGT 

3 51 AGAAAAACAC AAGGCGGACG CGAAGGCCAA TAAAGAAAAA GGCGAAGCCT 

4 01 TTCTGAAAGA AAATGCCGCC AAAGACGGCG TGAAGACCAC TGCTTCCGGC 
4 51 CTGCAATACA AAATCACCAA ACAGGGCGAA GGCAAACAGC CGACCAAAGA 
501 CGACATCGTT ACCGTGGAAT ACGAAGGCCG CCTGATTGAC GGTACGGTAT 
551 TCGACAGCAG CAAAGCCAAC GGCGGCCCGG TCACCTTCCC TTTGAGCCAA 
601 GTGATTCTGG GTTGGACCGA AGGCGTACAG CTTCTGAAAG AAGGCGGCGA 
651 AGCCACGTTC TACATCCCGT CCAACCTTGC CTACCGCGAA CAGGGTGCGG 
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701 GCGACAAAAT CGGCCCGAAC GCCACTTTGG TATTTGATGT GAAACTGGTC 
7 51 AAAATCGGCG CACCCGAAAA CGCGCCCGCC AAGCAGCCGG CTCAAGTCGA 
801 CATCAAAAAA GTAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 3068; ORF 576-1. a>: 

a57 6-1 . pep 

1 MNTIFKISAL TLSAALALS A CGKKEAAPAS ASEPAAASSA QGDTSSIGST 

51 MQQASYAMGV DIGRSLKQMK EQGAEIDLKV FTEAMQAVYD GKEIKMTEEQ 

101 AQEVMMKFLQ EQQAKAVEKH KADAKANKEK GEAFLKENAA KDGVKTTASG 

151 LQYKITKQGE GKQPTKDDIV TVEYEGRLID GTVFDSSKAN GGPVTFPLSQ 

201 VILGWTEGVQ LLKEGGEATF YIPSNLAYRE QGAGDKIGPN ATL.VFDVKLV 

251 KIGAPENAPA KQPAQVDIKK VN* 

a576-l/m576-l ORFs 576-1 and 576-1. a showed a 99.6% identity in 272 aa 

overlap 

10 20 30 40 50 60 

a57 6-l .pep MNTIFKISALTLSAALALSACGKKEAAPASASEPAAASSAQGDTSSIGSTMQQASYAHGV 
I I I M I I I I I I I I i ! 1 I 1 I I I I I I I I I I I I I I I II I 1 I I I i I ! I i M I I I I I i I I M | | I 
m5 7 6- 1 MNT I FKI S ALT LSAALALSACGKKEAAPAS AS E PAAAS S AQGDTS S I G STMQQAS YAMGV 

10 20 30 40 50 60 

70 80 90 100 110 120 

a 57 6-1 .pep DIGRSLKQMKEQGAEIDLK\ ? FTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQOAKAVEKH 
! II i ! I I I I I I I I ! I 1 ! I i M ! ! I I M i I M I i I I ! I I I I ! I I ! ! i M 1 i t I I i I I ! 1 I I 
m57 6-l DIGRSLKQMKEQGAEIDLKVFTEAMQAVYDGKEIKMTEEQAQEVMMKFLQEQQAKAVEKH 

70 80 90 100 110 120 

130 140 150 160 170 180 

a57 6-l .pep KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 
I i I M I I I II 1 I II I I I I I 1 I ! 1 II I I i U I M I I I I I I i I 1 I 1! I ! I i I I I I ! I I I I I I 
m57 6-l KADAKANKEKGEAFLKENAAKDGVKTTASGLQYKITKQGEGKQPTKDDIVTVEYEGRLID 

130 140 150 160 170 180 

190 200 210 220 230 240 

a 57 6-1 .pep GTVFDSSKANGGPVTFPLSQVILGWTEGVQLLKEGGEATFYI PSNLAYREQGAGDKIGPN 
I I II I I I I I I II I I M I i I I I I I I I I I I I I I I I I I M I I M I I ! I M I M M I M I I i I 
m57 6-l GTVFDSSKANGGPVTFPLSQVIPGWTEGVQLLKEGGEAT FY I PSNLAYREQGAGDKIGPN 

190 200 210 220 230 240 

250 260 270 

a576-l .pep ATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX 
I I I I I I ! 1 I I I I ! I I f I I I I ! I I I I I I I I I I I I 
m57 6-l ATLVFDVKLVKIGAPENAPAKQPAQVDIKKVNX 
250 260 270 



919 gnm43.seq 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3069>: 

H1919 . seq 

1 ATGAAAAAAT ACCTATTCCG CGCCGCCCTG TACGGCATCG CCGCCGCCAT 

51 CCTCGCCGCC TGCCAAAGCA AGAGCATCCA AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGGCCGG TCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TCGGCGGCGG CGGGGCCGTC TATACCGTTG TACCGCACCT 

201 GTCCCTGCCC CACTGGGCGG CGCAGGATTT CGCCAAAAGC CTGCAATCCT 

251 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

301 TGCGCCCAAG CCTTTCAAAC CCCCGTCCAT TCCTTTCAGG CAAAACAGTT 

3 51 TTTTGAACGC TATTTCACGC CGTGGCAGGT TGCAGGCAAC GGAAGCCTTG 

401 CCGGTACGGT TACCGGCTAT TACGAACCGG TGCTGAAGGG CGACGACAGG 
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451 


CGGACGGCAC 


AAGCCCGCTT 


501 


CTCCGTCCCC 


CTGCCTGCCG 


551 


TCAGGCAGAC 


GGGAAAAAAC 


601 


CATACCGCCG 


ACCTCTCCcG 


651 


CAAAGGCAGG 


TTTGAAGGAA 


701 


AAATCAACGG 


CGGCGCGCTT 


751 


GAAGACCCTG 


TCGAACTTTT 


SOI 


GAAAACCCCG 


TCCGGCAAAT 


851 


AACATCCyTA 


CGTTTCCATC 


901 


AAACTCGGAC 


AAACCTCCAT 


951 


TCCGCAACGC 


CTCGCCGAAG 


1001 


TCCGCGAGCT 


TGCCGGAAGC 


1051 


ACGCCGCTGA 


TGGGGGAATA 


1101 


CTTGGGTGCG 


CCCTTATTTG 


1151 


CCCTCAACCG 


CCTGATTATG 


1201 


GCGGTGCGCG 


TGGATTATTT 


1251 


TGCCGGCAAA 


CAGAAAACCA 


1301 


GTATGAAGCC 


CGAATACCGC 



CCCGATTTAC GGTATTCCCG ACGATTTTAT 
GTTTGCGGAG CGGAAAAGCC CTTGTCCGCA 
AGCGGCACAA TCGACAATAC CGGCGGCACA 
ATTCCCCATC ACCGCGCGCA CAACAGCAAT 
GCCGCTTCCT CCCCTACCAC ACGCGCAACC 
GACGGCAAAG CCCCGATACT CGGTTACGCC 
TTTTATGCAC ATCCAAGGCT CGGGCCGTCT 
ACATCCGCAT CGGCTATGCC GACAAAAACG 
GGACGCTATA TGGCGGATAA GGGCTACCTC 
GCAGGGCATT AAGTCTTATA TGCGGCAAAA 
TTTTGGGTCA AAACCCCAGC TATATCTTTT 
AGCAATGACG GCCCTGTCGG CGCACTGGGC 
TGCCGGCGCA GTCGACCGGC ACTACATTAC 
TCGCCACCGC CCATCCGGTT ACCCGCAAAG 
GCGCAGGATA CCGGCAGCGC GATTAAAGGC 
TTGGGGATAC GGCGACGAAG CCGGCGAACT 
CGGGATATGT CTGGCAGCTC CTACCCAACG 
CCGTAA 



This corresponds to the amino acid sequence <SEQ ID 3070; ORF 91 9>: 

m919 . pep 

1 MKKYLFRAAL YGIAAAILAA CQSKSIQTFP QPDTSVINGP DRPVGIPDPA 

51 GTTVGGGGAV YTWPHLSLP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV 

101 CAQAFQTPVH SFQAKQFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDDR 

151 RTAQARFPIY GIPDDFISVP LPAGLRSGKA LVRIRQTGKN SGTIDNTGGT 

201 HTADLSRFPI TARTTAIKGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA 

251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

301 KLGQTSMQGI KSYMRQNPQR LAEVLGQNPS YIFFRELAGS SNDGPVGALG 

351 TPLMGEYAGA VDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAI KG 

401 AVRVDYFWGY GDEAGELiAGK QKTTGYVWQL LPNGMKPEYR P* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3071 >: 

g919 . seq 

1 ATGAAAAAAC ACCTGCTCCG CTCCGCCCTG TACGGcatCG CCGCCgccAT 

51 CctcgCCGCC TGCCAAAgca gGAGCATCCA AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGGCCGG CCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TTGCCGGCGG CGGGGCCGTC TATACCGTTG TGCCGCACCT 

201 GTCCATGCCC CACTGGGCGG CGCaggATTT TGCCAAAAGC CTGCAATCCT 

251 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

3 01 TGCGCCCAAG CCTTTCAAAC CCCCGTGCAT TCCTTTCAGG CAAAGcGgTT 

3 51 TTTTGAACGC TATTTCACGC cgtGGCaggt tgcaggcaAC GGAAGcCTTG 
401 Caggtacggt TACCGGCTAT TACGAACCGG TGCTGAAGGG CGACGGCAGG 

4 51 CGGACGGAAC GGGCCCGCTT CCCGATTTAC GGTATTCCCG ACGATTTTAT 
501 CTCCGTCCCG CTGCCTGCCG GTTTGCGGGG CGGAAAAAAC CTTGTCCGCA 

5 51 TCAGGCAGac ggGGAAAAAC AGCGGCACGA TCGACAATGC CGGCGGCACG 

6 01 CATACCGCCG ACCTCTCCCG ATTCCCCATC ACCGCGCGCA CAACGGcaat 
6 51 caaaGGCAGG TTTGAaggAA GCCGCTTCCT CCCTTACCAC ACGCGCAACC 
701 AAAtcaacGG CGGCgcgcTT GACGGCAAag cccCCATCCT CggttacgcC 
751 GAagaccCcG tcgaacttTT TTTCATGCAC AtccaaggCT CGGGCCGCCT 
801 GAAAACCCcg tccggcaaat acatCCGCAt cggaTacgcc gacAAAAACG 
851 AACAtccgTa tgtttccatc ggACGctaTA TGGCGGACAA AGGCTACCTC 
901 AAGctcgggc agACCTCGAT GCAGGgcatc aaagcCTATA TGCGGCAAAA 
951 TCCGCAACGC CTCGCCGAAG TTTTGGGTCA AAACCCCAGC TATATCTTTT 

1001 TCCGCGAGCT TGCCGGAAGC GGCAATGAGG GCCCCGTCGG CGCACTGGGC 

1051 ACGCCACTGA TGGGGGAATA CGCCGGCGCA ATCGACCGGC ACTACATTAC 

1101 CTTGGGCGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CAGGCAGCGC GATCAAAGGC 

12 01 GCGGTGCGCG TGGATTATTT TTGGGGTTAC GGCGACGAAG CCGGCGAACT 

12 51 TGCCGGCAAA CAGAAAACCA CGGGATACGT CTGGCAGCTC CTGCCCAACG 

13 01 GCATGAAGCC CGAATACCGC CCGTGA 
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This corresponds to the amino acid sequence <SEQ ID 3072; ORF 919.ng>: 

g919 -pep 

1 NlKKHLLRSAL YGIAAAILAA CQSRSIQTFP QPDTSVINGP DRPAGI PDPA 

51 GTTVAGGGAV YTWPHLSMP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV 

101 CAOAFQTPVK SFQAKRFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDGR 

151 RTERARF PIY GIPDDFISVP LPAGLR3GKN LVRIRQTGKN SGTIDNAGGT 

201 HTADLSRFPI TARTTAIKGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA 

251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

301 KLGOTSMQGI KAYMRQNPQR LAEVLGQNPS VI FFRELAGS GNEGPVGALG 

3 51 TPLMGEYAGA IDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAIKG 

401 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P* 



ORF 919 shows 95.9 % identity over a 441 aa overlap with a predicted ORF (ORF 919.ng) 
from N. gonorrhoeae: 

m919/g919 

10 20 30 40 50 60 

MKKYLFRAALYGIAAAILAACQSKSIOTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

llhhhMIIMMIiMllhililMIIIIIIIIMIIMIMillMhllMI 

MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGIPDPAGTTYAGGGAV 
10 20 30 40 50 60 



m919 . pep 
g919 



70 80 90 100 110 120 

m919 . pep YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNROGWODVCAQAFOTPVHSFQAKQFFER 

IlilllihliMilliilillllllMllliillllilMllllillllllilHIII 

g919 YTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKRFFER 

70 80 90 100 110 120 



130 140 150 160 170 180 

m919.pep YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPI YGI PDDFISVPLPAGLRSGKA 

MiiiiiiiiiiiiMiiiniiiiiii 111 HHimimmmimhii 

g919 YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPI YGI PDDFISVPLPAGLRGGKN 

130 140 150 160 170 180 



190 200 210 220 230 240 

m919 - pep LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

MllllllllllllihMIIIIIMIIIIIIIIMIIIIIIMIIIIIMIIIMMII 

g919 LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRKQINGGAL 
190 200 210 220 230 240 



250 260 270 280 290 300 

m919.pep DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 

mmmmmiimnimmmiimmmmimmmmi 

g919 DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
250 260 270 280 290 300 



310 320 330 340 350 360 

m919 . pep KLGOTSMQGI KSYMRQNPQRLAEVLGQNPSY I FFRELAGSSNDGPVGALGTPLMGEYAGA 

MMMIIIIhllllllllMIIIIIIMIIIIIIIIIhhllMMIIMIIIMII 

g919 KLGQTSMQGI KAYMRQNPQRLAEVLGQNPSYI F FRE LAG S GN EG P VG AL GT P LMG E Y AG A 

310 320 330 340 350 360 



370 380 390 400 410 420 

m9 1 9 . pep VDRHYI TLGAPLFVATAHPVTRKALNRLIMAODTGS A I KGAVRVDYFWGYGDE AGE LACK 

:||IIIIIIIIIII!IIIIIIIIIIMIIIIIIIIIIIIIIIIIIII|||||||||MI! 

g919 I DRH Y I TLGAPLFVATAHPVTRKALNRL I MAQDTGS A I KGAVRVDYFWGYGDE AGE LAG K 

370 380 390 400 410 420 
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430 440 
m9 1 9 . pep QKTTGYVWQLLPNGMKPE YRPX 

IIIIIIIIIIIIIIIIIIMII 

g 9 1 9 QKTTG YVWQLLPNGMKPEYRPX 

430 440 



The following partial DNA sequence was identified in N .meningitidis <SEQ ID 3073>: 

a919 . seq 

1 ATGAAAAAAT ACCTATTCCG CGCCGCCCTG TGCGGCATCG CCGCCGCCAT 

51 CCTCGCCGCC TGCCAAAGCA AGAGCATCCA AACCTTTCCG CAACCCGACA 

101 CATCCGTCAT CAACGGCCCG GACCGGCCGG TCGGCATCCC CGACCCCGCC 

151 GGAACGACGG TCGGCGGCGG CGGGGCCGTT TATACCGTTG TGCCGCACCT 

201 GTCCCTGCCC CACTGGGCGG CGCAGGATTT CGCCAAAAGC CTGCAATCCT 

251 TCCGCCTCGG CTGCGCCAAT TTGAAAAACC GCCAAGGCTG GCAGGATGTG 

301 TGCGCCCAAG CCTTTCAAAC CCCCGTCCAT TCCGTTCAGG CAAAACAGTT 

351 TTTTGAACGC TATTTCACGC CGTGGCAGGT TGCAGGCAAC GGAAGCCTTG 

401 CCGGTACGGT TACCGGCTAT TACGAGCCGG GCTGAAGGG CGACGACAGG 

451 CGGACGGCAC AAGCCCGCTT CCCGATTTAC GGTATTCCCG ACGATTTTAT 

501 CTCCGTCCCC CTGCCTGCCG GTTTGCGGAG CGGAAAAGCC CTTGTCCGCA 

551 TCAGGCAGAC GGGAAAAAAC AGCGGCACAA TCGACAATAC CGGCGGCACA 

601 CATACCGCCG ACCTCTCCCA ATTCCCCATC ACTGCGCGCA CAACGGCAAT 

651 CAAAGGCAGG TTTGAAGGAA GCCGCTTCCT CCCCTACCAC ACGCGCAACC 

701 AAATCAACGG CGGCGCGCTT GACGGCAAAG CCCCGATACT CGGTTACGCC 

751 GAAGACCCCG TCGAACTTTT TTTTATGCAC ATCCAAGGCT CGGGCCGTCT 

801 GAAAACCCCG TCCGGCAAAT ACATCCGCAT CGGCTATGCC GACAAAAACG 

8 51 AACATCCCTA CGTTTCCATC GGACGCTATA TGGCGGACAA AGGCTACCTC 

901 AAGCTCGGGC AGACCTCGAT GCAGGGCATC AAAGCCTATA TGCAGCAAAA 

951 CCCGCAACGC CTCGCCGAAG TTTTGGGGCA AAACCCCAGC TATATCTTTT 

1001 TCCGAGAGCT TACCGGAAGC AGCAATGACG GCCCTGTCGG CGCACTGGGC 

1051 ACGCCGCTGA TGGGCGAGTA CGCCGGCGCA GTCGACCGGC ACTACATTAC 

1101 CTTGGGCGCG CCCTTATTTG TCGCCACCGC CCATCCGGTT ACCCGCAAAG 

1151 CCCTCAACCG CCTGATTATG GCGCAGGATA CCGGCAGCGC GATTAAAGGC 

1201 GCGGTGCGCG TGGATTATTT TTGGGGATAC GGCGACGAAG CCGGCGAACT 

1251 TGCCGGCAAA CAGAAAACCA CGGGATATGT CTGGCAGCTT CTGCCCAACG 

1301 GTATGAAGCC CGAATACCGC CCGTAA 

This corresponds to the amino acid sequence <SEQ ID 3074; ORF 919.a>: 

a919.pep 

1 MKKYLFRAAL CGIAAAILAA CQSKSIQTFP QPDTSVING? DRPVGIPDPA 

51 GTTVGGGGAV YTWPHLSLP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV 

101 CAQAFQTPVH SVQAKQFFER YFTPWQVAGN G5LAGTVTGY YEPVLKGDDR 

151 RTAQARFPIY GIPDDFISVP LPAGLRSGKA LVRIRQTGKN SGTIDNTGGT 

201 HTADLSQFPI TARTTAIKGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA 

251 EDPVELFFMH IQGSGRLKT? SGKYIRIGYA DKNEKPYVS I GRYMADKGYL 

301 KLGQTSMQGI KAYMQQN PQR LAEVLGQNPS YIFFRELTGS SNDGPVGALG 

351 TPLMGEYAGA VDRHYITLGA PL FV AT AH PV TRKALNRLIM AQDTGSAIKG 

401 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P* 

m919/a919 ORFs 919 and 919. a showed a 98.6% identity in 441 aa overlap 

10 20 30 40 50 60 

m919.pep MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
I I I I I I I I I I I I I I I II I i I I t 1 I I i I t ! I I I i I! I i I I ! I I I | I ! I ( | [| | I | | | | | I 
a 91 9 MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 

10 20 30 40 50 60 



7 0 80 90 100 110 120 

m919.pep YTVVPKLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
N 1 t I I I I I ! I I I I I i I I I j | | [ M M t I I i I M I I I I | t i ! I ! I | [ ( | | ! 1 j I | ! M j 
a919 YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFOTPVHSVQAKQFFER 

70 80 90 100 110 120 
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130 140 150 160 170 180 

m919.pep YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIY'G I PDDFI SVPLPAGLRSGKA 

I I I M I I I I M i I I I I M I I t I ! I j I I I I t I I I j I I I | I I | j | j | | | | | i | | M I II I I I 
a 91 9 YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFI SVPLPAGLRSGKA 

130 140 150 160 170 130 

190 200 210 220 230 240 

m919.pep LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

Mil! ! I ! M 1 I ! I 1 I I I 1 I : I I I I ! i I I i | | | | | | ! | M j ; M I M I I i i i M 

a919 LVRIROTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m9i9 . pep dgkapilgyaedpvelffmhiqgsgrlktpsgky::rigyadknehpyvsigrymadkgyl 
I 1 I M I II I I I I I I M I I I I I | | | I | | | ! M i I j| [ I I I I I I I i I [ j i i i i j i j i i i i i i 
a919 dgkapilgyaedpvelffmhiqgsgrlktpsgkyirigyadknehpyvsigrymadkgyl 

250 260 270 280 290 300 

310 320 330 340 350 360 

m919.pep KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYI FFRELAGSSNDGPVGALGTPLMGEYAGA 

I I I I M I I I I I : I i : I M I I I I i I I I I II I i M I I I I : | I I | | | | | ] M I I I I I I i I I I j 
a919 KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGS3NDGPVGALGTPLMGEYAGA 

310 320 330 340 350 360 

370 380 390 400 410 420 

m919. pep VDRKYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
M I I I II I I M I I I I II I I I I i I I I I I I I I | | I | | II | ( | ! | | | | f | | | \\ | M I I I I 1 ! 
a919 VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 

370 380 390 400 410 420 

430 440 
m919.pep QKTTGYVWQLLPNGMKPEYRPX 
i I I I I M i I I I i I M I I I I I I I 
a 91 9 QKTTGYVWQLLPNGMKPEYRPX 
430 440 



121 and 121-1 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3075>: 

ml 21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

2 51 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

401 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

451 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

501 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

551 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

601 xxxxxxCAGC TTCCTTACGA CAAAAACGGT GCAAAGTCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACGCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCAT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGACG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAC CTGAACCTCG ATCCGCAATG GGTGGAAGCC GCCGnATTTG 

1001 CGTGGTTGGC GGCGTGTTGG ATTAATCGCA TTCCCGGTAG TCCGCACAAA 
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1051 GCAACCGGCG CATCCAAACC GTGTATTGTG AnCGCGGGAT ATTATTATTG 
1101 A 

This corresponds to the amino acid sequence <SEQ ID 3076; ORF 12 1>: 

ml21 -pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFT? YPGRLRRQLL 

51 DLQD7GADEL HRSR1LSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY S1QLADLPLL Axxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

151 xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

201 xxQLPYDKNG AKSAQGNILP QLLDRLLAHP YFAQRHPKST GRELFAINWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS KAAADARQMY ICDGGIRNPV 

301 LMADLAECFG TRVSLKSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3077>: 

gl21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATTATG TCGGGAACCA GTATGGACGG 

51 GGCGGATGCC GTGCTGGTAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCTGACC GGTTGCGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCAC AGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCTCCGTGC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTtac -J ' VTACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACTGa cgcggatttT TACCGTCggc gacttcCGCA 

401 GCCGCGACCT TGCTGCCGGC GGacaAGGTG CGCCGCTCGT CCCCGCCTTT 

4 51 CACGAAGCCC TGTTCCGCGA TGACAGGGAA ACACGCGTGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGGCGCA CCCGCCTTCG 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGAcgcgtg gacgcaggca 

601 cacTGGcagc TGCCTTACGA CAAAAacggt gcAAAGgcgg cacAAGGCAA 

651 catatTGCcg cAACTGCTCG gcaggctGCT CGCCcaccCG TATTTCTCAC 

701 AACCCcaccc aaAAAGCACG GGgcGCGaac TgtttgcccT AAattggctc 

751 gaaacctAcc ttgacggcgg cgaaaaccga tacgacgtat tqcgcacgct 

801 ttcccgatcc accgcgcaaA ccgTttggqa cgccgtctca CACGCAGCGG 

8 51 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCTCAATG GGTGGAGGCG gccgCATTtg 

1001 cgtggttggC GGCGTGTTGG ATTAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCGACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 3078; ORF 121. ng>: 

gl21 .pep 

1 METQLYIGIM SGTSMDGADA VLVRMDGGKW LGAEGHAFT P YPDRLRRKLL 

51 DLQDTGTDEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPC DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AELTRI FTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRWLNIGGI ANISVLPPGA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLGRLLAHP YFSQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVWDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWLAACW INRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

ORF 121 shows 73.5% identity over a 366 aa overlap with a predicted ORF (ORF121.ng) 
from N. gonorrhoeae; 

ml21/gl21 

10 20 30 40 50 60 

ml 21 .pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
i I I I I I I I I i I I M I 1 ! I I ! I I : I I I I I i I I I I I I I I I I I I I I ! I t : I I i I I t I I : t M 
gl21 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml 21 . pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 

I 1 I I : i i 1 i I I 1 I I I I ! I I I I I I I I I II I I i I I ' ' I I I i I l I I I I I I I I I I I I I I I I i I 
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gl?l KRSR1-1LSQELSRLYAQTAAELLCSQNLAPCDITALGCHG0TVRHAPEHGYSIQLADLPLL 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml 2 1 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

f : : : 
gl2i AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 
130 140 150 160 170 180 

190 200 210 220 230 240 

ml 21 .pep XXXXXXXXXXXXXXXXXXXXXXQLPYDKNGAKSAQGNILPQLLDRLLAHPYFAQRHPKST 
: : I I I j I I I I I I :! I I I I I I I I I I I I I I I I I : i II I i I 

g!2i PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml 21 . pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQT7CDAVSHAAADARQMYICDGGIRNPV 

I I I I ! I : I II I i I I I I I M I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I i I II II 
gl21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 21 - pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 

II I II M I I I I I I I I I I I I : I I M I I I I I I I I i I I M I I I I I I I i II I I I I I I ! I I I II 
gl21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml 21. pep XAGYYYX 
MINI 

gl21 GAGYYYX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3079>: 



al21 . seq 



1 


ATGGAAACAC 


AGCTTTACAT 


51 


GGCGGATGCC 


GTACTGATAC 


101 


AAGGGCACGC 


CTTTACCCCC 


151 


GATTTGCAGG 


ACACAGGCGC 


201 


GCAAGAACTC 


AGCCGCCTGT 


251 


GTCAAAACCT 


CGCGCCGTCC 


301 


ACCGTCAGAC 


ACGCGCCGGA 


351 


GCCGCTGCTG 


GCGGAACGGA 


401 


GCCGCGACCT 


TGCGGCCGGC 


451 


CACGAAGCCC 


TGTTCCGCGA 


501 


CGGCGGGATT 


GCCAACATCA 


551 


GCTTCGACAC 


AGGACCGGGC 


601 


CACTGGCAGC 


TTCCTTACGA 


651 


CATATTGCCG 


CAACTGCTCG 


701 


AACCCCACCC 


TAAAAGCACG 


751 


GAAACCTACC 


TTGACGGCGG 


801 


TTCCCGATTC 


ACCGCGCAAA 


851 


CAGATGCCCG 


TCAAATGTAC 


901 


TTAATGGCGG 


ATTTGGCAGA 


951 


CACCGCCGAA 


CTGAACCTCG 


1001 


CATGGATGGC 


GGCGTGTTGG 


1051 


GCAACCGGCG 


CATCCAAACC 


1101 


A 





CGGCATCATG TCGGGAACCA GCATGGACGG 
GGATGGACGG CGGCAAATGG CTGGGCGCGG 
TACCCCGGCA GGTTACGCCG CAAATTGCTG 
GGACGAACTG CACCGCAGCA GGATGTTGTC 
ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 
GACATTACCG CCCTCGGCTG CCACGGGCAA 
ACACAGTTAC AGCGTACAGC TTGCCGATTT 
CTCAGATTTT TACCGTCGGC GACTTCCGCA 
GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 
CGACAGGGAA ACACGCGCGG TACTGAACAT 
GCGTACTCCC CCCCGACGCA CCCGCCTTCG 
AATATGCTGA TGGACGCGTG GATGCAGGCA 
CAAAAACGGT GCAAAGGCGG CACAAGGCAA 
ACAGGCTGCT CGCCCACCCG TATTTCGCAC 
GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 
CGAAAACCGA TACGACGTAT TGCGGACGCT 
CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 
ATTTGCGGCG GCGGCATCCG CAATCCTGTT 
ATGTTTCGGC ACACGCGTTT CCCTGCACAG 
ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 
GTCAACCGCA TTCCCGGTAG TCCGCACAAA 
GTGTATTCTG GGCGCGGGAT ATTATTATTG 



This corresponds to the amino acid sequence <SEQ ID 3080; ORF 121. a>: 

al21 -pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQI FTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPKPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 
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301 LMADLAECFG TRV3LHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 
351 ATGASKPC1L GAGYYY* 

ml21/al21 ORFs 121 and 121. a showed a 74.0% identity in 366 aa overlap 



10 20 30 40 50 60 

ml 21 .pep M E T QL Y I G I M S GT S M DGA DA V L I RM DG G K W LG AE G HA FT P Y PG R L-RR Q L L DLQ DTG A DE L 

M I I I i i I I I I I i I I I i I ! 1 I I 1 ! 1 I I I I ! I I M | I I i I ( I I I M ! I : I M I M I I i I I ! 
a 121 METQLYIGIMSGTSMDGADAVLIRJ-iDGGKWLGAEGHAFTFYPGRLRRKLLDLQDTGADEL 
10 20 30 40 50 60 



70 80 90 100 110 120 

ml 21 .pep HRSRI LSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRKAPEHGYSIQLADLPLL 

I i I I : M I M I I 1 I I I I j t 1 I I I M I I I ! I I i I I M I I I ! I I 1 I I I I I : I i : I I I | ! I I ! 
al21 HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 
70 80 90 100 110 120 

130 140 150 160 170 180 

ml 2 1 . pep AXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

I : : : 
al21 AERTQIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLNIGGIANISVLPPDA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 21 .pep XXXXXXXXXXXXXXXXXXXXXXQL PYDKNGAK S AQGN 1 L PQLLDRLLA.H PY FAQRHPKST 

• I ! I I I I M I I : i I 1 I [ I i I I I I I I ! I M i | | | | | | | | 

a!21 PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 21 .pep GRELFAINWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICDGGIRNPV 
I I I I I I : I I M I I I I i 1 I I I I I I If I I If ! I I N I II I ! I I I !! I I I I i I I I II I | | I 
al21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVFDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml21 .pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I I I ! M I I I II I i I M I I I : t i I I I I I I ! i I I I I : ! I I I : I I I j I I I | I t I I I M I I I I 
al21 LMADLAECFGTRVSLHS'f AE LNLDPQWVEAAAFAWMAACWVNRI PGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml21.pep XAGYYYX 
MINI 

a!21 GAGYYYX 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 308 1>: 

ml21-l . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 

151 GATTTGCAGG ACACAGGCGC AGACGAACTG CACCGCAGCA GGATTTTGTC 

201 GCAAGAACTC AGCCGCCTAT ATGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCACCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCCGAC ACGCGCCGGA ACACGGTTAC AGCATACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CGCGGATTTT TACCGTCGGC GACTTCCGCA 

4 01 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCACTCGT CCCCGCCTTT 

4 51 CACGAAGCCC TGTTCCGCGA CAACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGACGCGTG GACGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

7 51 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGTTTT ACCGCGCAAA CCGTTTGCGA CGCCGTCTCA CACGCAGCGG 
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851 CAGATGCCCG TCAAATGTAC 

901 TTAATGGCGG ATTTGGCAGA 

951 CACCGCCGAC CTGAACCTCG 

1001 CGTGGTTGGC GGCGTGTTGG 

1051 GCAACCGGCG CATCCAAACC 

1101 A 



ATTTGCGGCG GCGGCATCCG CAATCCTGTT 
ATGTTTCGGC ACACGCGTTT CCCTGCACAG 
ATCCGCAATG GGTGGAAGCC GCCGNATTTG 
ATTAATCGCA TTCCCGGTAG TCCGCACAAA 
GTGTATTCTG ANCGCGGGAT ATTATTATTG 



This corresponds to the amino acid sequence <SEQ ID 3082; ORJF 121-1 >: 

ml21-i . pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRQLL 

51 DLQDTGADEL HRSRILSOEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHGY S IQLADLPLL AERTRI FTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDNRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPKPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVCDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAD LNLDPQWVEA AXFAWLAACW INRIPGSPHK 

351 ATGASKPCIL XAGYYY* 

ml21-l/gl21 ORFs 121-1 and 121. ng showed a 95.6% identity in 366 aa overlap 



10 20 30 40 50 60 

ml21-l . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I M I I I ! I II I I ! i I M I I I I I : M I I M I M i I M ! I [ I M ! I I I : I I I I ! M i : I 1 I 
gl21 METQLYIGIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTPYPDRLRRKLLDLQDTGTDEL 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml21-l . pep HRSRILSQELSRLYAQTAAELLCSQN LAPS DITALGCHGQTVRHAPEHGY5 IQLADLPLL 
I I I I : I I i I I II I i I I i I I I I i I II I i I I I I I I I 1 I 1 I M I I i I I 11 M I ! I I I I I 1 I I 
gl21 HRSRMLSQELSRLYAQTAAELLCSQN LAPCDITALGCHGQTVRHAPEHGYS IQLADLPLL 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml21-l .pep AERTRI FTVGDFRSRDLAAGGQGAPLV PAFHEALFRDNRETRAVLN I GG I ANISVLPPDA 
I i I I I I I I I If I I I I I I I I I i I I I 11 1 I I I t M I I I : I I i I : i I I I I I I I I M M ! I i 
gl21 AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml2I-l .pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 
I I I I I I t I I I I I I M I I I I I I I II I M I I I I II I M I i ! M II II I I I I 1 I : I I i I I t I 
g!21 PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGNILPQLLGRLLAHPYFSQPHPKST 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml21-l .pep GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMY ICGGGIRNPV 
I I i ! I I I I ! I I I I I I I I ! II I i I I I M I I M i I I 1 I M I I I I I I I I I I I I I | | | | | 1 j ! 
gl21 GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml21-l .pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I ! It 1 I I I I I I ! 1 I I I I I I : I I I ! I II I I II I I ! I I I I I i I I i I M M I I I I I I I I I I I 
gl21 LMADLAECFGTRVSLHSTAELNLDPQWVEAAAFAWLAACWINRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 



ml21-l .pep XAGYYYX 
MINI 

gl21 GAGYYYX 

The following DNA sequence was identified in N. meningitidis <SEQ ID 3083>: 

al21-l . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 
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51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCGCCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCAGAC ACGCGCCGGA ACACAGTTAC AGCGTACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA 

4 01 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 

4 51 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

7 51 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 

1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTA^TCTG GGCGCGGGAT ATTATTATTG 

1101 A 

s corresponds to the ammo acid sequence <SEQ ID J0S4; ORF 121-l.a>: 

al21-l .pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQI FTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLNIGGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

351 ATGASKPCIL GAGYYY+ 

ml21-l/al21-l ORFs 121-1 and 121-1. a showed a 96.4% identity in 366 aa overlap 

10 20 30 40 50 60 

ml21-l .pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
M M i M I ! I ! I 1 ! I II I M I M I I I I I I j I I I I I i I I I I M I I I I I : 1 I j I ! i I I I [ M 
al21-l METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGKAFTPYPGRLRRKLLDLQDTGADEL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml21-l .pep HRSRILSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSIQLADLPLL 
I M I : I 1 I i I I 1 I ! I I I I [ I M ! I ! I I I I I i I ! I I I j I I I I I I I I I I I : I I : I I | I | | | | 
al21-l HRSRMLSQELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml21-l .pep AERTRI FT VGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLN I GG I ANISVLPPDA 
I I I I : I I I I I ! I I I I I I ! I I I I I I I I M II It I II I i : I I I I I I I M I I I I I M I I I | I | 
al21-l AERTQI FTVG DFRSRDLAAGGQGAPLVPAFHEALFRDDRETRAVLN I GG I ANISVLPPDA 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml21-l .pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKNGAKAAQGN I LPQLLDRLLAHP YFAQPHPKST 
I I I I I i I t I I I 1 I I I I I 1 I I I I I I I ! | I I I | i | | | 1 | i I | | M | | M | | | | | ] | | | i | j 
a!21-l PAFGFDTGPGNMLMDAWMQAHWQLPYDKNGAKAAQGNILPQLLDRLLAHPYFAQPHPKST 
190 200 210 220 230 240 

250 260 270 280 290 300 

ml21-l .pep GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICGGGIRNPV 
I 1 I i ! M II I I ! I ! II M I II I I I I I | | | | I M I | I I | | i | | [ || | | | | i | | || | | | | i 
a 12 1-1 GRE LFALNWLETYLDGGENRYDVLRTLSR.FTAQTVFDAVS HAAADARQMY I CGGG I RNPV 

250 260 270 280 290 300 
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310 320 330 340 350 360 

ml21-l . pep ^lADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCI L 
I M I i I I 11 I I I I I I I I I I : I I ! I I I I I I ! I I M : f I I I : I I I I ! [ I I If M I I 1 f i I I 
a 121 L^DLAECFGTRVSLHSTAELNLDPQWVEAAArAWMAACWVNRIPGSPKKATGASKPCIL 
310 320 330 340 350 360 



ml 2 1-1. pep XAGYYYX 

i I 1 1 1 i 

a 121 GAGYYYX 



128 and 128-1 



The following partial DNA sequence was 


ml 0 ft sen 


(partial ) 




1 


ATGACTGACA 


ACGCACTGCT 


5 1 


AATCAAAACC 


GAAGACATCA 


101 


CGCGCGAACA 


AATCGCCGCC 


151 


AACACTGTCG 


AACCCCTGAC 


201 


GGGCGTGGTG 


TCGCACCTCA 


251 


CCGTCTATAA 


CGAACTGATG 


301 


GGACAAGACA 


TCGAGCTGTA 


351 


CGAATTCGAC 


ACCCTCTCCC 


1 


TACGCCAGCG 


AAAAACTGCG 


51 


wGTCAAAAAA TAyTTCCCyG 


101 


AAmTCAAAAA 


ACTmTACGGC 


151 


TGGCACAAAG 


ACGTGCGCTA 


201 


AGGCGGCGTT 


TATATGGATT 


251 


CGTGGATGAA 


CGACTACAAA 


301 


CAAyTGCCCA 


CCGCCTACCT 


351 


CAGGGAAGCC 


CGCyTGAGCC 


401 


CCGGACACGG 


GCTGCACCAC 


451 


T CCGG CATC A 


ACGGCGTAkA 


501 


TATGGAAAAT 


TTCGTTTGGG 


551 


ACGAAGAAAC 


CGGcgTTCCC 


601 


GCCGCCAAAA 


ACTTCCAAsG 


651 


CGCCCTCTTT 


GATATGATGA 


701 


AAAACTGGCA 


ACAGGTTTTA 


751 


CAGCCGCCCG 


AATACAACCG 


801 


AGGCGGCTAT 


TCCGCAGCTn 


851 


GCGCGGACGC 


ATACGCCGCC 


901 


GGCAAACGCT 


TTTGGCAGGA 


951 


nGCAGAATCC 


TTCAAAGCCT 


1001 


TCTTGCGCCA 


CAGCGGTTTC 



identified in /V. meningitidis <SEQ ID 3085>: 

CCATTTGGGC GAAGAACCCC GTTTTGATCA 
AACCCGCCCT GCAAACCGCC ATCGCCGAAG 
ATCAAAGCCC AAACGCACAC CGGCTGGGCA 
CGGCATCACC GAACGCGTCG GCAGGATTTG 
ACTGCGTCGC CGACACGCCC GAACTGCGCG 
CCCGAAATCA CCGTCTTCTT CACCGAAATC 
CAACCGCTTC AAAACCATCA AAAATTCCCC 
CCGCACAAAA AACCAAACTC AACCAC 
CGAAGCCAAA TACGCGTTCA GCGAAACCGA 
TCGGCAAwGT ATTAAACGGA CTGTTCGCCC 
ATCGGATTTA CCGAAAAAAC yGTCCCCGTC 
TTkTGAATTG CAACAAAACG GCGAAmCCAT 
TGTACGCACG CGAAGGCAAA CGCGGCGGCG 
GGCCGCCGCC GTTTTTCAGA CGGCACGCTG 
CGTCTGCAAC TTCGCCCCAC CCGTCGGCGG 
ACGACGAAAT CCTCATCCTC TTCCACGAAA 
CTGCTTACCC AAGTGGACGA ACTGGGCGTA 
ATGGGACGCG GTCGAACTGC CCAGCCAGTT 
AATACAATGT CTTGGCACAA mTGTCAGCCC 
yTGCCGAAAG AACTCTTsGA CAAAwTGCTC 
CGGCATGTTC yTsGTCCGGC AAwTGGAGTT 
TTTACAGCGA AGACGACGAA GGCCGTCTGA 
GACAGCGTGC GCAAAAAAGT CGCCGTCATC 
CTTCGCCTTG AGCTTCGGCC ACATCTTCGC 
ATTACAGCTA CGCGTGGGCG GAAGTATTGA 
TTTGAAGAAA GCGACGATGT CGCCGCCACA 
AATCCTCGCC GTCGGGGnAT CGCGCAGCGG 
TCCGCGGCCG CGAACCGAGC ATAGACGCAC 
GACAACGCGG TCTGA 



This corresponds to the amino acid sequence <SEQ ID 3086; ORF 128>: 

ml28.pep (partial) 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA I KAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNCVADTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSPAQKTKL NH 



1 YASEKLREAK YAFSETXVKK YFPVGXVLNG LFAQXKKLYG IGFTEKTVPV 

51 WHKDVRYXEL QQNGEXIGGV YMDLYAREGK RGGAVfMNDYK GRRRFSDGTL 

101 QLPTAYLVCN FAPPVGGREA RLSHDEILIL FHETGHGLHH LLTQVDELGV 

151 SGINGVXWDA VELPSQFMEN FWIEYNVLAQ XSAHEETGVP LPKELXDKXL 

201 AAKNFQXGMF XVRQXEFALF DMMIYSEDDE GRLKNWQQVL DSVRKKVAVI 

251 QPPEYNRFAL SFGHIFAGGY SAAXYSYAWA EVLSADAYAA FEESDDVAAT 

301 GKRFWQEILA VGXSRSGAES FKAFRGREPS IDALLRHSGF DNAV* 
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The following partial DNA sequence was identified in /V. gonorrhoeae <SEQ ID 3087>: 

gl28 . seq 

1 atgattgaca acgCActgct ccactcgggc gaagaaccCC GTTTTaat ca 

51 aatccaaacc gaagACAtca AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 

401 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 

451 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 

651 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 

751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 

801 AACCGccaaa CTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGG ACT TG A 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACT CTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACtaca AAGGCCGCCG CCGCTTTGCC GACGgcacGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

1351 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AacCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

14 51 TGTCCGGCAT CAAcggcgtA GAATGGGACG CGGTCGAACT GCCCAGCCAG 

1501 TTTATGGAAA ACTTCGTTTG GGAATACAAT GTATTGGCAC AAATGTCCGC 

1551 CCACGAAGAA AccgGCGAGC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TcgcCGCCAA AAACTTCCAG CGCGGTATGT TCCTCGTCCG GCAAATGGAG 

1651 TTCGCCCTCT TCGATATGAT GATTTACAGT GAAAGCGACG AATGCCGTCT 

1701 GAAAAACTGG CAGCAGGTTT TAGACAGCGT GCGCAAAGAA GTcGCCGTCA 

1751 TCCAACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCacatctTC 

1801 GCcggcGGCT ATTCCGCAGG CTATTACAGC TACGCATGGG CCGAAGTCCt 

1851 CAGCACCGAT GCCTACGCCG CCTTTGAAGA AAGcGACGac gtcGCCGCCA 

1901 CAGGCAAACG CTTCTGGCAA GAAAtccttg ccgtcggcgg ctCCCGCAGC 

1951 gcgGCGGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 

2 001 ACTGCTGCGC CAaagcggtT TCGACAACGC gGCttgA 

This corresponds to the amino acid sequence <SEQ ID 3088; ORF 128.ng>: 

gl28 . pep 

1 MIDNALLHLG EEPRFNQIQT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA 

51 NTVERLTGIT ERVGRIWGW SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDA1AMFAA . 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQI YRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP WDLSYAGEKL REAKYAFSET 

3 51 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

4 01 I GGVYMDL YA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 
451 GKEAHLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

5 01 FMENFVWEYN VLAQMSAHEE TGEPLPKELF DKMLAAKNFQ RGMFLVRQME 
551 FALFDMMIYS ESDECRLKNW QQVLDSVRKE VAVIQPPEYN RFANSFGHIF 
601 AGGYSAGYYS YAWAEVLSTD AYAAFEESDD VAATGKRFWQ SILAVGGSRS 



WO 99/57280 



95 



PCT/US99/09346 



651 AAESFXAFRG REPS IDALLR QSGFDNAA* 



ORF 128 shows 91.7% identity over a 475 aa overlap with a predicted ORF (ORF 128.ng) 
from N. gonorrhoeae: 

mI28/gl28 

10 20 30 40 50 60 

g!28 . pep MIDNALLHLGEEPRFNQIQTEDIKPAVOTAIAEARGOIAAVKAOTHTGWANTVERLTGIT 

I IIMMIM!llhl|:liil(ll:|lllllll li!l:(lM!!lllll!l Mill 

ml28 MTDNALLHLGEEPRFDQI KTEDIKPALQTAIAEAREQIAAIKAQTKTGWANTVEPLTGIT 

10 20 30 40 50 60 

70 80 90 100 110 120 

g!28 .pep ERVGRIWGWSHLNSWDTPELRAVYNELMPSITVFFTEIGODIELYNRFKTIKNSPEFA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I ! 1 1 ! 1 1 1 1 1 1 1 1 1 i 1 1 i 1 1 1 1 i I i 1 1 1 i ! i 1 1 1 1 1 II ! II I 

m i2 8 ERVGRIWGVVSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
70 80 90 100 110 120 

130 140 150 160 170 180 

gl28 . pep TLSPAQKTKLDHDLRDFVLSGAELPPERQAEIiAKLOTEGAOLSAKFSONVLDATDAFGIY 

mimnm 

ml 2 8 TLS PAQKTKLNH 

130 

// 

340 350 360 

a 12 8 pep YAGEKLREAKYAFSETEVKKYFPVGKVLAG 

Ihltlllilllllli llllllll II I 
ml 2 8 YASEKLREAKYAFSETXVKKYFPVGXVLNG 

10 20 30 

370 380 390 400 410 420 

gl28 . pep LFAQIKKLYGIGFAEKTVPVWHKDVRYFELOQNGKTIGGVYMDLYAREGKRGGAWMNDYK 

mi imiitmiiimiimi iimmimmiimimiiiimi 

m i28 LFAQXKKLYGIGFTEKTVPWHKDVRYXELQQNGEXIGGVY>nDLYAREGKRGGAWMNDYK 
40 50 60 70 80 90 

430 440 450 460 470 480 

gl28 . pep GRRRFADGTLQLPTAYLVCNFAPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVDELGV 

imimiimimmmmmmmiii mimmiiiimiiii 

m i 2 8 GRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDE I LI LFHETGHGLHHLLTQVDELGV 

100 110 120 130 140 150 

490 500 510 520 530 540 

g 1 2 8 . pep SG I NGVEWDAVELPSQFMENFVWE YNVLAQMS AHEETGE PLP KEL FDKMLAAKNFQRGMF 

mm mmimmiiiiiiiiii mim mm n niim in 

ml 2 8 SGINGVXWDAVELPSQFMENFVWEYNVLAOXSAHEETGVPLPKELXDKXLAAKNFQXGMF 
160 170 180 190 200 210 

550 560 570 580 590 600 

g 1 2 8 . pep LVRQMEFALFDMMI YSESDECRLKNWQQVLDS VRKEVAV I QP PE YNRFANS FGHI FAGGY 

iii immiiiimi iiiiiiiiiiiiihiiiiiiiiiiiii 1 1 1 1 1 1 1 1 1 1 

ml 2 8 XVRQXEFALFDMMI YS EDDEGRLKNWQQVLDSVRKKVAV I QPPEYKRFALS FGHI FAGGY 

220 230 240 250 260 270 

610 620 630 640 650 660 

gl2 8 . peo SAGYYSYAWAEVLSTDAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRGREPS 

i 1 1 1 1 1 1 1 1 1 m 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 m 1 1 1 nmiimmim 

ml2 8 SAAXYSYAV-JAEVLSADAYAAFEESDDVAATGKRFWQEILAVGXSRSGAESFKAFRGREPS 
280 290 300 310 320 330 
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670 679 
g 1 2 8 . pep I DALLRQSGFDNAAX 

MlllhllMII: 

ml2 8 IDALLRHSGFDNAVX 
340 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3089>: 



al28 . seq 








i 


ATGACTGACA 


ACGCACTGCT 


CCATTTGGGC 


51 


AATCAAAACC 


GAAGACATCA 


AACCCGCCCT 


101 


CGCGCGAACA 


AATCGCCGCC 


ATCAAAGCCC 


151 


AACACTGTCG 


AACCCCTGAC 


CGGCATCACC 


201 


GGGCGTGGTG 


TCGCACCTCA 


ACTCCGTCAC 


251 


CCGCCTACAA 


TGAATTAATG 


CCCGAAATTA 


301 


GGACAAGACA 


TCGAGCTGTA 


CAACCGCTTC 


351 


CGAGTTCGAC 


ACCCTCTCCC 


ACGCGCAAAA 


401 


TGCGCGATTT 


CGTCCTCAGC 


GGCGCGGAAC 


451 


GAATTGGCAA 


AACTGCAAAC 


CGAAGGCGCG 


501 


CCAAAACGTC 


CTAGACGCGA 


CCGACGCGTT 


551 


CCGCACCGCT 


TGCCGGCATT 


CCCGAAGACG 


601 


GCCGCGCAAA 


GCGAAGGCAA 


AACAGGCTAC 


651 


GCACTACCTC 


GCCGTCATCC 


AATACGCCGA 


701 


AAATCTACCG 


CGCCTACGTT 


ACCCGCGCCA 


751 


AAATTCGACA 


ACACCGCCAA 


CATCGACCGG 


801 


AACCGCCAAA 


CTGCTCGGCT 


TCAAAAACTA 


851 


CCAAAATGGC 


GGACACCCCC 


GAACAAGTTT 


901 


GCCCGCCGCG 


CCAAACCCTA 


CGCCGAAAAA 


951 


CTTCGCCCGC 


GAAAGCCTCG 


GCCTCGCCGA 


1001 


GCTACGCCGG 


CGAAAAACTG 


CGCGAAGCCA 


1051 


GAAGTCAAAA 


AATACTTCCC 


CGTCGGCAAA 


1101 


CCAAATCAAA 


AAACTCTACG 


GCATCGGATT 


1151 


TCTGGCACAA 


AGACGTGCGC 


TATTTTGAAT 


1201 


ATAGGCGGCG 


TTTATATGGA 


TTTGTACGCA 


1251 


CGCGTGGATG 


AACGACTACA 


AAGGCCGCCG 


1301 


TGCAACTGCC 


CACCGCCTAC 


CTCGTCTGCA 


1351 


GGCAAAGAAG 


CCCGCTTGAG 


CCATGACGAA 


1401 


AACCGGACAC 


GGCCTGCACC 


ACCTGCTTAC 


1451 


TATCCGGCAT 


CAACGGCGTA 


GAATGGGACG 


1501 


TTTATGGAAA 


ATTTCGTTTG 


GGAATACAAT 


1551 


CCACGAAGAA 


ACCGGCGTTC 


CCCTGCCGAA 


1601 


TCGCCGCCAA 


AAACTTCCAA 


CGCGGAATGT 


1651 


TTCGCCCTCT 


TTGATATGAT 


GATTTACAGC 


1701 


GAAAAACTGG 


CAACAGGTTT 


TAGACAGCGT 


1751 


TCCGACCGCC 


CGAATACAAC 


CGCTTCGCCA 


1801 


GCAGGCGGCT 


ATTCCGCAGG 


CTATTACAGC 


1851 


GAGCGCGGAC 


GCATACGCCG 


CCTTTGAAGA 


1901 


CAGGCAAACG 


CTTTTGGCAG 


GAAATCCTCG 


1951 


GCGGCAGAAT 


CCTTCAAAGC 


CTTCCGCGGA 


2001 


ACTCTTGCGC 


CACAGCGGCT 


TCGACAACGC 



3GGCTGGGCA 



:gcaaa ctgcgcgaac 

3CTTTC AGACGACGGC 



TCCTCGTCCG CCAAATGGAG 



This corresponds to the amino acid sequence <SEQ ID 3090; ORF 128.a>: 

al28.pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQI PHYL AV1QYADNRK LREQIYRAYV TRASELSDDG 

2 51 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

4 01 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN V LA QMS AH EE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 
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551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAVVRPPEYN RFANSFGHIF 
601 AGGYSAGYYS YAWA_ . LSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAESFKAFRG REPS I DALLR HSGFDNAA* 

128/al28 ORFs 128 and 128. a showed a 66.0% identity in 677 aa overlap 

10 20 30 40 50 60 

ml 2 8 .pep M r . DNALLHLGEEPRFDQIKTEDIXPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 

i ! M I I M I i I t I f I I I I t I ) i I I i I I I I I ] t | I U | | | | | I j | M | ! M | | | | I M I i I 
a 126 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQI AAIfCAQTHTGWANTVEPLTGIT 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 2 8 .pep ERVGRIWGVVSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
I I I I I I I M I t I I I I : ! I ! ! I M : I I I i I II I I I I I II I i I I M I I i I I I I I M I M ! I 
al28 ERVGRIWGWSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
70 80 90 100 110 120 

130 

ml28.pep TLSPAQKTKLNH 

ill I I I M M ! 

al28 TL5HAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
130 140 150 160 170 180 



m!28.pep 

a 128 FDDAAPLAGIPEDAL.AMFAAAAQSEGKTGYKIGLQI PHYLAVIQYADNRKLREQIYRAYV 

190 200 210 220 230 240 



ml23.pep 

a!28 TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 
250 260 270 280 290 300 

140 150 

ml 28 .pep YASEKLREAKYAFSETXVKKY FPVGX 

I I : I I It 1 I I I I I I I i II I i I I I I 
al28 ARRAKPYAEKDLAEVKAFARESLGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGK 
310 320 330 340 350 360 

160 170 180 190 200 210 

ml28.pep VLNGLFAQXKKLYGIGFTEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWM 
I ! I I I I I ! i i i I I I I I I I I 11 I I I I I II I ! I I I I t II : i i I I I 1 II I I I i i ! I I! I I i 
a 12 8 VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 
370 380 390 400 410 420 

220 230 240 250 260 270 

ml28 . pep NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 
I I I t I I ! i I I I I It M I I I N I I I I : I I j I I : I I I I I I I I M I M I I 1 I I I I I ! i M II 
a 128 NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLKHLLTQVD 
430 440 450 460 470 480 

280 290 300 310 320 330 

ml28 . pep ELGVSGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPEPKELXDKXLAAKNFQ 
I I I I I M i I I I) I I I II I I I II i I I II I I I I I ! I I 1 I E I I I I M I I I II I I I I I I I 
al28 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
490 500 510 520 530 540 

340 350 360 370 380 390 

ml28 .pep XGMFXVRQXEFALFDMMIYSEDDEGRLKKWQQVLDSVRKKVAVIQPPEYNRFALSFGHIF 
I ! I I I i I I I I ! I I I I i t I I I I II II I i I 1 I I I I I I : I | I : : | \ \ | | | ] | | | | || j 
al28 RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAWRPPEYNRFANSFGHIF 
550 560 570 530 590 600 
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400 410 420 430 440 450 

mi28 .pep AGG Y S AAX Y S Y A W AE V LS A DA Y AA FE E S D D V AAT G KR FW Q E I L A VG XSRSGAES FKA FRG 

I I I i I I : I ! ! I I I I ! ! M II I I I i I [ I 1 I j | j | | | | M ! I I I I I | I ! I : I j M M t 11 
a!28 AGGYSAGYY5YAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG 
610 620 630 640 650 660 

460 470 
ml 2 3. pep REPSI DALLRHSGFDNAVX 

I I I II I ! I I I I M I f I I : 
al28 REPSI DALLRHSGFDNAAX 

670 



Further work revealed the DNA sequence identified in N. meningitidis <SEQ ID 309 1>: 

ml28-l.seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCC . GCAAACCGCC ATCGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCGC CGACACGCCC GAACTGCGCG 

2 51 CCGTCTATAA CGAACTGATG CCCGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTCGAC ACCCTCTCCC CCGCACAAAA AACCAAACTC AACCACGATC 

4 01 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 

4 51 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAAGCAA AACAGGCTAC AAAATCGGCT TGCAGATTCC 

651 ACACTACCTC GCCGTCATCC AATACGCCGA CAACCGCGAA CTGCGCGAAC 

7 01 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAACTTTC AGACGACGGC 

7 51 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGCAA ACGCCCTGCA 

801 AACCGCCAAA CTGCTCGGCT TCAAAAACTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAAAGCCTGA ACCTCGCCGA TTTGCAACCG TGGGACTTGG 

1001 GCTACGCCAG CGAAAAACTG CGCGAAGCCA AATACGCGTT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 

1201 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC ACCCGTCGGC 

1351 GGCAGGGAAG CCCGCCTGAG CCACGACGAA ATCCTCATCC TCTTCCACGA 

1401 AACCGGACAC GGGCTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

14 51 TATCCGGCAT CAACGGCGTA GAATGGGACG CGGTCGAACT GCCCAGCCAG 

1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCAC AAATGTCAGC 

1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

1601 TCGCCGCCAA AAACTTCCAA CGCGGCATGT TCCTCGTCCG GCAAATGGAG 

1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 

1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAAAA GTCGCCGTCA 

1751 TCCAGCCGCC CGAATACAAC CGCTTCGCCT TGAGCTTCGG CCACATCTTC 

1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 

1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 

1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 

1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGC CGCGAACCGA GCATAGACGC 

2001 ACTCTTGCGC CACAGCGGTT TCGACAACGC GGTCTGA 

This corresponds to the amino acid sequence <SEQ ID 3092; ORF 128-1>: 

ml28-l .pep. 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGVV SHLMSVADTP ELRAVYNELM PEITVFFTEI 
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101 GQDIELYNRF 

151 ELAKLQTEGA 

201 AAQSESKTGY 

251 KFDNTANIDR 

301 ARRAKPYAEK 

351 EVKKYFPVGK 

401 IGGVYMDLYA 

4 51 GREARLSHDE 

501 FMENFVWEYN 

551 FALFDMMIYS 

601 AGGYSAGYYS 

651 AAESFKAFRG 



KTIKNSPEFD 
QLSAKFSQNV 
KIGLQIPHYL 
TLANALQTAK 
DLAEVKAFAR 
VLNGLFAQIK 
REGKRGGAWM 
ILILFHETGH 
VLAQMSAHEE 
EDDEGRLKNW 
YAWAEVLSAD 
REPSIDALLR 



TLSPAQKTKL 
LDATDAFGIY 
AVIQYADNRE 
LLGFKNYAEL 
ESLNLADLQP 
KLYGIGFTEK 
NDYKGRRRFS 
GLHKLLTQVD 
TGVPLPKELF 
QQV LDSVRKK 
AYAAFEESDD 
HSGFDNAV* 



NHDLRDFVLS 
FDDAAPLAGI 
LREQI YRAYV 
SLATKMADT? 
WDLGYASEKL 
TVPVWHKDVR 
DGTLQLPTAY 
ELGVSGINGV 
DKMLAAKNFQ 
VAVIQPPEYN 
VAATGKRFWQ 



GAELPPEQQA 
PEDALAMFAA 
TRASELSDDG 
EQVLNFLHDL 
REAKYAFSET 
YFELQQNGET 
LVCNFAPPVG 
EWDAVELPSQ 
RGMFLVRQME 
RFALSFGKIF 
EI LAVGGSRS 



The following DNA sequence was identified in N. gonorrhoeae <SEQ ID 3093>; 

gl28-l.seq (partial) 

1 ATGATTGACA ACGCACTGCT CCACTTGGGC GAAGAACCCC GTTTTAATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

301 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

351 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 

4 01 TGCGCGATTT CGTATTGAGC GGCGCGGAAC TGCCGCCCGA ACGGCAGGCA 

4 51 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 

501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 

601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 

65 i GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 

7 51 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 

801 AACCGCCAAA CTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 

851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 

901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

1201 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

1251 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGCTTTGCC GACGGCACGC 

1301 • TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 

1351 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AACCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

14 51 TGTCCGGCAT CAACGGCGTA AAA 



This corresponds to the amino acid sequence <SEQ ID 3094; ORF 128-l.ng>: 

gl28-l.pep (partial) 

1 MIDNALLHLG EEPRFNQIKT EDIKPAVQTA IAEARGQIAA VKAQTHTGWA 

51 NTVERLTGIT ERVGRIWGVV SHLNSWDTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFA TLSPAQKTKL DHDLRDFVLS GAELPPERQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQIPHYL AVIQYAGNRE LREQIYRAYV TRASELSNDG 

251 KFDNTANIDR TLENALKTAK LLGFKNYAEL SLATKMADT P EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR EHLGLADPQP VJDLSYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLAGLFAQIK KLYGIGFAEK TVPVWHKDVR YFELQQNGKT 

4 01 IGGVYMDLYA REGKRGGAWM NDYKGRRRFA DGTLQLPTAY LVCNFAPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHKLLTQVD ELGVSGINGV K 



ml28-l/gl28-l ORFs 128-1 and 128-1. ng showed a 94.5% identity in 491 aa 
overlap 

10 20 30 40 50 60 
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g 128-1 .pep MIDNALLHLGEEPRFNQIKTEDIKPAVOTAIAEARGQIAAVKAQTHTGWANTVERLTGIT 
I I I I I I I ! I ! i I ! I : I i M I I I I I ! : I I I I I M i II I I : I I I ! I I I I I j ! i ! I I I I I 
ml 28-1 MTDNALLHLGEEPRFDQIKTEDIKFALQTAIAEAREQIAAIKAQTH7GWANTVEPLTGIT 
10 20 30 40 50 60 

70 80 90 ICO 110 120 

g 128-1. pep E RVGRI WG VV S H LNSVV DT PELRAVYN E LM P E I T V F FTE I GQ D I E LYNRFKT I KN S PE F A 

I I I! I I I I I I I I II I I : I I ! ! M I ! I I ! I If I M I I i I I I I I I I I I I I I I I I I I ! I I I I 
ml 2 8-1 ERVGRIWGVVSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIELYriRFKTIKNSPEFD 
70 80 90 100 110 120 

130 140 150 160 170 180 

g 128-1 . pep TLSPAQKTKLDHDLRDFVLSGAELPPERQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
I I I I I I I I I ! : I I I I I I I I I I f I I ! I I : I I I I I M I I I II I I I I II I M I I I I I I I ! I I [ 
ml28-l TLSPAQKTKLNHDLRDFVLSGAELPPEQQAE LAKLQTEGAQLSAKFSQNVLDATDAFGI Y 

130 140 150 160 170 180 

190 200 210 220 230 240 

gl28-l -pep FDDAAPLAGI PEDALAMFAAAAQSEGKTGYKIGLQI PH YLAVIQYAGNRELREQIYRAYV 
I I I i I ! I I I H I I I I I M II I i E I I : I I I I i M I i i I I ( II I I I 11 III t U I I I I I I I 
ml2 8-l FDDAAPLAGI PEDALAMFAAA&QSE SKTGYKIGLQ I PHYLAVIQYADNRELREQIYRAYV 

190 200 210 220 230 240 

250 260 2 " : 280 290 300 

gl28-l .pep TRA5ELSNDGKFDNTANIDRTLENALKTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 
] I I 1 I 1 I : I Ml i i Ml H II I I I 1 : I i I I I 11 1 I I II I I I 1 1 I 1 1 ! I ! I I I I 1 I 1 I I I 
ml 28-1 TRASELSDDGKFDNTANIDRTLANALQTAKLLG FKNYAELSLATKMADTPEQVLNFLHDL 

250 260 270 280 290 300 

310 320 330 340 350 360 

gl 28-1 .pep ARRAKPYAEKDLAEVKAFAREKLGLADPQPWDLSYAGEKLREAKYAFSETEVKKYFPVGK 

I I I I I M M I I I I I I I I 1 I II Mill I i I I I : I I : I I I I I I I I I II I I I II ! I I I I M 
ml23-l ARRAKPYAEKDLAEVKAFARESLNLADLQPWDLGYASEKLREAKYAFSETEVKKYFPVGK 

310 320 330 340 350 360 

370 380 390 400 410 420 

g 128-1 .pep VLAGLFAQIKKLYGIGFAEKTVPWJHKDVRY FELQQNGKTIGGVYMDLYAREGKRGGAWM 

II I I ! I I I I II I I I I I : I M I M M ! I I ! I ! I M M I : I i I I I I I I il I I I I I I I I ! i I 
ml 28-1 VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 

370 380 390 400 410 420 

430 440 450 460 470 480 

gl28-l .pep NDYKGRRRFADGTLQLPTAYLVCNFAPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 
I I I ! I I M I : I ! I I I ! I I I I I M I I I II I I I : I I M I I II I I I I I I I I II I II I I I I I I 
ml 28-1 NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 

430 440 450 460 470 480 

490 

gl28-l .pep ELGVSGINGVK 
i II M II M I ■ 

ml 28-1 ELGVSGINGVEWDAVELPSQFMENFVWEYMVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
490 500 510 520 530 540 

The following DNA sequence was identified in N. meningitidis <SEQ ID 3095>: 

al28-l . seq 

1 ATGACTGACA ACGCACTGCT CCATTTGGGC GAAGAACCCC GTTTTGATCA 

51 AATCAAAACC GAAGACATCA AACCCGCCCT GCAAACCGCC ATTGCCGAAG 

101 CGCGCGAACA AATCGCCGCC ATCAAAGCCC AAACGCACAC CGGCTGGGCA 

151 AACACTGTCG AACCCCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTGGTG TCGCACCTCA ACTCCGTCAC CGACACGCCC GAACTGCGCG 

2 51 CCGCCTACAA TGAATTAATG CCCGAAATTA CCGTCTTCTT CACCGAAATC 
301 GGACAAGACA TCGAGCTGTA CAACCGCTTC AAAACCATCA AAAACTCCCC 

3 51 CGAGTTCGAC ACCCTCTCCC ACGCGCAAAA AACCAAACTC AACCACGATC 

4 01 TGCGCGATTT CGTCCTCAGC GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 
4 51 GAATTGGCAA AACTGCAAAC CGAAGGCGCG CAATTTCCG CCAAATTCTC 
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501 CCAAAACGTC CTAGACGCGA CCGACGCGT7 CGGCATTTAC TTTGACGATG 

551 CCGCACCGCT TGCC^CATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCT 

601 GCCGCGCAAA GCGAAGGCAA AACAGGCTAC AAAATCGGTT TGCAGATTCC 

651 GCACTACCTC GCCGTCATCC AATACGCCGA CAACCGCAAA ^TGCGCGAAC 

701 AAATCTACCG CGCCTACGTT ACCCGCGCCA GCGAGCTTTC AGACGACGGC 

7 51 AAATTCGACA ACACCGCCAA CATCGACCGG ACGCTCGAAA ACGCCCTGCA 
301 AACCHCCAAA CTGCTCGGCT TC.AAAAACTA CGCCGAATTG TCGCTGGCAA 

8 51 CCAAAATGGC GGACACCCCC GAACAAGTTT TAAACTTCCT GCACGACCTC 
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 

9 51 CTTCGCCCGC GAAAGCCTCG GCCTCGCCGA TTTGCAACCG TGGGACTT G G 
1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 
1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTATTAAACG GACTGTTCGC 
1101 CCAAATCAAA AAACTCTACG GCATCGGATT TACCGAAAAA ACCGTCCCCG 
1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCGAAACC 
12 01 ATAGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 
12 51 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 
1301 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 
1351 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCC ACGA 
14 01 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG 
14 51 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 
1501 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGGCGC AAATGTCCGC 
1551 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 
1601 TCGCCGCCAA AAACTTCCAA CGCGGAATGT TCCTCGTCCG CCAAATGGAG 
1651 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 
17 01 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 
17 51 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATrT T C 
1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 
1851 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 
1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 
1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC ■ 
2 001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 3096; ORF 128-1 .a>: 

al28-l .pep 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGVV SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

201 AAQSEGKTGY KIGLQI PHYL AVIQYADNRK LREQIYRAYV TRASELSDDG 

251 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 

301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

351 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

401 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 

4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 

501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 

551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAVVRPPEYN RFANSFGHIF 

601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EI LAVGGSRS 

651 AAESFKAFRG REPSIDALLR HSGFDNAA* 

ml28-l/al28-l ORFs 128-1 and 128- La showed a 97.8% identity in 677 aa overlap 

10 20 30 40 50 60 

al28-l.pep mtdnallhlgeeprfdqiktedikpalqtaiaeareqiaaika.qthtgwantvepltgtt 

1 I ! I 1 I M I I I I I I I I I I I I I I I 1 I I I I I I I M I ! I I I I I H | | | | 

mi28-l mt dnallhlgeeprfdqiktedikpalqtaiaeareqiaaikaqthtgwantvepltgit 

10 20 30 40 50 60 



al2B-l .Deo 



70 80 90 100 U0 120 

ERVGRIWGVVSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

100 , 1 11,11 M ""HM: III Mill! IIIIIIIM inn | 

ml28-l ERVGRIWGVVSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIFLYNRFKTIKNSPEFD 

70 80 90 100 110 120 



130 140 150 160 



170 160 
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a 128-1 . pep T LS HAQKT K L N H D L R D FY L S G AE L P PE QQAE LAK LQT E G AQ L 5 AK FS QN V L DA T DA FG 1 Y 
Ml I I I I t i I I I I I I I I I f ! I I I I ! I I I 1 I I I 1 M I I I M I i I I I I t I I I I I f I i I I I I 
ml 28-1 TLSPAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
130 140 150 160 170 180 



190 200 210 220 230 240 

al2S-l . pep FDDAAPLAGI PEDALAMFAAAAQSEGKTGYKIGLQI PHYLAVIQYADNRKLREQI YRAYV 
! ! I I I I I I I i i I i I I I I M ! 1 I I I I : ! I i ! I N I ! I I I i I I i i I M I M : f M i I I I I I ! 
m 1 2 8 - 1 F D D AA P L AG I PE D A LAM FAAAAQS E SKTG YK I GLQ I ?H YLA V I Q YADN RE LRE Q I Y RAYV 

190 200 210 220 230 240 



250 260 270 280 290 300 

al28-i . pep TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPSQVLNFLHDL 
! I I I M i I I I I I i I I I I I ! II 1 I I I I I I I I I I I I I i I ( I I ! i M I M ! M I I I I I I 1 I I 
ml28-l TRASELSDDGKFDNTANIDRTLANALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 
250 260 270 280 290 300 



310 320 330 340 350 360 

al28-l.pep ARRAKPYAEKDLAEVKAFARESLGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGK 
I I I I I I I i I I i I I I i I I I I I I I I : I I I ! I I I I I I ! : I I I ! I I I I I I I I II i II I I I I M 
ml28-l ARRAKPYAEKDLAEVKAFARESLNLADLQPWDLGYASEKLREAKYAFSETEVKKYFPVGK 

310 320 330 340 350 360 



370 380 390 400 410 420 

al28-l .pep VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 
I I I I I M I I I I I I I I ! M I I ! f I I I I I I I I I i I I 11 I I i I I I I I ! I I I I i M 1 I M i I i I 
m!28-l VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 
370 380 390 400 410 420 



430 440 450 460 470 480 

al 2 6-1 - pep NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGKGLHHLLTQVD 
I I 1 II I I M M I I I I I I I i I I I I II : I I I II : I I I II I I I I I I M I I I I I I M I! I I I I 
KV128-1 NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 

430 440 450 460 470 480 



490 500 510 520 530 540 

al28-l . pep ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
I I M I M II I I II I I I ! I I II I I I I I I I I II ! I II I f i I I I I I I 11 I I I I I I I I I I I I I I 
ml2 8-l ELGVSGINGVEWDAVEijPSOFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 

490 500 510 520 530 540 



550 560 570 580 590 600 

al28-l . pep RGMFLVRQMEFALFDMMI YSEDDEGRLKNWQQVLDSVRKEVAWRPPEYNRFANSFGKI F 
1 M M I I I I I I I 1 I M M t 1 I Ml 1 I )) I I I 1 I I I 1 1 M : III : : 11 ! i I I I I I t I I I I 
ml 2 8-1 RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIF 

550 560 570 580 590 600 



610 620 630 640 650 660 

al28-l .pep AGGYSAGYYSYAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGGSRSAAESFKAFRG 
i 1 I I II I t II I I I I I i ! I I 1 I I i I I I I I I I M I I M 1 I I 1 ! i ! I i It 1 I \ \ I I i I i I I I I 
ml 2 8- 1 AGGYSAGYYS YAWAEVLSADAYAAFEES DDVAATGKRFWQE I LA VGGSRSAAES FKAFRG 

610 620 630 640 650 660 



670 679 
al28-l.pep REPSI DALLRHSGFDNAAX 

i I i I IE I I I I i I I I I I I : 
ml2 8-l REPSI DAL LRHSGFDNAVX 

670 



206 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3097>: 

n\2 06 . seq 
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I ATGTTTCCCC CCGACAAAAC CCTTTTCC7C TGTCTCAGCG CAGTGCTCCT 

51 CGGCTCATGC GGCAGGACCT CCGGCAAACA CCGCCAACGG AAACCCAAAC 

101 AGACAGTGCG GCAAATCCAA GCCGTCCGCA TCAGGCACAT CGACCGCACA 

151 CAAGGCTCGG AGGAACTCAT GGTCCACAGG CTCGGACTCA TCGGCACGCC 

201 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT TGCAGCGGCA 

2 51 TGATTCAATT CGTTTACAAr AACGCCCTCA AGGTCAAGCT GCCGCGCACC 

3 01 GCCCGCGACA TGGGGGCGGC AAGCCGsAAA ATGCCCGAcA GCCGCyTCAA 
351 GGCCGGCGAC CTGGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

4 01 ACGTCGGACT CTACATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 
4 51 GGCAAAACCA TCAAAACCGA AAAACTCTCC ACACCGTTTT ACGCCAAAAA 
501 CTACCTCGGC GGACATACTT TTTTTACAGA ATGA 

This corresponds to the amino acid sequence <SEQ ID 3098; ORF 206>: 

m206 .pep. . 

1 MFPPDKTLFL CLSALLLASC GTTSG KHRQP KPKQTVRQIQ AVRISHIDRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 

101 ARDMAAASRK IPDSRXKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3099>: 

g206 . seq 

1 atgttttccc ccgacaaaac ccttttcctc tgtctcggcg cactgctcct 

51 cgcctcatgc ggcacgacct ccggcaaaca ccgccaaccg aaacccaaac 

101 agacagtccg gcaaatccaa gccgtccgca tcagccacat cggccgcaca 

151 caaggctcgc aggaactcat gctccacagc ctcggactca tcggcacgcc 

201 ccacaaatgg ggcggcagca gcaccgcaac cggattcgac tgcagcggca 

251 tgattcaatt ggtttacaaa aacgccctca acgtcaagct gccgcgcacc 

301 gcccgcgaca tggcggcggc aagccgcaaa atccccgaca gccgcctcaa 

351 ggccggcgac atcgtattct tcaacaccgg cggcgcacac cgctacccac 

401 acgtcggact ctacatcggc aacggcgaat tcanccatgc ccccggcagc 

451 ggcaaaacca ccaaaaccga aaaactctcc acaccgtttt acgccaaaaa 

501 ctaccttgga gcgcatacgt tttttacaga atga 

This corresponds to the amino acid sequence <SEQ ID 3100; ORF 206. ng>: 

g2 06 .pep 

1 MFSPDKTLFL CLGALLLASC GTTSG KHRQP KPKQTVRQIQ AVRISKIGRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQLVYK NALNVKLPRT 

101 ARDMAAASRK I PDSRLKAGD IVFFNTGGAH RYSHVGLYIG NGEFIHAPGS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 



ORF 206 shows 96.0% identity over a 177 aa overlap with a predicted ORF (ORF 206.ng) 
from N. gonorrhoeae: 

m206/g206 

10 20 30 40 50 60 

m206 .pep MFPPDKTLFLCLSALLLASCGTTSGKKRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS 

II llllllllhlllllllllllMlllllllllMIIIIIIIIII IIIIIIIIIMI 
g2 06 MFSPDKTLFLCLGALLIASCGTTSGKHRQPKPKQTVRQIQAVRISHIGRTQGSQELMLHS 
10 20 30 40 50 60 



70 80 90 100 110 120 

m206 .pep LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKIPDSRXKAGD 

itmmimummmiihmmimmmiiimimii iiii 

g206 LGLIGTPYKWGGSSTATGFDCSGMIQLVYKNALNVKLPRTARDMAAASRKI PDSRLKAGD 

70 80 90 100 110 120 



130 140 150 160 170 

m2 06 .pep LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 

:|IMII1MIMIIII!!IIIIIIII|:||IIIMIINIIIIIII!IIIIIIIII 

g206 IVFFNTGGAKRYSHVGLYIGNGEFIHAPGSGKTIKTEKLSTPFYAKNYLGAHTFFTE 
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130 140 150 160 170 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3 101>: 

a206 . seq 

1 ATGTTTCCCC CCGACAAAAC CCTTTTCCTC TGTCTCAGCG CACTGCTCCT 

51 CGCCTCATGC GGCACGACCT CCGGCAAACA CCGCCAACCG AAACCCAAAC 

101 AGACAGTCCG GCAAATCCAA GCCGTCCGCA TCAGCCACAT CGACCGCACA 

151 CAAGGCTCGC AGGAACTCAT GCTCCACAGC CTCGGACTCA TCGGCACGCC 

201 CTACAAATGG GGCGGCAGCA GCACCGCAAC CGGCTTCGAT T3CAGCGGCA 

251 TGATTCAATT CGTTTACAAA AACGCCCTCA ACGTCAAGCT GCCGCGCACC 

301 GCCCGCGACA TGGCGGCGGC AAGCCGCAAA ATCCCCGACA GCCGCCTTAA 

3 51 GGCCGGCGAC CTCGTATTCT TCAACACCGG CGGCGCACAC CGCTACTCAC 

4 01 ACGTCGGACT CTATATCGGC AACGGCGAAT TCATCCATGC CCCCAGCAGC 
4 51 GGCAAAACCA TCAAAACCGA AAAACTCTCC ACACCGTTTT ACGCCAAAAA 
501 CTACCTCGGC GCACATACTT TCTTTACAGA ATGA 

This corresponds to the amino acid sequence <SEQ ID 3102; ORF 206, a>: 

a206.pep 

1 MFPPDK TLFL CLSALLLASC GTT SGKHRQP KPKQTVRQIQ AVRISHIDRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 

101 ARDMAAASRK IPDSRLKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

m206/a206 ORFs 206 and 206.a showed a 99.4% identity in 1 77 aa overlap 

10 20 30 40 50 60 

m2 0 6.pep MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRI SHIDRTQGSQELMLHS 

M I I 1 t I I ! I I ! I I ! I i 1 I M ! I I I I 1 I I ! j ! I I | M 1 1 | i ! | | | | | | | | | j | | j j j | | | 
a206 MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRI SHIDRTQGSQELMLHS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 06.pep LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTARDMAAASRKI PDSRXKAGD 

I I I I I II I I M I f I I I! I M I I I M I I I i I i I I I i I I I I I i I I | M M II I I [ I I MM 
a20 6 LG L I GTPYKWGGSSTATGFDCSGM I QFVYKNALNVKLPRT ARDMAAASRK I PDSRLKAGD 

70 80 90 100 110 120 

130 140 150 160 170 

m206 . pep LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 
I I I i I I I M I I I I M I I II I II I II M M M M I! II M M I M I II M M M I 11 M 
a2 06 LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 
130 140 150 160 170 



287 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3103>: 

m287 . seq 

1 ATGTTTAAAC GCAGCGTAAT CGCAATGGCT TGTATTTTTG CCCTTTCAGC 

51 CTGCGGGGGC GGCGGTGGCG GATCGCCCGA TGTCAAGTCG GCGGACACGC 

101 TGTCAAAACC TGCCGCCCCT GTTGTTTCTG AAAAAGAGAC AGAGGCAAAG 

151 GAAGATGCGC CACAGGCAGG TTCTCAAGGA CAGGGCGCGC CATCCGCACA 

201 AGGCAGTCAA GATATGGCGG CGGTTTCGGA AGAAAATACA GGCAATGGCG 

251 GTGCGGTAAC AGCGGATAAT CCCAAAAATG AAGACGAGGT GGCACAAAAT 

301 GATATGCCGC AAAATGCCGC CGGTACAGAT AGTTCGACAC CGAATCACAC 

3 51 CCCGGATCCG AATATGCTTG CCGGAAATAT GGAAAATCAA GCAACGGATG 

4 01 CCGGGGAATC GTCTCAGCCG GCAAACCAAC CGGATATGGC AAATGCGGCG 
4 51 GACGGAATGC AGGGGGACGA TCCGTCGGCA GGCGGGCAAA ATGCCGGCAA 
501 TACGGCTGCC CAAGGTGCAA ATCAAGCCGG AAACAATCAA GCCGCCGGTT 
551 CTTCAGATCC CATCCCCGCG TCAAACCCTG CACCTGCGAA TGGCGGTAGC 
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601 AATTTTGGAA GGGTTGATTT GGCTAATGGC GTTTTGATTG ACGGGCCGTC 

651 GCAAAATATA ACGi.GACCC ACTGTAAAGG CGATTCTTGT AGTGGCAATA 

701 ATTTCTTGGA TGAAGAAGTA CAGCTAAAAT CAGAATTTGA AAAATTAAGT 

7 51 GATGCAGACA AAATAAGTAA TTACAAGAAA GATGGGAAGA ATGATAAATT 

801 TGTCGGTTTG GTTGCCGATA GTGTGCAGAT GAAGGGAATC AATCAATATA 

851 TTATCTTTTA TAAACCTAAA CCCACTTCAT TTGCGCGATT TAGGCGTTCT 

901 GCA~GGTCGA GGCGGTCGCT TCCGGCCGAG ATGCCGCTGA TTCCCGTCAA 

951 TCAGGCGGAT ACGCTGATTG TCGATGGGGA AGCGGTCAGC CTGACGGGGC 

1001 ATTCCGGCAA TATCTTCGCG CCCGAAGGGA ATTACCGGTA TCTGACTTAC 

1051 GGGGCGGAAA AATTGCCCGG CGGATCGTAT GCCCTTCGTG TTCAAGGCGA 

1101 ACCGGCAAAA GGCGAAATGC TTGCGGGCGC GGCCGTGTAC AACGGCGAAG 

1151 TACTGCATTT CCATACGGAA AACGGCCGTC CGTACCCGAC CAGGGGCAGG 

1201 TTTGCCGCAA AAGTCGATTT CGGCAGCAAA TCTGTGGACG GCATTATCGA 

12 51 CAGCGGCGAT GATTTGCATA TGGGTACGCA AAAAT T C AAA GCCGCCATCG 
1301 ATGGAAACGG CTTTAAGGGG ACTTGGACGG AAAATGGCAG CGGGGATGTT 

13 51 TCCGGAAAGT TTTACGGCCC GGCCGGCGAG GAAGTGGCGG GAAAATACAG 
1401 CTATCGCCCG ACAGATGCGG AAAAGGGCGG ATTCGGCGTG TTTGCCGGCA 

14 51 AAAAAGAGCA GGATTGA 



This corresponds to the amino acid sequence <SEQ ID 3104; ORF 287>: 

m287 .pep 

1 MFKRSVIAMA CIFALSA CGG GGGGSPDVKS ADTLSKPAAP WSEKETEAK 

51 EDAPQAGSQG QGAPSAQGSQ DMAAVSEENT GNGGAVTADN PKNEDEVAQN 

101 DMPQNAAGTD SSTPNHTPDP MMLAGNMENQ ATDAGESSQP ANQPDMANAA 

151 DGMQGDDPSA GGQNAGNTAA QGANQAGNNQ AAGSSDPIPA SNPAPANGGS 

201 NFGRVDLANG VLIDGPSQNI TLTHCKGDSC SGNNFLDEEV QLKSEFEKLS 

251 DADKISNYKK DGKNDKFVGL VADSVQMKGI NQYI IFYKPK PTSFARFRRS 

301 ARSRRSLPAE MPLIPVNQAD TLIVDGEAVS LTGHSGNIFA PEGNYRYLTY 

3 51 GAEKLPGGSY ALRVQGEPAK GEMLAGAAVY NGEVLHFHTE NGRPYPTRGR 

4 01 FAAKVDFGSK SVDGIIDSGD DLHMGTQKFK AAIDGNGFKG TWTENGSGDV 
451 SGKFYGPAGE EVAGKYSYRP TDAEKGGFGV FAGKKEQD* 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 3105>: 

g2S7 . seq 

1 atgtttaaac gcagtctgat tgcaatggct tgtatttttc ccctttcagc 

51 ctgtgggggc ggcggtggcg gatcgcccga tgtcaagtcg gcggacacgc 

101 cgccaaaacc ggccgccccc gttgttgctg aaaatgccgg ggaaggggtg 

151 ctgccgaaag aaaagaaaga tgaggaggca gcgggcggtg cgccgcaagc 

201 cgatacgcag gacgcaaccg ccggagaagg cagccaagat atggcggcag 

251 tttcggcaga aaatacaggc aatggcggtg cggcaacaac ggacaacccc 

301 aaaaatgaag acgcgggggc gcaaaatgat atgccgcaaa atgccgccga 

351 atccgcaaat caaacaggga acaaccaacc cgccggttct tcagattccg 

401 cccccgcgtc aaaccctgcc cctgcgaatg gcggtagcga ttttggaagg 

451 acgaacgtgg gcaattctgt tgtgattgac ggaccgtcgc aaaatataac 

501 gttgacccac tgtaaaggcg attcttgtaa tggtgataat ttattggatg 

551 aagaagcacc gtcaaaatca gaatttgaaa aattaagtga tgaagaaaaa 

601 attaagcgat ataaaaaaga cgagcaacgg gagaattttg tcggtttggt 

651 tgctgacagg gtaaaaaagg atggaactaa caaatatatc atcttctata 

701 cggacaaacc acctactcgt tctgcacggt cgaggaggtc gcttccggcc 

751 gagattccgc tgattcccgt caatcaggcc gatacgctga ttgtggatgg 

801 ggaagcggtc agcctgacgg ggcattccgg caatatcttc gcgcccgaag 

851 ggaattaccg gtatctgact tacggggcgg aaaaattqcc cggcggatcg 

901 tatgccctcc gtgtgcaagg cgaaccggca aaaggcgaaa tgcttgttgg 

951 cacggccgtg tacaacggcg aagtgctgca rttccatatg gaaaacggcc 

1001 gtccgtaccc gtccggaggc aggtttgccg caaaagtcga tttcggcagc 

1051 aaatctgtgg acggcattat cgacagcggc gatgatttgc atatgggtac 

1101 gcaaaaattc aaagccgcca tcgatggaaa cggctttaag gggacrtgga 

1151 cggaaaatgg cggcggggat gtttccggaa ggttttacgg cccggccggc 

1201 gaggaagtgg cgggaaaata cagctatcgc ccgacagatg ctgaaaaggg 

1251 cggattcggc gtgtttgccg gcaaaaaaga tcgggattga 

This corresponds to the amino acid sequence <SEQ ID 3106; ORF 287. ng>: 

g287 . pep 

1 MFKRSVIAMA CIFPLSACGG GGGGSPDVKS ADTPSKPAAP VVAENAGEGV 
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106 



101 

151 
201 
251 
301 
351 
401 



LPKEKKDEEA 
KNEDAGAQND 
TNVGNSVVID 
IKRYKKDEQR 
EIPLIPVNQA 
YALRVQGEPA 
KSVDGIIDSG 
EEVAGKYSYR 



AGGAPQADTQ 
MPQNAAESAN 
GPSQNITLTH 
EN FVGLVADR 
DTLI VDGEAV 
KGEMLVGTAV 
DDLHMGTQKF 
PTDAEKGGFG 



DATAGEGSQD 
QTGNNQPAGS 
CKGDSCNGDN 
VKKDGTNKYI 
SLTGHSGNIF 
YNGEVLHFHM 
KAAI DGNGFK 
VFAGKKDRD* 



MAAVSAENTG 
SDSAPASNPA 
LLDEEAPSKS 
I FYTDKPPTR 
APEGNYRYLT 
ENGRPYPSGG 
GTWTENGGGD 



NGGAATTDNP 
PANGGSDFGR 
EFEKLSDEEK 
SARSRRSLPA 
YGAEKLPGGS 
RFAAKVDFGS 
VSGRFYGPAG 



m287/g287 ORFs 287 and 287. ng showed a 70.1% identity in 499 aa overlap 

10 20 30 40 49 

ki287 . pep MFKRSVIAMACI FALSACGGGGGGSPDVKSADTLSKPAAPWSE KETEA 

! I i I I I I I I I I ! i I i I I I ! I I I I I I I I I I M ! I I I I I ! I I : I I : I I 

g2 87 MFKRSVIAMACI FPLSACGGGGGGSPDVKSADTPSKPAAPVVAENAGEGVLPKEKKDEEA 

10 20 30 40 50 60 

50 60 70 80 90 100 109 

m287 . pep KEDAPQAGSQGQGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQN DMPQNAAGT 

Mil : I I : : : I 1 i ! I ! I I I i I I I II ! I : I : i I I I ! ! ! I M ! I I I 11 I 
g2 87 AGGAPQADTQD — ATAGEGSQDMAAVSAENTGNGGAATTDNPKNEDAGAQNDMPQNAA— 

70 80 90 100 110 

110 120 130 140 150 160 169 

m287 . pep DSSTPNHTPDPNMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTA 



g287 



170 180 190 200 210 220 229 



m237 . pep AQGANQAGNNQAAGSSDPIPASNPAPANGGSNFGRVDLANGVLI DGPSQNITLTHCKGDS 

: : I I I : I I I i I I M I I I I I I I I I I 1 I I : M I : : : : I : I : I I I I I t t I I 1 I M ! i I 1 
g287 -ESANQTGNNQPAGSSDSAPASNPAPANGGSDFGRTNVGNSW I DGPSQNITLTHCKGDS 

120 130 140 150 160 170 

230 240 250 260 270 280 289 

m287 . pep CSGNNFLDEEVQLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYI I FYKP 

I : I : I : I I I I : I I ( I M I I 1 : I I : Mil : I : I I : I I M I 

g287 CNGDNLLDEEAPSKSEFEKL5DEEKIKRYKKDEQRENFVGLVADRVKKDGTNKYI IFYTD 

180 190 200 210 220 230 

290 300 310 320 330 340 349 

m287 .pep KPTSFARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNI FAPEGNYRYLT 

! 1 : M M I I I M II 1 M II II M II I I I II i I M I M M I I M M M M I 1 11 I 

g287 KPPT RSARSRRSLPAEIPLIPVNQADTLIVDGEAVSLTGHSGNI FAPEGNYRYLT 

240 250 260 270 230 290 

350 360 370 380 390 4 00 409 

m287 . pep YGAEKLPGGS YALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDFGS 

M M II I M I I I I II I I M II I I I I M M M I M M I M I I M M I : I II I I II I I I I 
g287 YGAEKLPGGSYALRVQGEPAKGEMLVGTAVYNGEVLHFHMENGRPYPSGGRFAAKVDFGS 
300 310 320 330 340 350 

410 420 430 440 450 4 60 469 

m287 .pep KSVDGI IDSGDDLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYR 

I II I I I M I M I II I I I I I M I I M ! II I II I I I M I M I II I M I I M I II I M M I I I 
g2S~ KSVDGI IDSGDDLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYR 

360 370 380 390 400 410 

470 480 489 

m287 . pep PTDASKGGFGVFAGKKEQDX 
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I I ! II It I t i II I t I I : : I I 
g2 8 7 PTDAEKGGFGVFAGKKDRDX 
420 430 

The following partial DNA sequence was identified in A\ meningitidis <SEQ ID 3107>: 

a287 . seq 

1 ATGTTTAAAC GCAGTGTGAT TGCAATGGCT TGTATTGTTG CCCTTTCAGC 

51 CTGTGGGGGC GGCGGTGGCG GATCGCCCGA TGTTAAGTCG GCGGACACGC 

101 TGTCAAAACC TGCCGCCCCT GTTGTTACTG AAGATGTCGG GGAAGAGGTG 

151 CTGCCGAAAG AAAAGAAAGA TGAGGAGGCG GTGAGTGGTG CGCCGCAAGC 

201 CGATACGCAG GACGCAACCG CCGGAAAAGG CGGTCAAGAT ATGGCGGCAG 

2 51 TTTCGGCAGA AAATACAGGC AATGGCGGTG CGGCAACAAC GGATAATCCC 

301 GAAAATAAAG ACGAGGGACC GCAAAATGAT A T G C C G C AAA ATGCCGCCGA 

351 TACAGATAGT TCGACACCGA ATCACACCCC TGCACCGAAT ATGCCAACCA 

4 01 GAGATATGGG AAACCAAGCA CCGGATGCCG GGGAATCGGC ACAACCGGCA 

4 51 AACCAACCGG ATATGGCAAA TGCGGCGGAC GGAATGCAGG GGGACGATCC 

501 GTCGGCAGGG GAAAATGCCG GCAATACGGC AGATCAAGCT GCAAATCAAG 

551 CTGAAAACAA TCAAGTCGGC GGCTCTCAAA ATCCTGCCTC TTCAACCAAT 

601 CCTAACGCCA CGAATGGCGG CAGCGATTTT GGAAGGATAA ATGTAGCTAA 

651 TGGCATCAAG CTTGACAGCG GTTCGGAAAA TGTAACGTTG ACACATTGTA 

701 AAGACAAAGT ATGCGATAGA GATTTCTTAG ATGAAGAAGC ACCACCAAAA 

7 51 TCAGAATTTG AAAAATTAAG TGATGAAGAA AAAATTAATA AATATAAAAA 
801 AGACGAGCAA CGAGAGAATT TTGTCGGTTT GGTTGCTGAC AGGGTAGAAA 

8 51 AGAATGGAAC TAACAAATAT GTCATCATTT ATAAAGACAA GTCCGCTTCA 
901 TCTTCATCTG CGCGATTCAG GCGTTCTGCA CGGTCGAGGC GGTCGCTTCC 
951 GGCCGAGATG CCGCTGATTC CCGTCAATCA GGCGGATACG CTGATTGTCG 

1001 ATGGGGAAGC GGTCAGCCTG ACGGGGCATT CCGGCAATAT CTTCGCGCCC 

10 SI GAAGGGAATT ACCGGTATCT GACTTACGGG GCGGAAAAAT TGTCCGGCGG 

1101 ATCGTATGCC CTCAGTGTGC AAGGCGAACC GGCAAAAGGC GAAATGCTTG 

1151 CGGGCACGGC CGTGTACAAC GGCGAAGTGC TGCATTTCCA TATGGAAAAC 

1201 GGCCGTCCGT CCCCGTCCGG AGGCAGGTTT GCCGCAAAAG TCGATTTCGG 

1251 CAGCAAATCT GTGGACGGCA TTATCGACAG CGGCGATGAT TTGCATATGG 

1301 GTACGCAAAA ATTCAAAGCC GTTATCGATG GAAACGGCTT TAAGGGGACT 

1351 TGGACGGAAA ATGGCGGCGG GGATGTTTCC GGAAGGTTTT ACGGCCCGGC 

14 01 CGGCGAAGAA GTGGCGGGAA AATACAGCTA TCGCCCGACA GATGCGGAAA 

14 51 AGGGCGGATT CGGCGTGTTT GCCGGCAAAA AAGAGCAGGA TTGA 

This corresponds to the amino acid sequence <SEQ ID 3108; ORF 287. a>: 

a287 .pep 

1 MFKRSVIAMA CIVALSA CGG GGGGSPDVKS ADTLSKPAAP VVTEDVGEEV 

51 LPKEKKDEEA VSGAPQADTQ DATAGKGGQD MAAVSAENTG NGGAATTDNP 

101 ENKDEGPQND MPQNAADTDS STPNHTPAPN MPTRDMGNQA PDAGESAQPA 

151 NQPDMANAAD GMQGDDPSAG ENAGNTADQA ANQAENNQVG GSQNPASSTN 

201 PNATNGGSDF GRINVANGIK LDSGSENVTL THCKDKVCDR DFLDEEAPPK 

251 SEFEKLSDEE KINKYKKDEQ RENFVGLVAD RVEKNGTNKY VIIYKDKSAS 

301 SSSARFRRSA RSRRSLPAEM PLIPVNQADT LIVDGEAVSL TGHSGNIFAP 

351 EGNYRYLTYG AEKLSGGSYA LSVQGEPAKG EMLAGTAVYN GEVLHFHMEN 

401 GRPSPSGGRF AAKVDFGSKS VDGIIDSGDD LHMGTQKFKA VIDGNGFKGT 

4 51 WTENGGGDVS GRFYGPAGEE VAGKYSYRPT DAEKGGFGVF AGKKEQD* 

m287/a287 ORFs 287 and 287. a showed a 77.2% identity in 501 aa overlap 

10 20 30 40 49 

m287 .pep MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSE KETEA 

( I I I I I i I I I I I !! I I I i I I t ! I I I I I I I I i I I I 1 I ! I I t I : I ( : M 

a28 7 MFKRSVIAMACIVALSACGGGGGGSPDVKSADTLSKPAAPVVTEDVGEEVLPKEKKDEEA 

10 20 30 40 50 60 

50 60 70 80 90 100 109 

m287 . pep KEDAPQAGSQGQGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNT^AGT 
I I I I : I i : : : I : f I I I It I I I I I M I I : I : I I I : \ : I I I I I I I I I i I | 
a 2 8 7 VSGAPQADTQ- - DATAGKGGQDMAAVSAENTGNGG AATT DN PENKDEG PQN DMPQNAADT 

70 80 90 100 110 
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110 120 130 140 150 160 169 

m287 .pep DSSTPNHTPDPNMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTA 
1 I M I ! I I I 111 : : I III I I I I 1 : I I I I I I I ! i I I I I I I I I i I I I I I : I I I I I I 
a287 DSSTPNHTPAPNMPTRDMGNQAPDAGESAQPANQPDt-lANAADGMQGDDPSAG-ENAGNTA 
120 130 140 150 160 170 

170 130 190 200 210 220 229 

m287 . pep AQGANQAGNNQAAGSSDPI PASNPAFANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDS 

I : I I I I ! I I : : I I : : I : : | | : I | I I : | t I : : : I M : : I : I : I : I I I I I I 
a2 8 7 DQAANQAENNQVGGSQN PASSTNPNATNGGSDFGRINVANGIKLDSGSENVTLTHCKDKV 

180 190 200 210 220 230 

230 240 250 260 270 280 289 

m287 .pep CSGNNFLDEEVQLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYII FYKP 

I : : I i I I I : I i M if I I I : I I : : I I I I : : : f I I I I I I I : : I I : I : I : | I 
a2 87 CD-RDFLDEEAPPKSEFEKLSDEEKINKYKKDEQRENFVGLVADRVEKNGTNKYVI IYKD 

240 250 260 270 230 290 



290 300 310 320 330 340 

m2 87 .pep KP--TSFARFRRSARSRRSLPAEMPLI PVNQADTLI VDGEAVSLTGHSGNIFAPEGNYRY 

i : I t M I I M I I ! I I I 1 , I I I I I I ! I I I I M I I I I I I f I I I I I I I I I I I I I I | | I I 
a2 87 KSASSSSARFRRSARSRRSLPAEMPLI PVNQADTLIVDGEAVSLTGHSGNI FAPEGNYRY 

300 310 320 730 340 350 



350 360 370 380 390 400 

m287 . pep LTYGAEKLPGGSYALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDF 

MINIM II M II I II I M I I M II I : M M M I II I I II II I I : I I I I I I i I I 
a2 87 LTYGAEKLSGGSYALSVQGEPAKGEMLAGTAVYNGEVLHFHMENGRPSPSGGRFAAKVDF 
360 370 380 390 400 410 



410 420 430 440 450 4 60 

m287. pep GSKSVDGI IDSGDDLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYS 

I M M I M I I M M I I II M II I I : I I M II I M M M M II I M : ll M M i I M i I I I 
a2 87 GSKSVDGI I DSGDDLHMGTQKFKAVIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYS 

420 430 440 450 4 60 470 



470 480 489 

m287 . pep YRPTDAEKGGFGVFAGKKEQDX 

II I II I M I II I M ! 1 ! I I I I I 
a2 87 YRPTDAEKGGFGVFAGKKEQDX 
480 490 



406 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3109>: 

m406 . seq 

1 ATGCAAGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT TTATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGTAAACGCT 

101 TTGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGGATT TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

201 CACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 

251 TTGATGCACT GATTCGTGGC GAATACATAA ACAGCCCTGC CGTCCGTACC 

3 01 GATTACACCT ATCCACGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 

351 TTTGACAGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 

401 CTCGCACCCA ATCAGACGGT AGCGGAAGTA AAAGCAGTCT GGGCTTAAAT 

451 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CTAACCCGCG 

501 CGACACTGCC TTTCTTTCCC ACTTGGTACA GACCGTATTT TTCCTGCGCG 

551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACAGATGT GTTTATTAAC 

6C1 ATCGACGTAT TCGGAACGAT ACGCAACAGA ACCGAAATGC ACCTATACAA 

651 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC GCAGTAGACA 
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701 GAACCAATAA AAAATTGCTC ATCAAACCAA AAACCAATGC GTTTGAAGCT 

751 GCCTATAAAG AAAATTACGC ATTGTGGATG GGGCCGTATA AAGTAAGCAA 

801 AGGAATTAAA CCGACGGAAG GATTAATGGT CGATTTCTCC GATATCCGAC 

851 CATACGGCAA TCATACGGGT AACTCCGCCC CATCCGTAGA GGCTGATAAC 

901 AGTCATGAGG GGTATGGATA CAGCGATGAA GTAGTGCGAC AACATAGACA 

951 AGGACAACCT TGA 

This corresponds to the amino acid sequence <SEQ ID 3 1 10; ORF 406>: 



I MQARLLIPIL FSVFILSA CG TLTGIPSHGG GKRF AVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGS KSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIBWSP ANADTDVFIN 

201 IDVFGTIRNR TEMKLYNAET LKAQTKLEYF AVDRTNKKLL I KPKTNAFEA 

251 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIRPYGNHTG NSAPSVEADN 

301 SHEGYGYSDE WRQHRQGQP * 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 31 1 1>: 



1 ATGCGGGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT TTATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGCAAACGCT 

101 TCGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGGATT TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

201 AACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 

251 TTGATGCACT GATTCGCGGC GAATACATAA ACAGCCCTGC CGTCCGCACC 

3 01 GATTACACCT ATCCGCGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 

3 51 TTTGACGGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 

401 CGCGCACCCA ATCAGACGGT AGCGGAAGTA GGAGCAGTCT GGGCTTAAAT 

451 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CCAACCCGCG 

501 CGACACTGCC TTTCTTTCCC ACTTGGTGCA GACCGTATTT TTCCTGCGCG 

551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACAGATGT GTTTATTAAC 

601 ATCGACGTAT TCGGAACGAT ACGCAACAGA ACCGAAATGC ACCTATACAA 

651 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC GCAGTAGACA 

701 GAACCAATAA AAAATTGCTC ATCAAACCCA AAACCAATGC GTTTGAAGCT 

751 GCCTATAAAG AAAATTACGC ATTGTGGATG GGGCCGTATA AAGTAAGCAA 

801 AGG AAT C AAA CCGACGGAAG GATTGATGGT CGATTTCTCC GATATCCAAC 

851 CATACGGCAA TCATACGGGT AACTCCGCCC CATCCGTAGA GGCTGATAAC 

901 AGTCATGAGG GGTATGGATA CAGCGATGAA GCAGTGCGAC AACATAGACA 

951 AGGGCAACCT TGA 

This corresponds to the amino acid sequence <SEQ ID 3 1 12; ORF 406>: 

g406 .pep 

1 MRARLLIPIL FSVFILSA CG TLTGIPSHGG GKRFAVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSRSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIDWSP ANADTDVFIN 

201 IDVFGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL I KPKTNAFEA 

251 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIQPYGNHTG NSAPSVEADN 

301 SHEGYGYSDE AVRQHRQGQP * 



ORF 406 shows 98.8% identity over a 320 aa overlap with a predicted ORF (ORF406.a) from 
N. gonorrhoeae: 

g406/m406 



m406 .pep 



g406 . seq 



g4 06 . pep 



10 20 30 40 50 60 

MRARLLIPILFSVFILSACGTLTGIPSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR 



m4 06 



MQARLLI PI LFSVFI LSACGTLTGI PSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR 
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10 20 30 40 50 60 

7 0 80 90 100 110 120 

q406 . pep KVALYI ATMGDQGSGSLTGGRYS IDALI RGE Y INS PAVRTDYTYPRYETTAETTSGGLTG 

MMMIIIUMIIIilllllMMIIiillilMlllllllllillllilllllllll 

m4 06 KVALYI ATMGDQGSGSLTGGRYS IDALI RGE Y INS PAVRTDYTYPRYETTAETTSGGLTG 

7 0 80 90 100 110 120 

130 140 150 160 170 130 

g4 06 .pep LTTSLSTLNAPALSRTQSDGSGSRSSI^I^IGGMGDYRNETLTTNPRDTAFLSHLVQTVF 

MIIIMMIIIIIIIIIIillhllMllliniMIIMIMIIllllllllillil 

m4 06 LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 
130 140 150 160 170 180 

190 200 210 220 230 240 

g4 0 6 .pep FLRGIDWSPANADTDVFINIDVFGTIRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL 

IMMIMIIMIMIMIIIIIIIIIIIIIMIIIMIIIIIIIMIMIIIIIIIIII 

m4 06 FLRGIDWSPANADTDVFINIDVFGTIRNRTEMHLYNAETLKAOTKLEYFAVDRTNKKLL 
190 200 210 220 230 240 

250 260 270 280 290 300 

g4 06 . pep IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIQPYGNHTGNSAP3VEADN 

MlMMMIMMilMMIIMMIIIiillMIMillhllllllllliliMIM 

m4 06 IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIRPYGNHTGNSAPSVEADN 
250 260 270 230 290 300 

310 320 
g4 0 6 . pep SHEGYGYSDEAVRQHRQGQPX 

IIMIilllhllllllllll 

m4 06 SHEGYGYSDEWRQHRQGQPX 
310 320 



The following partial DNA sequence was identified in A 7 , meningitidis <SEQ ID 3 1 13>: 

a406. seq 

1 ATGCAAGCAC GGCTGCTGAT ACCTATTCTT TTTTCAGTTT TTATTTTATC 

51 CGCCTGCGGG ACACTGACAG GTATTCCATC GCATGGCGGA GGTAAACGCT 

101 TCGCGGTCGA ACAAGAACTT GTGGCCGCTT CTGCCAGAGC TGCCGTTAAA 

151 GACATGGATT TACAGGCATT ACACGGACGA AAAGTTGCAT TGTACATTGC 

201 AACTATGGGC GACCAAGGTT CAGGCAGTTT GACAGGGGGT CGCTACTCCA 

2 51 TTGATGCACT GATTCGTGGC GAATACATAA ACAGCCCTGC CGTCCGTACC 

301 GATTACACCT ATCCACGTTA CGAAACCACC GCTGAAACAA CATCAGGCGG 

351 TTTGACAGGT TTAACCACTT CTTTATCTAC ACTTAATGCC CCTGCACTCT 

4 01 CGCGCACCCA ATCAGACGGT AGCGGAAGTA AAAGCAGTCT GGGCTTAAAT 

4 51 ATTGGCGGGA TGGGGGATTA TCGAAATGAA ACCTTGACGA CTAACCCGCG 

501 CGACACTGCC TTTCTTTCCC ACTTGGTACA GACCG TATTT TTCCTGCGCG 

551 GCATAGACGT TGTTTCTCCT GCCAATGCCG ATACGGATGT GTTTATTAAC 

601 ATCGACGTAT TCGGAACGAT ACGCAACAGA ACCGAAATGC ACCTATACAA 

651 TGCCGAAACA CTGAAAGCCC AAACAAAACT GGAATATTTC GCAGTAGACA 

701 GAACCAATAA AAAATTGCTC ATCAAACCAA AAACCAATGC GTTTGAAGCT 

7 51 GCCTATAAAG AAAATTACGC ATTGTGGATG GGACCGTATA AAGTAAGCAA 
801 AGGAATTAAA CCGACAGAAG GATTAATGGT CGATTTCTCC GATATCCAAC 

8 51 CATACGGCAA TCATATGGGT AACTCTGCCC CATCCGTAGA GGCTGATAAC 
901 AGTCATGAGG GGTATGGATA CAGCGATGAA GCAGTGCGAC GACATAGACA 
951 AGGGCAACCT TGA 

This corresponds to the amino acid sequence <SEQ ID 3 1 14; ORF 406.a>: 

a4 06 . pep 

1 MQARLLI PIL FSVFILSA CG TLTGIPSHGG GKR FAVEQEL VAASARAAVK 

51 DMDLQALHGR KVALYIATMG DQGSGSLTGG RYSIDALIRG EYINSPAVRT 

101 DYTYPRYETT AETTSGGLTG LTTSLSTLNA PALSRTQSDG SGSKSSLGLN 

151 IGGMGDYRNE TLTTNPRDTA FLSHLVQTVF FLRGIDWSP ANADTDVFIN 

201 IDV FGTIRNR TEMHLYNAET LKAQTKLEYF AVDRTNKKLL IKPKTNAFEA 
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2 51 AYKENYALWM GPYKVSKGIK PTEGLMVDFS DIQPYGNHMG NSAPSVEADN 
301 SHEGYGYSDE AVRRHRQGQP * 

m406/a406 ORFs 406 and 406. a showed a 98.8% identity in 320 aa overlap 

10 20 30 40 50 60 

m4 06 . pep MQARLL I ? I L F S V F I LS ACGT LTG I P S HGGGKR FA VE QE L V AAS ARAA VK DMDLQALHGR 

i I I ! 1 M M I M I I I I! I I M I I 1 I M I I I I j M I I i I I E j I i I I | | | | M M I i I M ! I 
a 4 06 MQARLLIPILFS VFILSACGTLTGI PSHGGGKRFAVEQELVAASARAAVKDMDLQALHGR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m4 06 .pep KVALY IATMGDQGSGSLTGGRYSIDALIRGEYINSFAVRTDYTYPRYETTAETTSGGLTG 

I I I I I I I I I I I I I I I i I I I I I I I I I I I M i i 1 M I I I I I I I It I I i I I M I I I I I I I t I I 
a4 0 6 KVALYIATMGDQGSGSLTGGRYSIDALIRGEYINSPAVRTDYTYPRYETTAETTSGGLTG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m4 06.pep LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 
I I I I I I I I I I i I I M I I I I ( I! I I M 1 t I I ! I I i I I I I I I I I 1 | I | | | | ! | | | j | | | | j | 
a4 06 LTTSLSTLNAPALSRTQSDGSGSKSSLGLNIGGMGDYRNETLTTNPRDTAFLSHLVQTVF 

130 140 150 160 170 180 

190 200 210 220 230 240 

m4 06 . pep FLRGIDVVSPANADTDVFINIDVFGTIRNRTEMHLYNAETLKAQTKLEYFAVDRTNKKLL 
I I I I I t I I I I I I I I It I! I I I I I I ! I I ! I ! I I I ! I! I I I I M ! i I I I I I I M I I I I I ! I I 
a4 06 FLRGIDWSPANADTDVFINIDVFGTIRNRTEMKLYNAETLKAQTKLEYFAVDRTNKKLL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m4 0 6. pep IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIRPYGNHTGNSAPSVEADN 
I I I i II ! I t M I I I I I I I M I I I I I I M I I I I I I I ! I 1 I ! I i : I I I i I I I I M I ! I M I 
a4 06 IKPKTNAFEAAYKENYALWMGPYKVSKGIKPTEGLMVDFSDIQPYGNHMGNSAPSVEADN 

250 260 270 280 290 300 

310 320 
m4 C 6 . pep SHEGYGYSDEWRQHRQGQPX 
I II i 1 I I I I ! : I I : I I I I I I I 
a4 0 6 SHEGYGYSDEAVRRHRQGQPX 

310 320 



EXAMPLE 2 
Expression ofORF919 
The primer described in Table 1 for ORF 919 was used to locate and clone ORF 919. 
The predicted gene 979 was cloned in pET vector and expressed in E. coli. The product of 
protein expression and purification was analyzed by SDS-PAGE. In panel A) is shown the 
analysis of 919-His fusion protein purification. Mice were immunized with the purified 919- 
His and sera were used for Western blot (panel B), FACS analysis (panel C), bactericidal 
assay (panel D), and ELISA assay (panel E). Symbols: Ml, molecular weight marker; PP, 
purified protein, TP, N. meningitidis total protein extract; OMV. N. meningitidis outer 
membrane vesicle preparation. Arrows indicate the position of the main recombinant protein 
product (A) and the N. meningitidis immunoreactive band (B). These experiments confirm 
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that 919 is a surface-exposed protein and that it is a useful immunogen. The hydrophilicity 
plots, antigenic index, and amphipatic regions of ORF 919 are provided in Figure 10. The 
AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. Immunol 
143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et ah 1992, 
Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 919 and the amino acid 
sequence encoded thereby is provided in Example 1 . 

EXAMPLE 3 
Expression of ORF 279 
The primer described in Table 1 for ORF 279 was used to locate and clone ORF 279. 
The predicted gene 279 was cloned in pGex vector and expressed in E. coli. The product of 
protein expression and purification was analyzed by M'S-PAGE. In panel A) is shown the 
analysis of 279-GST purification. Mice were immunized with the purified 279-GST and sera 
were used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay 
(panel D), and ELISA assay (panel E). Symbols: Ml, molecular weight marker; TP, N. 
meningitidis total protein extract; OMV, N. meningitidis outer membrane vescicle preparation. 
Arrows indicate the position of the main recombinant protein product (A) and the N. 
meningitidis immunoreactive band (B). These experiments confirm that 279 is a surface- 
exposed protein and that it is a useful immunogen. The hydrophilicity plots, antigenic index, 
and amphipatic regions of ORF 279 are provided in Figure 1 1 . The AMPHI program is used 
to predict putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, 
AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1:9). 
The nucleic acid sequence of ORF 279 and the amino acid sequence encoded thereby is 
provided in Example 1 . 

EXAMPLE 4 
Expression of ORF 576 and 576-1 
The primer described in Table 1 for ORF 576 was used to locate and clone ORF 576. 
The predicted gene 576 was cloned in pGex vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 576- 
GST fusion protein purification. Mice were immunized with the purified 576-GST and sera 
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were used for Western blot (panel B), FACS analysis (panel C), bactericidal assay (panel D), 
and ELISA assay (panel E). Symbols: Ml, molecular weight marker; TP, N. meningitidis 
total protein extract; OMV, N. meningitidis outer membrane vescicle preparation. Arrows 
indicate the position of the main recombinant protein product (A) and the N. meningitidis 
immunoreactive band (B).. These experiments confirm that ORF 576 is a surface-exposed 
protein and that it is a useful immunogen. The hydrophilicity plots, antigenic index, and 
amphipatic regions of ORF 576 are provided in Figure 12. The AMPHI program is used to 
predict putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, 
AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1:9). 
The nucleic acid sequence of ORF 576 and the amino acid sequence encoded thereby is 
provided in Example 1 . 

EXAMPLE 5 
Expression of ORF 519 and 519-1 
The primer described in Table 1 for ORF 519 was used to locate and clone ORF 519. 
The predicted gene 519 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 519- 
His fusion protein purification. Mice were immunized with the purified 519-His and sera were 
used for Western blot (panel B), FACS analysis (panel C), bactericidal assay (panel D), and 
ELISA assay (panel E). Symbols: Ml , molecular weight marker; TP, N. meningitidis total 
protein extract; OMV, N. meningitidis outer membrane vesicle preparation. Arrows indicate 
the position of the main recombinant protein product (A) and the N. meningitidis 
immunoreactive band (B). These experiments confirm that 519 is a surface-exposed protein 
and that it is a useful immunogen. The hydrophilicity plots, antigenic index, and amphipatic 
regions of ORF 519 are provided in Figure 13. The AMPHI program is used to predict 
putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, AIDS Res 
Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 1 1:9). The nucleic 
acid sequence of ORF 519 and the amino acid sequence encoded thereby is provided in 
Example 1 . 
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EXAMPLE 6 
Expression of QRF 121 and 121-1 
The primer described in Table 1 for ORF 121 was used to locate and clone ORF 121. 
The predicted gene 121 was cloned in pET vector and expressed in E. colt. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 121- 
His fusion protein purification. Mice were immunized with the purified 121 -His and sera were 
used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel 
D), and ELISA assay (panel E). Results show that 121 is a surface-exposed protein. Symbols: 
Ml, molecular weight marker; TP, N. meningitidis total protein extract; OMV, N. 
meningitidis outer membrane vescicle preparation .Arrows indicate the position of the main 
recombinant protein product (A) and the N. meningitidis immunoreactive band (B). These 
experiments confirm that 121 is a surface-exposed protein and that it is a useful immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 121 are provided in 
Figure 14. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1992, Scand J Immunol Suppl 11:9). The nucleic acid sequence of ORF 121 and the amino 
acid sequence encoded thereby is provided in Example 1 . 

EXAMPLE 7 
Expression of ORF 128 and 128-1 
The primer described in Table 1 for ORF 128 was used to locate and clone ORF 128. 
The predicted gene 128 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 128- 
His purification. Mice were immunized with the purified 128-His and sera were used for 
Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel D) and 
ELISA assay (panel E). Results show that 128 is a surface-exposed protein. Symbols: Ml, 
molecular weight marker; TP, N. meningitidis total protein extract; OMV, N. meningitidis 
outer membrane vesicle preparation. Arrows indicate the position of the main recombinant 
protein product (A) and the N. meningitidis immunoreactive band (B). These experiments 
confirm that 128 is a surface-exposed protein and that it is a useful immunogen. The 
hydrophilicity plots, antigenic index, and amphipatic regions of ORF 128 are provided in 
Figure 15. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
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Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 128 and the amino 
acid sequence encoded thereby is provided in Example 1 . 

EXAMPLE 8 
Expression of ORP 206 
The primer described in Table 1 for ORF 206 was used to locate and clone ORF 206. 
The predicted gene 206 was cloned in pET vector and expressed in E. coli. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 206- 
His purification. Mice were immunized with the purified 206-His and sera were used for 
Western blot analysis (panel B). It is worthnoting that the immunoreactive band in protein 
extracts from meningococcus is 38 kDa instead of 17 kDa (panel A). To gam information on 
the nature of this antibody staining we expressed ORF 206 in E. coli without the His- tag and 
including the predicted leader peptide. Western blot analysis on total protein extracts from E. 
coli expressing this native form of the 206 protein showed a recative band at a position of 38 
kDa, as observed in meningococcus. We conclude that the 38 kDa band in panel B) is specific 
and that anti-206 antibodies, likely recognize a multimeric protein complex. In panel C is 
shown the FACS analysis, in panel D the bactericidal assay, and in panel E) the ELISA assay. 
Results show that 206 is a surface-exposed protein. Symbols: Ml, molecular weight marker; 
TP, N. meningitidis total protein extract; OMV, N. meningitidis outer membrane vesicle 
preparation. Arrows indicate the position of the main recombinant protein product (A) and the 
N. meningitidis immunoreactive band (B). These experiments confirm that 206 is a surface- 
exposed protein and that it is a useful immunogen. The hydrophilicity plots, antigenic index, 
and amphipatic regions of ORF 519 are provided in Figure 16. The AMPHI program is used 
to predict putative T-cell epitopes (Gao et al 1989, J. Immunol 143:3007; Roberts et al. 1996, 
AIDS Res Human Retroviruses 12:593; Quakyi et al. 1992, Scand J Immunol Suppl 11:9). 
The nucleic acid sequence of ORF 206 and the amino acid sequence encoded thereby is 
provided in Example 1 . 
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EXAMPLE 9 
Expression of ORF 287 
The primer described in Table 1 for ORF 287 was used to locate and clone ORF 287. 
The predicted gene 287 was cloned in pGex vector and expressed in £. coli. The product of 
protein purification was analyzed by SDS-PAGE, In panel A) is shown the analysis of 287- 
GST fusion protein purification. Mice were immunized with the purified 287-GST and sera 
were used for FACS analysis (panel B), bactericidal assay (panel C), and ELISA assay (panel 
D). Results show that 287 is a surface-exposed protein. Symbols: Ml, molecular weight 
marker. Arrow indicates the position of the main recombinant protein product (A). These 
experiments confirm that 287 is a surface-exposed protein and that it is a useful immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 287 are provided in 
Figure 17. The AMPHI program is used to predict pu*a:;ve T-cell epitopes (Gao et al 1989, J. 
Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
1992, Scand J Immunol Suppl 1 1 :9). The nucleic acid sequence of ORF 287 and the ammo 
acid sequence encoded thereby is provided in Example 1. 

EXAMPLE 10 
Expression of ORF 406 
The primer described in Table 1 for ORF 406 was used to locate and clone ORF 406. 
The predicted gene 406 was cloned in pET vector and expressed in E. coll. The product of 
protein purification was analyzed by SDS-PAGE. In panel A) is shown the analysis of 406- 
His fusion protein purification. Mice were immunized with the purified 406-His and sera were 
used for Western blot analysis (panel B), FACS analysis (panel C), bactericidal assay (panel 
D), and ELISA assay (panel E). Results show that 406 is a surface-exposed protein. Symbols: 
Ml , molecular weight marker; TP, N. meningitidis total protein extract; OMV, N. 
meningitidis outer membrane vescicle preparation. Arrows indicate the position of the main 
recombinant protein product (A) and the N. meningitidis immunoreactive band (B). These 
experiments confirm that 406 is a surface-exposed protein and that it is a useful immunogen. 
The hydrophilicity plots, antigenic index, and amphipatic regions of ORF 406 are provided in 
Figure 18. The AMPHI program is used to predict putative T-cell epitopes (Gao et al 1989, J. 
Immunol 143:3007; Roberts et al. 1996, AIDS Res Human Retroviruses 12:593; Quakyi et al. 
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1992, Scand J Immunol SuppI 1 1 :9). The nucleic acid sequence of ORF 406 and the amino 
acid sequence encoded thereby is provided in Example 1 . 

EXAMPLE 1 1 

Table 2 lists several Neisseria strains which were used to assess the conservation of 



the sequence of ORF 225 among different strains. 
Table 2 



225 gene variability: List of used Neisseria strains 


Identification Strains 


Source / reference 


number 






Group B 




zoOl 225 


NG6/88 


R. Moxon / Seiler et al., 1 996 


zo02 225 


BZ198 


R. Moxon / Seiler et al., 1996 


zo03 225 


NG3/88 


R. Moxon / Seller et al, 1996 


zo04 225 


297-0 


R. Moxon / Seiler et al, 1996 


zo05_225 


1000 


R. Moxon / Seiler et al, 1996 


zo06 225 


BZ147 


R. Moxon / Seiler et al, 1996 


zo07 225 


BZ169 


R. Moxon / Seiler et al., 1996 


zo08_225 


528 


R. Moxon / Seiler et al., 1 996 


zo09 225 


NGP165 


R. Moxon / Seiler et al., 1996 


zolO 225 


BZ133 


T? Mnvnn / ^pilpr Pt nl 1 

rv. iviuAwii / otiiti t-t en., iyy\j 


zoll 225 


NGE31 


R. Moxon / Seiler et al, 1996 


zol2_225 


NGF26 


R. Moxon / Seiler et al, 1996 


zol3 225 


NGE28 


R. Moxon / Seiler et al, 1996 


zol4 225 


NGH38 


R. Moxon / Seiler et al, 1996 


zol5 225 


SWZ107 


R. Moxon / Seiler et al, 1996 


zol6_225 


NGH15 


R. Moxon / Seiler et al, 1 996 


zol7_225 


NGH36 


R. Moxon / Seiler et al, 1996 


zol8_225 


BZ232 


R. Moxon / Seiler et al, 1996 


zol9 225 


BZ83 


R. Moxon / Seiler et al, 1996 


zo20 225 


44/76 


R. Moxon / Seiler et al, 1996 


zo21 225 


MC58 


R. Moxon 


zo96 225 


2996 


Our collection 


Group A 




zo22 225 


205900 


R. Moxon 


zo23 225 


F6124 


R. Moxon 


z2491 


Z2491 


R. Moxon / Maiden et al, 1998 


Group C 




zo24 225 


90/18311 


R. Moxon 


zo25_225 


93/4286 


R. Moxon 
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Others 

zo26_225 A22 (group W) R. Moxon / Maiden et al., 1 998 
zo27_225 E26 (group X) R. Moxon / Maiden et al, 1 998 
zo28_225 860800 (group Y) R. Moxon / Maiden el al, 1998 
zo29_225 E32 (group Z) R. Moxon / Maiden ex al, 1998 

Goaococcus 

zo32_225 Ng F62 R. Moxon / Maiden et cl, 1 998 

zo33_225 Ng SN4 R. Moxon 

fal090 FA1090 R. Moxon 



References: 

Seiler A. et al, Mol. Microbiol, 1996, 19(4):841-856. 
Maiden et al, Proc. Natl. Acad. Scu USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 

>FA1090 <SEQ ID 3115> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRISKVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z2491 <SEQ ID 3116> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRVPARRAGNA 
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSISTGFDCSGF 
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

ZO01_225 <SEQ ID 3117> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNSQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO02_225 <SEQ ID 3118> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DEL IGN AMGLLG I AYR YGGT S V STGFDCSG FMQHI FKRAMG I N L PRT S AEQARMGT PVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO03_225 <SEQ ID 3119> 

MDSFFKPAVWAVLWLMFAVRLALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAt'ARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSG FMQHI FKRAMG INLPRTSAEQARMGT PVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN * 
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ZO04_225 <SEQ ID 3120> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQI LRQFAEDEQPVLPINRAPARRAG 
NADELI GSAMG LNEQ PVLPVNRV PARRAGNADEL I GNAMG LNEQPVL PVN RAP ARRAGN A 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHI FKRAMGINLPRTSAEQARMGTPVAR 
SELQPG DM V FFRT LGG SR I S KVGLY I GNN RF I K APRTGKN I E IT S LS HKYW SGKY A FARR 
VKKNDPSRFLN* 

ZG05_225 <SEQ ID 3121> 

MDSFFKPAVWAVLWLM FAVRPALADELTNLLSSREQI LRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGSAMGLNEQPVLPVNRAPARRAGNA 
DEL I GNAMGLLG I AYR YGGTS I STGFDCSGFMQH I FKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIKAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN * 

ZO06_225 <SEQ ID 3122> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQI LRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQFVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHI FKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO07_225 <SEQ ID 3123> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQI LRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMG LNEQ PVLPVNRV PARRAGNADEL IGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLG I AYR YGGTSVSTGFDCSGFMQH I FKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO08_225 <SEQ ID 3I24> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQI LRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGSAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHI FKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO09_225 <SEQ ID 3125> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQI LRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGI AYR YGGTS I STGFDCSGFMQHI FKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO10_225 <SEQ ID 3126> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQI LRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLG I AYR YGGTSVSTGFDCSGFMQH I FKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN*" 

Z011_225 <SEQ ID 3127> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELI GSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCSGF 
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN* 

Z012_225 <SEQ ID 3128> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQI LRQFAEDEQPVLPINRAPARRAG 
NADELI GSAMG LNEQPVL PVNRVPARRAGNADELIGNAMG LNEQPVL PVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHI FKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 



Z013 225 <SEQ ID 3I29> 
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MDSF FKPAVWAV LW LM FA V R P A L ADE LT N L L S S REQ I LRQ FAX DEQ P V LPVN RA P ARRAG 
NADELIGSAMGLNEQPVLPVNRVFARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFIQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN * 

Z014_225 <SEQ ID 3130> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGFNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z015_225 <SEQ ID 3131> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDIWFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYVJSGKYAFARRVKKNDPSRFLN* 

Z016_225 <SEQ ID 3132> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSRECILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGJ" .TRTSASCARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEI rSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

2017_225 <SEQ ID 3133> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFASDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z018_225 <SEQ ID 3134> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z019_225 <SEQ ID 313S> 

MDSFFKPAWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

ZO20_225 <SEQ ID 3136> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPINRAPARRAGNADELIGSAMGLNEQPVLPVNRVPARRAGNA 
DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCSGF 
MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
I HAPRTGKN IE IT SLSHKYWSGKYAFARRVKKN DPSRFLN * 

Z021_225 <SEQ ID 3137> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLS S REQI LRQFAEDEQPVLP INRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z022_225 <SEQ ID 3I38> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLSS REQI LRQFAEDEQPVLP INRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDC3GFMQHIFKRAMGINLPRT c nEQARMGTPVAR 
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SELQPG DM V F FRT LGG S R I S H VG L V I GNNR F I HA P R T G KN I E I T S L S H K Y W S G K Y A F ARR 
VKKNDPSRFLN* 

Z023_225 <SEQ ID 3139> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPAKRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIG N AMG LLGIAYRYGGTS I STGFDC3G FMQH IF K RAM GINLPRTSAE Q ARMGT PVAR 
SELQPG DM V F FRT LGG S R I S H V G L V I GNNR F I HA P R T G KN I E I T S L S H K Y W S G K Y A F ARR 
VKKNDPSRFLN * 

Z024_225 <SEQ ID 3140> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSISTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN * 

Z025_225 <SEQ ID 314I> 

MDSFFKPAVWAVLWLM FAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTS I STGFDCSGFMQH I FKRAMGINLPRTSAEQARMGT PVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN * 

2026^225 <SEQ ID 3142> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTS I STGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRI SHVGLY IGNNRFIHAPRTGKNIEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z027_225 <SEQ ID 3143> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DEL IGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHI FKRAMGINLPRTSAEQARMGT PVAR 
SELQPGDMVFFRTLGGSRI SHVGLY I GNNRFIHAPRTGKN IE I TSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z028_225 <SEQ ID 3144> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRI SHVGLYIGNNRFIHAPRTGKN IE ITS LSHKYWSGKYAFARR 
VKKNDPSRFLN * 

2029^225 <SEQ ID 3145> 

MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEQPVLPVNRVPARRAGNADELIGNAMGLNEQPVLPVNRAPARRAGNA 
DELIGNAMGLLGIAYRYGGTSVSTGFDCSGFMQHIFKRAMGINLPRTSAEQARMGTPVAR 
SELQPGDMVFFRTLGGSRI SHVGLYIGNNRFIHAPRTGKN IE ITS LSHKYWSGKYAFARR 
VKKNDPSRFLN* 

Z032_225 <SEQ ID 3146> 

MDS FFKPAVWAVLWLMFAVRPALADELTNLLSSREQILRQFAE DEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARMGAPVARSELQPGDMVFFRTLGGSRI SHVGLY IGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLN+ 

Z033_225 <SEQ ID 3147> 

MDSFFK PAVWAVLWLMFAVRSALADELTNLLSSREQILRQFAEDEQPVLPVNRAPARRAG 
NADELIGSAMGLNEQPVLPVNRAPARRAGNADELIGSAMGLLGIAYRYGGTSVSTGFDCS 
GFMQHIFKRAMGINLPRTSAEQARHGAPVARSELQPGDMVFFRTLGGSRISHVGLYIGNN 
RFIHAPRTGKNIEITSLSHKYWSGKYAFARRIKKNDPSRFLN* 

ZQ96 225 <SEQ ID 3I46> 
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MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQI LRQFAEDEQPVLPINRAPARRAG 
NADELIGSAMGLNEOPVLPVNRVPARRAGNADELIGNAf-lGLNEOPVLPVNRAPARRAGNA 
D E L I G N AMG L LG I A Y R Y GGT S I S T G F DC S G FM QH I FKRAMG INLPRTS AEQARMG TPVAR 
SELQPGDMVFFRTLGGSRISHVGLYIGNNRFIHAPRTGKN IEITSLSHKYWSGKYAFARR 
VKKNDPSRFLN* 



Figure 19 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of ammo 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 225, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



Table 3 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 235 among different strains. 
Table 3 

235 gene variability: List of used Neisseria strains 



EXAMPLE 12 



Identification Strains 
number 

Group B 



Reference 



gnmzqOl NG6/88 

gnmzq02 BZ198 

gnmzq03 NG3/88 

gnmzq04 1000 



Seiler e/a/. s 1996 
Seiler et aL, 1996 
Seiler et aL, 1996 
Seiler ef aL, 1996 
Seiler et aL, 1996 
Seiler et aL, 1996 
Seiler et aL, 1996 
Seiler et aL, 1996 
Seiler et aL, 1996 
Seiler et aL, 1996 
Seiler et aL, 1996 
Seiler et aL, 1996 
Seiler et aL, 1996 
Seiler et aL, 1996 
Seiler et aL, 1996 
Seiler et aL, 1996 
Seiler et aL, 1996 
Virji etaL, 1992 



gnmzqOS 1000 



gnmzq07 BZ169 
gnmzq08 528 



gnmzq09 NGP165 

gnmzqlO BZ133 

gnmzqll NGE31 

gnmzql3 NGE28 

gnmzq!4 NGH38 

gnmzql5 SWZ107 

gnmzql6 NGH15 

gnmzql7 NGH36 

gnmzq!8 BZ232 

gnmzql9 BZ83 

gnmzq21 MC58 



Group A 

gnmzq22 205900 



Our collection 
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gnmzq23 F6124 
z2491 Z2491 



Our collection 
Maiden et ai, 1998 



Group C 

gnmzq24 90/18311 
gninzq25 93/4286 



Our collection 
Our collection 



Others 

gnmzq26 A22 (group W) Maiden et al, 1998 
gnmzq27 E26 (group X) Maiden et al, 1998 
gninzq28 860800 (group Y) Maiden et al, 1998 
gnmzq29 E32 (group Z) Maiden et al, 1998 
gnrnzq31 NAactamica Our collection 

Gonococcus 

gnmzq32 Ng F62 Maiden et al, 1998 

gnmzq33 Ng SN4 Our collection 

fa!090 FA1090 Dempsey et al 1991 



References: 

Seiler A. etal, Mol. Microbiol, 1996, 19(4):841-856. 

Maiden R. etal, Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 

Virji M. et al, Mol. Microbiol., 1992, 6:1271-1279 

Dempsey J.F. etal, J. Bacterid., 1991, 173:5476-5486 



The amino acid sequences for each listed strain are as follows: 



FA1090 <SEQ ID 3149> 

MKPLILGLAAVLALSACQVRKAPDLDYTSFKESKPASILVVPPLNESPDVNGTWGMLAST 
AAPISEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQI FGNDAVLY ITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSL? 
DRGYQVSKTAAYNLLSPYSRNGILKGPRFVEEQPK* 

GNMZQ01 <SEQ ID 3150> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQI FGNDAVLY ITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANNLT 
DRGYQVSKTAAYMLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ02 <SEQ ID 3151> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLY ITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ03 <SEQ ID 3152> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLHESPDVNGTWGVLAST 



WO 99/57280 



PCT/US99/09346 



124 



AAPLSEAGYYV FPAAVVEE T FKQNGLT NAA D I H AV R P E K LH Q I FG N D A V L Y I T VT E Y G T S 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ04 <SEQ ID 3153> 

MKPL1LGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAG Y Y V F PAAW EE" FKQNGLT NAAD I H AV RPEKLHQIFGN D A V LYITVTEYGTS 
YQILDSVTTVS AKARL V D S RN GKE LW S G S AS I REG S N N 5 N SG L LG AL V SAWN QIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ05 <SEQ ID 3154> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANNLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ07 <SEQ ID 3155> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAG YYVFPAAWEETFKQNGLTNAAD I HAVRPEKLHQIFGNDAVLY I TVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ08 <SEQ ID 3156> 

MKPLILGLAAVLALSACQVQKAPDFDYTS FKESKPAS ILVVPPLJESPDVNGTWGVLAST 
AAPLSEAG YYVFPAAWEETFKENGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSAS IREGSNNSNSGLLGALVSAWNQIANNLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ09 <SEQ ID 3157> 

MKPLILGLAAALVLSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGMLAST 
AEPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVQPEKLHQI FGNDAVLYITITEYGTS 
YQILDSVTTVSARARLVDSRNGKVLWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GKM2Q10 <SEQ ID 3158> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAG YYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVS KT AAYNLLS PYSHNGILKG PRFVEEQPK * 

GNMZQ11 <SEQ ID 3159> 

MKPLI LGLAAVLALSACQVQKAPDFDYTS FKESKPAS I LVVP PLNE S P DVNGTWGVLAST 
AAPLSEAGY YVF PAAWEET FKQNGLTN AAD I HAVRPEKLKQI FGN DAVLY IT VTEYGT S 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK+ 

GNMZQ13 <SEQ ID 3160> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 

GNMZQ1 4 <SEQ ID 3161> 

MKPLILGLAAVLALSACQVQKAPDFDYTS FKESKPAS I LWPPLMESPDVNGTWGV LAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKG PR FVEEQPK* 

GNMZ.Q15 <SEQ ID 3162> 

MKPLI LGLAAVLALSACQVQKAPDFDYTS FKESKPAS I LWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIAlNSLT 
DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPK* 



GNMZQ16 <SEQ ID 3163> 
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MK PL I LG L AAV L A L S AC Q V QKAPD FDYTSFKESKPASI L W P ?LN ESPDVNGTWGV LAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQI LDS VTTVSAKARLVDSRNGKELWSGSAS IREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 

GNMZQ17 <SEQ ID 3i64> 

MKPLI LG L AAV LA L S rt C Q VQKA P D F D Y T S FKE S K P A Z I L V V P P LN E S ? D V N G T WG V LA S T 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQI LDS VTTVSAKARLVDSRNGKELWSGSAS IREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS P YSKNGI LKG PRFVEEQPK * 

GNMZQ18 <SEQ ID 3165> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 

GNMZQ19 <SEQ ID 3166> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQI LDSVTTVSAKARLVDSRNGKELWSGSAS IREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 

GNMZQ21 <SEQ ID 3166> 

MKPLILGIjAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQI LDSVTTVSAKARLVDSRNGKELWSGSAS IREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 

GNMZQ22 <SEQ ID 3167> 

MKPLILGLAAVLALSACQVQKAFDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGS AS IREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 

GNMZQ23 <SEQ ID 3168> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQI LDS VT TVS AKARLVDSRNGKELWSGSAS IREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 

GNMZQ24 <SEQ ID 3169> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQI LDS VTTVSAKARLVDSRNGKELWSGSAS IREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 

GNMZQ25 <SEQ ID 3170> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLY I TVTEYGTS 
YQI LDS VTTVSAKARLVDSRNGKELWSGSAS IREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 

GNMZQ26 <SEQ ID 3171> 

MK PL I LGLAAVLALS ACQVQKAP D FD YT S FKE SK PAS I LVVPPLNES PDVNGTWGMLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNGILKGPRFVEEQPK* 

GNMZQ27 <SEQ ID 3172> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILWPPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAAD I HAVRPEKLHQI FGNDAVLY I TVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGS AS IREGSNNSNSGLLGALVSAWNQIANSLT 
DRGYQVSKTAAYNLLS PYSHNG I LKG PRFVEEQPK* 
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GNMZQ28 <SEQ ID 3173> 

MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVFPLNESPDVNGTWGVLAST 
AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGI LKGPRFVEEQPK* 

GNMZQ2 9 <SEQ ID 3174 > 

MKPLILGLAAVLALSACQVQKAPDFDYTS FKESKPAS I LVVPPLNES PDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQIFGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGI LKGPRFVEEQPK* 

GNMZQ31 <SEQ ID 3175> 

MKPLILGLAAVLALSACQVQKAPDFDYTAFKESKPASILWPPLNES PDVNGTWGMLAST 
AEPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHOIFGNDAVLYITITEYGTS 
YQILDSVTTVSARARLVDSRNGKVLWSGSAS IREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKAAAYDLLSPYSHNG I LKGPRFVEEQPK* 

GNMZQ32 <SEQ ID 3176> 

MKPLILGLAAVLALSACQVRKAPDLDYTSFKESKPASILVVPt'LNES PDVNGTWGMLAST 
AAPISEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQ I FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSAS IREGSNNSNSGLLGALVGAWNQIANSLT 
DRGYQVSKTAAYNLLSPYSRNG I LKGPRFVEEQPK* 

GNMZQ3 3 <SEQ ID 3177> 

MKPL I LGLAAVLALSACQVRKAPDLDYTSFKESKPAS I LWPPLNES PDVNGTWGMLAST 
AAPI SEAGYYVFPAAWEETFKENGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSAS IREGSNNSNSGLLGALVGAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSRNG I LKGPRFVEEQPK* 

Z2491 <SEQ ID 3178> 

MKPLILGLAAVLALSACQVQKAPDFDYTS FKESKPASILWPPLNES PDVNGTWGVLAST 
AAPLSEAGYYVFPAAWEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 
YQILDSVTTVSAKARLVDSRNGKELWSGSAS IREGSNNSNSGLLGALVSAVVNQIANSLT 
DRGYQVSKTAAYNLLSPYSHNGI LKGPRFVEEQPK* 



Figure 20 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 235, farther confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 13 



Table 4 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 287 among different strains. 
Table 4 



287 gene variability: List of used Neisseria strains 

Identification Strains Reference 
number 
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Group B 



287_2 
237_9 
2S7J4 
287 21 



BZ198 
NGP165 
NGH38 
MC58 



Seiler et aL, 1996 
Seller etal., 1996 
Seiler et aL, 1996 
Virji etal., 1992 



z2491 



Group A 

Z2491 



Maiden et al. 9 1998 



Gonococcus 

fa 1 090 FA 1 090 Dempsey et al 1 99 1 



References: 

Seiler A. etai, Mol. Microbiol, 1996, 19(4):841-856. 

Maiden R. et ai, Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 

Virji M. et a/., Mol. Microbiol, 1992, 6:1271-1279 

Dempsey IF. et al., J. Bacterid, 1991, 173:5476-5486 



The amino acid sequences for each listed strain are as follows: 



287_14 <SEQ ID 3179> 

MFKRSVIAMACI FALSACGGGGGGSPDVKSADTLSKPAAPWSEKETEAKEDAPQAGSQG 
QGAPSAQGGQDMAAV3EENTGNGGAAAT DKPKNEDEGAQNDMPQNAADTDSLT PNHT PAS 
NMPAGNMENQAPDAGESEQPANQPDMANTADGMQGDDPSAGGENAGNTAAOGTNQAENNQ 
TAG SQN PAS STN P S ATN SGGDFGRTN VGN S W I DGP SQN IT LTKCKGDSC SGNN FLDEEV 
QLKSEFEKLSDADKISNYKKDGKNDGKNDKFVGLVADSVQMKGINQYI I FYKPKPTSFAR 
FRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLP 
GGSYALRVQGEPSKGEMLAGTAVYNGEVLHFHTENGRPSPSRGRFAAKVDFGSKSVDGII 
DSGDGLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGKFYGPAGEEVAGKYSYRPTDAEKG 
GFGVFAGKKEQD* 

287_2 <SEQ ID 3180> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSEKETEAKEDAPQAGSQG 
QGAPSAQGGQDMAAVSEENTGNGGAAATDKPKNEDEGAQNDMPQNAADTDSLTPNHTPAS 
NMPAGNMENQAPDAGESEQPANQPDMANTADGMQGDDPSAGGENAGNTAAQGTNQAENNQ 
TAG SQN PASSTNPSATNSGGDFGRTNVGNSWIDGPSQNITLTHCKGDSCSGNNFLDEEV 
QLKSEFEKLSDADKISNYKKDGKNDGKN DKFVGLVADSVQMKGIMQYI I FYKPKPTSFAR 
FRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLP 
GGSYALRVQGEPSKGEMLAGTAVYNGEVLHFHTENGRPSPSRGRFAAKVDFGSKSVDGII 
DSGDGLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGKFYGPAGEEVAGKYSYRPTDAEKG 
GFGVFAGKKEQD* 

287_21. <SEQ ID 3181> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPVVSEKETEAKEDAPQAGSQG 
QGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQNDMPQNAAGTDSSTPNHTPDP 
NMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTAAQGANQAGNNQ 
AAGSS DPI PASNPAPANGGSNFGRVDLANGVL I DGPSQNITLTHCKGDSCSGNN FLDEEV 
QLKSEFEKLSDADKISNYKKDGKNDKFVGLVADSVQMKGINQYI I FYKPKPTSFAR FRRS 
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ARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGSY 
ALRVQGEPAKGEMLAGAAVYNGEVLHFHTENGRPYPTRGRFAAKVDFGSKSVDGIIDSGD 
DLKMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYRPTDAEKGGFGV 
FAGKKEQD* 

287_9 <SEQ ID 3I82> 

MFKRSVIAMACIVALSACGGGGGGSPDVKSADTLSKPAAPVVTEDVGEEVLPKEKKDEEA 
VSGAPQADTQDATAGKGGQDMAAVSAENTGNGGAATTDNPENKDEGPQNDMPQNAADTDS 
STPNHTPAPNMPTRDMGNQAPDAGESAQPANQPDMANAADGMQGDDPSAGENAGNTADQA 
ANQAENNQVGGSQNPASSTNPNATNGGSDFGRINVANGIKLDSGSENVTLTHCKDKVCDR 
DFLDEEAPPKSEFEKLSDEEKINKYKKDEQRENFVGLVADRVEKNGTNKYVIIYKDKSAS 
SSSARFRRSARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYG 
AEKLSGGSYALSVQGEPAKGEMLAGTAVYNGEVLHFHMENGRPS PSGGRFAAKVDFGSKS 
VDGIIDSGDDLHMGTQKFKAVIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYRPT 
DAEKGGFGV FAGKKEQD * 

FA1090 <SEQ ID 3183> 

MFKRSVIAMACIFPLSACGGGGGGSPDVKSADTPSKPAAPVVAENAGEGVLPKEKKDEEA 
AGGAPQADTQDATAGEGSQDMAAVSAENTGNGGAATTDNPKNEDAGAQNDMPQNAAESAN 
QTGNNQPAGSSDSAPASNPAPANGGSDFGRTNVGNSWIDGPSQNITLTHCKGDSCNGDN 
LLDEEAPSKSEFEKLSDEEKIKRYKKDEQRENFVGLVrtDRVKKDGTNKYII FYTDKPPTR 
SARSRRSLPAEIPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGMYRYLTYGAEKLPGGS 

YALRVQGEPAKGEMLVGTAVYNGEVLHFHMENGRPYPSGGRF£ ^TGSKSVDGI IDSG 

DDLHMGTQKFKAAIDGNGFKGTWTENGGGDVSGRFYGPAGEEVAGKYSYRPTDAEKGGFG 
VFAGKKDRD* 

Z2491 <SEQ ID 3184> 

MFKRSVIAMACIFALSACGGGGGGSPDVKSADTLSKPAAPWSEKETEAKEDAPQAGSQG 
QGAPSAQGSQDMAAVSEENTGNGGAVTADNPKNEDEVAQN DMPQNAAGTDSSTPNHTPDP 
NMLAGNMENQATDAGESSQPANQPDMANAADGMQGDDPSAGGQNAGNTAAQGANQAGNNQ 
AAGSSDPIPASNPAPANGGSNFGRVDLANGVLIDGPSQNITLTHCKGDSCSGNNFLDEEV 
QLK SE FEKL S DADK I Y KKDGKN DK FV G LVAD S VQMKG I NQY 1 1 FYK PKPT S FAR FRR S 
ARSRRSLPAEMPLIPVNQADTLIVDGEAVSLTGHSGNIFAPEGNYRYLTYGAEKLPGGSY 
ALRVQGE PAKGEMLAGAAVYNGE VLH FHTENGRPYPTRGR FAAKVD FGSKSVDG IIDSGD 
DLHMGTQKFKAAIDGNGFKGTWTENGSGDVSGKFYGPAGEEVAGKYSYRPTDAEKGGFGV 
FAGKKEQD" 



Figure 21 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 287, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 14 



Table 5 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORF 519 among different strains. 
Table 5 



519 gene variability: List of used Neisseria strains 
Identification Strains Source / reference 
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number 

Group 

zv01_519 

zv02_519 

zv03_519ass 

zv04_519 

zv05_519 

zv06_519ass 

zv07_519 

zvll_519 

zv!2_519 

zv!8_519 

zvl9_519 

zv20_519ass 

zv21_519ass 

zv96 519 



B 

NG6/88 

BZ198 

NG3/88 

297-0 

1000 

BZ147 

BZ169 

NGE31 

NGF26 

BZ232 

BZ83 

44/76 

MC58 

2996 



Moxon / Seiler et al 
Moxon / Seiler et al 
Moxon / Seiler et al 
Moxon / Seiler et al 
Moxon / Seiler et al 
R. Moxon / Seiler et al 
R. Moxon / Seiler et al 
R. Moxon / Seiler et al, 
R. Moxon / Seiler et al, 
R. Moxon / Seiler et al, 
R. Moxon / Seiler et al, 
R. Moxon / Seiler et al, 
R. Moxon 
Our collection 



R. 
R. 
R. 
R. 

R. 



1996 
1996 
1996 
1996 
1996 
1996 
1996 
1996 
1996 
1996 
1996 
1996 



Group A 

zv22_519ass 205900 
z2491 519 Z2491 



R. Moxon 
R. Moxon / Maiden et al, 1998 



Others 

zv26_519 A22 (group W) R. Moxon / Maiden et al, 1 998 
zv27__519 E26 (group X) R. Moxon / Maiden et al, 1998 
zv28_519 860800 (group Y) R. Moxon / Maiden et al, 1998 
z^29_5 1 9ass E32 (group Z) R. Moxon / Maiden ei al., 1 998 



Gonococcus 

zv32_519 Ng F62 

fa!090 519 FA1090 



R. Moxon / Maiden et al., 1998 
R. Moxon 



References: 

Seiler A. et al, Mol. Microbiol, 1996, 19(4):841-856. 
Maiden et al, Proc, Natl. Acad. Sci. USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 



FA1090_519 <SEQ ID 3185> 

ME FFI I LLAAVAVFGFKS FWI PQQEVHVVERLGRFHRALTAGLN I LI PF1 DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSWYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE INST WSALDEAAGAWGVKVLRYE I KDLVPPQE I LRAMQAQI TAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEOYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI IDSSKTAK* 
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Z249I_519 <SEQ ID 3186> 

MEFFI I LLAAWVFGFKSFWI PQQEVHVVERLGRFHRALTAGLN I LI PFI DRVAYRKSL 
KEI PLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNY IMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAI RQI AAA LQT QG G AD A VN LKIAE QYVAA FNNLAKESNTLI MPANVA D I G 5 L 
I S AGMK IIDSSKTAK* 

ZV01_519 <SEQ ID 3187> 

MEFFI I LLVAVAVFGFKS FVVI PQQEVH WERLGRFHRALTAGLN I LI PFI DRVAYRKSL 
KEI PLDVPSQVCITRDNTQLTVDGI IYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVFPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DSSKTAK* 

ZV02_519 <SEQ ID 3188> 

MEFFI ILLVAVAVFGFKSFWI PQQEVHVVERLGRFHRALTAGLMILI PFIDRVAYRKSL 
KEI PLDVPSQVCITRDNTQLTVDGI IYFQVTDPKLASYGSSN i IMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESWTLIMPANVADIGSL 
I SAGMKI I DSSKTAK* 

ZV03_519 <SEQ ID 3189> 

MEFFI ILLVAVAVFGFKSFWI PQQEVH WERLGRFHRALTAG LNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAE5LR 
LVAEANAEAI RQI AAALQTQGGADAVNLKIAEOYVAAFNNLAKESNTL I MPANVAD I GSL 
I SAGMKI I DSSKTAK* 

ZV04_5I9 <SEQ ID 3L90> 

MEFFI ILLVAVAVFGFKSFWI PQQEVH WERLGRFHRALTAGLN I LI PFI DRVAYRHSL 
KEI PLDVPSQVCITRDNTQLTVDGI IYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DSSKTAK* 

ZV05_519 <SEQ ID 3191> 

MEFFI I LLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLN I LI PFI DRVAYRHSL 
KEI PLDVPSQVCITRDNTQLTVDGI IYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DSSKTAK * 

ZV06_519ASS <SEQ ID 3192> 

MEFFI I LLVAVAVFGFKSFWIPQQEVHWERLGRFHRALT AG LNILI PFI DRVAYRHSL 
KEI PLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTVFSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERK 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKI ARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMFANVADIGSL 
I SAGMKI I DSSKTAK* 

ZV07_519 <SEQ ID 3193> 

MEFFIILLVAVAVFG FKSFVVI PQQEVHVVERLGRFHRALTAGLNI LI PFIDRVAYRHSL 
KEI PLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNY IMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKI ARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNT1 IMPANVADIGSL 
I SAGMKI I DSSKTAK* 
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ZVll 519 <SEQ ID 3I94> 

MEFFIILLAAVAVFGFKSFVVIPQQEVHVVERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLR3VIG 
RMELDKT FEERDE INST WAALDEAAGAWGVKVLRYEIKDLVt'PQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAWLKIAEQYVAAFTWLAKESNTLIMPANVADIGSL 
I SAGMKI IDSSKTAK* 

ZV12_5I9 <SEQ ID 3195> 

MEFFIILLVAVAVFGFKSFWI PQQEVHVVERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIAJUNRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV18_519 <S£Q ID 3196> 

MEFFIILLVAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFI DRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE IN STVVAALDEAAGAWGVKVLRYEIKDLVPPQE I LRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTL1MPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV19_519 <SEQ ID 3197> 

MEFFIILLVAVAVFGFKSFWI PQQEVHVVERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNN LAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV20_519ASS <SEQ ID 3198> 

MEFFIILLVAVAVFGFKSFWI PQQEVHVVERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSM 
ISAGMKIIDSSKTAK* 

ZV21_513ASS <SEQ ID 3199> 

MEFFIILLVAVAVFG FKSFVVI PQQEVHVVERLGRFHRALTAGLNILI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGI IYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTVVAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAKANAEAIRQIAAALQTQGGADAWLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV22_519ASS <SEQ ID 3200> 

ME FFI I LLAAWVFGFKSFW I PQQEVHWERLGRFHRALTAGLN I LI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGI I YFQVTDPK LAS YGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDEINSTVVSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAKIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 

ZV26_519 <SEQ ID 3201> 

ME F FI I LLAAWVFGFKSFWIPQQEVHWERLGRFHRALTAGLNI LI PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGI IYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKT FEERDE INST WAALDEAAGAWGVKVLRYE IKDLVP PQE I LRSMQAQI TAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSSKTAK* 



ZV21 519 <3EQ ID 3202> 
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MEFFI ILLVAVAVFGFKSFWI PQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEI PLDVPSQVCITRDNTQLTVDGI I YFQVTDPKLASVGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSTVVAALDEAAGAWGVKVLRYE IKDLVPPQEILRSMQAQITA£RE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DSSKTAK* 

ZV28_519 <SEQ ID 3203> 

MEFFI ILLAAVAVFGFKSFWI PQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMA1TQLAQTTLRSVIG 
RMELDKTFEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQXAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I5AGMKIIDSSKTAK* 

2V29_519ASS <SEQ ID 3204> 

MEFFI ILLAAVAVFGFKSFWI PQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RMELDKTFEERDEINSIWSALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREPEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFinNLAKESNTLIMPANVADIGSL 
ISAGMKIIDSNKTAK* 

ZV32_519 <SEQ ID 3205> 

MEFFI ILLAAVAVFGFKSFWI PQQEVHWERLGRFHRALTAGLNIL I PFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RME LDKT FEERDE I N S T W S AL DEAAGAWGVKVLR YE I KD LV P PQE I LRAMQAQ I TAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
L V AE AN AE A I RQI AAALQTQGG A DA VN LK I AE QYV AAFNNLAKES NT L I M P AN VA D I G S L 
I S AGMK 1 1 DS S KTAK * 

ZV96_519 <SEQ ID 3206> 

HEFFIILLAAVAVFGFKSFWIPQQEVHWERLGRFHRALTAGLNILIPFIDRVAYRHSL 
KEIPLDVPSQVCITRDNTQLTVDGIIYFQVTDPKLASYGSSNYIMAITQLAQTTLRSVIG 
RME LDKT FEERDEINSTWAALDEAAGAWGVKVLRYEIKDLVPPQEILRSMQAQITAERE 
KRARIAESEGRKIEQINLASGQREAEIQQSEGEAQAAVNASNAEKIARINRAKGEAESLR 
LVAEANAEAIRQIAAALQTQGGADAVNLKIAEQYVAAFNNLAKESNTLIMPANVADIGSL 
I SAGMKI I DSS KTAK* 

Figure 22 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 519, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 15 



Table 6 lists several Neisseria strains which were used to assess the conservation of 
the sequence of ORE 919 among different strains. 
Table 6 



919 gene variability: List of used Neisseria strains 
Identification Strains Source / reference 
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number 








Group B 






zmOl 


NG6/8S 


R. 


Moxon / Seiler et al, 1 996 


zm02 


BZ198 


R. 


Moxon / Seiler et al, 1996 


zm03 


NG3/88 


R. 


Moxon / Seiler et al, 1 996 


zm04 


297-0 


R. 


Moxon / Seiler et al., 1996 


zm05 


1000 


R. 


Moxon / Seiler et al., 1996 


zm06 


BZ147 


R. 


Moxon / Seiler et al., 1996 


zm07 


BZ169 


R. 


Moxon / Seiler et al. , 1 996 


zm08n 528 


R. 


Moxon / Seiler et al., 1996 


zm09 


NGP165 


R. 


Moxon / Seiler et al, 1996 


zmlO 


BZ133 


R. 


Moxon / Seiler et al, 1996 


zml lasbc NGE31 


R. 


Moxon / Seiler et al, 1996 


zm!2 


NGF26 


R. 


Moxon / Seiler et al, 1996 


zml 3 


NGE28 


R. 


Moxon / Seiler et al, 1996 


zml 4 


NGH38 


R. 


Moxon / Seiler et al, 1996 


onl5 


SWZ107 


R. 


Moxon / Seiler et al., 1 996 




NGH15 


R. 


Moxon / Seiler et al., 1996 


zm!7 


NGH36 


R. 


Moxon / Seiler et al., 1996 


zml8 


BZ232 


R. 


Moxon / Seiler et al, 1996 


zml9 


BZ83 


R. 


Moxon / Seiler et al, 1996 


zm20 


44/76 


R. 


Moxon / Seiler et al., 1996 


zm21 


MC58 


R. 


Moxon 


zm96 


2996 


Our collection 




Group A 






zm22 


205900 




R. Moxon 



zm23asbc F6124 



z2491 



Z2491 



Group C 

zm24 90/18311 
zm25 93/4286 



R. Moxon 

R. Moxon / Maiden et al, 1998 



R. Moxon 
R. Moxon 



Others 

zm26 A22 (group W) R. Moxon / Maiden et al, 1998 
zm27bc E26 (group X) R. Moxon / Maiden ex al, 1998 
zm28 860800 (group Y) R. Moxon / Maiden et al, 1998 
zm29asbc E32 (group Z) R. Moxon / Maiden et al, 1998 
zm3 lasbc N. lactamica R. Moxon 



Gonococcus 

zm32asbc Ng F62 
zm33asbc Ng SN4 



R. Moxon / Maiden et al, 1998 
R. Moxon 



fal090 FA1090 



R. Moxon 
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References: 

Seiler A. etai, Mol. Microbiol., 1996, 19(4):841-856. 
Maiden et aL Proc. Natl. Acad. Sci. USA, 1998, 95:3140-3145. 



The amino acid sequences for each listed strain are as follows: 



FA1090 <SEQ ID 3207> 

MKKHLLRSALYGIAAAILAACQSRSIQTFPQPDTSVINGPDRPAGI PDPAGTTVAGGGAV 
VTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVKSFQAKRFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKN 
LVRIRQTGKNSGTI DNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNL.iPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSGNEGPVGALGTPLMGEYAGA 
IDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

22491 <SEQ ID 3208> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYKTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMQQN PQRLAEVLGQNPSYI FFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM01 <SEQ ID 3209> 

MKKYLFRAALYGIAAAILAACQSK3 IQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTG5AIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

2M02 <SEQ ID 3210> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM03 <SEQ ID 3211> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFP I YG I PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYF^GYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 



ZM04 <SEQ ID 3212> 
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MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGI PAPAGTTVAGGGAV 
yTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWODVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMKIQGSGRLKTPSGKYIRIGYADKNEKPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRKYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM05 <SEQ ID 3213> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTVVPHLSLPHWAAQDFAKSLQSFRLSCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVS IGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM06 <SEQ ID 3214> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVS IGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQ T LPNGMKPEYRP* 

ZM07 <SEQ ID 3215> 

MKKYLFRAALYG I AAAILAACQSKSIQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVS IGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM08N <SEQ ID 3216> 

MKKYLFRAALYGIAAAI LAACQSKS IQT FPQPDTSVINGPDRPVG I PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVS IGRYMADKGYL 
KLGQTSMQGIKAYMRQN PQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM09 <SEQ ID 3217> 

MKKYLFRAALCG I AAAI LAACQSKS IQT FPQPDT S VINGPDRPVG I PAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSFQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVS IGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM10 <SEQ ID 3218> 

MKKYLFRAALCGIAAAI LAACQSKS IQT FPQPDTSVINGPDRPVG I PAPAGTTVAGGGAV 
YT WPHLSLPHWAAQDFAKS LQS FRLGCANLKNRQGWQDVCAQAFQT PVHS FQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIROTGKNSGTIDNTGGTHTADLSOFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVS IGKYMADKGYL 
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KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM11ASBC <SEQ ID 3219> 

MKKY L FRAA LCGI AAA I LAAC QSKS IQTFPQPDTSVINGPDRPVGI PA PAG TTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSVQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYG I PDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVS IGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSSMDGFVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAH PVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM12 <SEQ ID 3220> 

MKKYLFRAALYGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHS FQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVS IGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAoSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM13 <SEQ ID 3221> 

MKKYLFRAALYGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVG I PDPAGTTVGGGGAV 
YTVVPHLSLPHWAEQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHS FQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAP ILGYAEDPVELFFMH I QGSGRLKTPSGKYIRIGYADKNEHPYVS IGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYI FFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP * 

ZM14 <SEQ ID 3222> 

MKKYL FRAALCGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVG I PAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHS FQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKY I RIGYADKNEHPYVS IGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSRNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTG3AIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP * 

ZM15 <SEQ ID 3223> 

MKKYLFRAALYGIAAAI LAACQSKS IQTFPQPDTSVINGPDRPVG I PDLAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNHQGWQDVCAQAFQTPVHS FQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADL3RFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAP I LGYAEDPVELFFMHI QGSGRLKTPSGKYIRIGYADKNEHPYVS IGKYMADKGYL 
KLGQTSMQGIKS YMRQNPQRLAEVLGQNPSYIFFRELTGSGNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM16 <SEQ ID 3224> 

MKKYLFRAALCGIAAAILAACQSKS I QTFPQPDTSVINGPGRPVG I PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHS FQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFP I YGI PDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPI LGYAEDPVELFFMHIQGSGRLKTPSGKY IRIGYADKNEH PYVS IGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 



ZM17 <SEQ ID 3225> 
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MKKYLFRAALYGIAAAILAACQSKSIQT^PQPDTSVINGPDRPVGIPDPAGTTVGGGGAV 
YTVVPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEKPYVSIGKYMADKGYL 
KLG QTSMQG IKSYMRQN PQRLAE VLGQN PS Y I FFRELTGSSNDGPVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAhPVTRKALNRLlMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM18 <SEQ ID 322&> 

MKKYLFRAALYGIAAAILAACQSKS I QTFPQPDTSVINGPDRPVG I PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRJRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
K LGQT SMQG IKSYMRQN PQRLAE V LGQNPSYI FFRE LAG SSNDG P VGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP * 

ZM19 <SEQ ID 3227> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQG IKSYMRQN PQRLAEVLGQNPSY I FFRELAGSSNDGPVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAH PVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYWQLLPNGMKPEYRP * 

ZM20 <SEQ ID 3228> 

MKKYLFRAALYG IAAAILAACQSKS I QTFPQPDTSVINGPDRPVG I PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQG IKSYMRQN PQRLAE VLGQN PS Y I FFRE LAG SSNDG PVGALGT PLMGE YAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP * 

ZM21 <SEQ ID 3229> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVG I PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQG IKS YMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAH PVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM22 <SEQ ID 3230> 

MKKYLFRAALCGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSVQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
K LGQT S MQG I KA YMQQN PQRLAE V LGQN P S Y I F FRE LT G S SN DG PVG A LGT P LMGE YAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM23ASBC <SEQ ID 3231> 

MKKYLFRAALYGIAAAILAACQSKSIQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYKTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGKYMADKGYL 
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KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVT3KALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGETAGK 
MKEPGYVWQLLPNGMKPEYRP* 

ZM24 <SEQ ID 3232> 

MKKYLFRAALCGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVGI PAPAGTTVAGGGAV 
YTWFHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLFYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVS IGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDG PVGALGTPLMGEYAGA 
VDRHYITLGAPLr/ATAHPVTRKALNRLlMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP T 

ZM25 <SEQ ID 3233> 

MKKYLFRAALCGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVGI PAPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPlLGYAEDPVELFFMHIQGSGRLKTPSGKYIRlGYADKNLnPYVS IGKYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDG PVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM26 <SEQ ID 3234> 

MKKYLFRAALYGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQT PVHSVQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEKPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMQQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYWJQLLPNGMKPEYRP * 

ZM27BC <SEQ ID 3235> 

MKKYLFRAALYGISAAILAACQSKSIQTFPQPDTSVINGPDRPAGI PDPAGTTVAGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFP I YG I PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGETAGK 
MKEPGYVWQLLPNGMKPEYRP* 

ZM28 <SEQ ID 3236> 

MKKYLFRAALCGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVKSFQAKQFFER 
YFT PWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGI PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMKIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM29ASBC <SEQ ID 3237> 

MKKYLFRAALCGIAAAILAACQSKS IQTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCAN LKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYG I PDDFI SVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSQFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPQRLAEVLGQNPSYIFFRELTGSGNDG PVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATTHPITRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP^ 



ZM31AS3C <SEQ ID 3238> 
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MKKH L FRAALYGI AAA I LAACQSKS I QT FPQP DT S 1 1 KG PDRPAG I PDPAGTT VGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPI LGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYVFFRELAGSGNDGFVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM32ASBC <SEQ ID 3239> 

MKKH LLRS AL YG I AAAI LAACQSRS I QT FPQPDT S V ING P DRPAG I PD PAGTTVAGGGA V 
YTVVPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPVHSFQAKRFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKA 
LVRIRQTGKNSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEKPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSGGDGPVGALGTPLMGGYAGA 
IORHYITLGAPLP/ATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM33ASBC <SEQ ID 3240> 

MKKHLLRSALYGIAAAI LAACQSRS I QTFPQPDTSVINGPDRPAG I PDPAGTTVAGGGAV 
YTWPHLSMPHWAAQDFAKSLQSFRLGCANLKNRQGWQDVCAQAFQTPIHSFQAKRFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDGRRTERARFPIYGIPDDFISVPLPAGLRGGKN 
LVRIRQTGKKSGTIDNAGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKNEHPYVSIGRYMADKGYL 
KLGQTSMQGIKSYMRQNPHKLAEVLGQNPSYI FFRELAGSGNEGPVGALGTPLMGEYAGA 
I DRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 

ZM96 <SEQ ID 3241> 

MKKYLFRAALYGIAAAI LAACQSKS I QTFPQPDTSVINGPDRPVGI PDPAGTTVGGGGAV 
YTWPHLSLPHWAAQDFAKSLQSFRLGCANLrCNRQGWQDVCAQAFQTPVHSFQAKQFFER 
YFTPWQVAGNGSLAGTVTGYYEPVLKGDDRRTAQARFPIYGIPDDFISVPLPAGLRSGKA 
LVRIRQTGKNSGTIDNTGGTHTADLSRFPITARTTAIKGRFEGSRFLPYHTRNQINGGAL 
DGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRIGYADKKEHPYVSIGRYMADKGYL 
KLGQTSMQGIKAYMRQNPQRLAEVLGQNPSYIFFRELAGSSNDGPVGALGTPLMGEYAGA 
VDRHYITLGAPLFVATAHPVTRKALNRLIMAQDTGSAIKGAVRVDYFWGYGDEAGELAGK 
QKTTGYVWQLLPNGMKPEYRP* 



Figure 23 shows the results of aligning the sequences of each of these strains. Dark 
shading indicates regions of homology, and gray shading indicates the conservation of amino 
acids with similar characteristics. As is readily discernible, there is significant conservation 
among the various strains of ORF 919, further confirming its utility as an antigen for both 
vaccines and diagnostics. 



EXAMPLE 16 

Using the above-described procedures, the following oligonucleotide primers were 
employed in the polymerase chain reaction (PCR) assay in order to clone the ORFs as 
indicated: 



Table 7: Oligonucleotides used for PCR to amplify complete or partial ORFs 
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ORF 


primer 


Sequence 


Restriction 

sites 


001 


Forward 


CGCGGATCCCATATG-TGGATGGTGCTGGTCAT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TGCCGTCTTGTCCCAC 


Xhol 


003 


Forward 


CGCGGATCCCATATG-GTCGTATTCGTGGC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AAAATCATGAACACGCGC 


Xhol 


005 


Forward 


CGCGGATCCCATATG-GACAATATTGACATGT 


BamHI- 

CXI 1 1 J. X 1 








Ndel 




Reverse 


CCCGCTCGAG-CATCACATCCGCCCG 


Xhol 


006 


Forward 


CGCGGATCCCATATG-CTGCTGGTGCTGG 


BamHI- 

Cll 111 1 I 








Ndel 




Reverse 


CCCGCTCGAG-AGTTCCGGCTTTOATGT 


Xhol 


007 


Forward 


CGCGGATCCCATATG-GCCGACAACAGCATCAT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AAGGCGTTCATGATATAAG 


Xhol 


008 


Forward 


CGCGGATCCCATATG-AACAACAGACATTTTG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CCTGTCCGGTAAAAGAC 


Xhol 


009 


Forward 


CGCGGATCCCATATG-CCCCGCGCTGCT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TGGCTTTTGCCACGTTTT 


Xhol 


Oil 


Forward 


CGCGGATCCCATATG-AAGACACACCGCAAG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GGCGGTCAGTACGGT 


Xhol 


012 


Forward 


CGCGGATCCCATATG-CTCGCCCGTTGCC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-AGCGGGGAAGAGGCAC 


Xhol 


013 


Forward 


CGCGGATCCCATATG-CCTTTGACCATGCT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CTGATTCGGCAAAAAAATCT 


Xhol 


018 


Forward 


CGCGGATCCCATATG-CAGCAGAGGCAGTT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GACGAGGCGAACGCC 


Xhol 


019 


Forward 


AAAGAATTC-CTGCCAGCCGGCAAGACCCCGGC 


EcoRI 




Reverse 


AAACTGCAG-TCAGCGGGCGGGGACAATGCCCAT 


PstI 


023 


Forward 


AAAGAATTC-AAAGAATATTCGGCATGGCAGGC 


Eco RI 




Reverse 


AAACTGCAG-TTACCCCCAAATCACTTTAACTGA 


PstI 


025 


Forward 


AAAGAATTC-TGCGCCACCCAACAGCCTGCTCC 


EcoRI 




Reverse 


AAACTGCAG-TCAGAACGCGATATAGCTGTTCGG 


PstI 


031 


Forward 


CGCGGATCCCATATG-GTCTCCCTTCGCTT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-ATGTAAGACGGGGACAAC 


Xhol 


032 


Forward 


CGCGGATCCCATATG-CGGCGAAACGTGC 


BamHI- 
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Reverse CCCGCICGAG-CrGGl 1111 1GA1A1 1 1GIG 


Xhol 


033 


Forward CGCGGATCCCATATG-GCGGCGGCAGACA 


BamHJ- 






Nael 




Reverse GGCGL1LGAG-A1 1 1 GCGGGA1 C ^GGAI 


Xiiol 


034 


Forward CGCGGATCLCATA iG-GCCGAAAACAGCTACGG 


BamHI- 






Ndel 




Reverse GGCGLIGGAG-1 1 1GACGA1 ITGGx ICAA1 1 


Xhol 


036 


Forward CGCGGATCCCATATG-CTGAAGCCGTGCG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CCGGACTGCGTATCGG 


Xhol 


038 


Forward CGCGGATCCCATATG-ACCGATTTCCGCCA 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TTCTACGCCGTACTGCC 


Xhol 


039 


Forward CGCGGATCCCATATG-CCGTCCGAACCGC 


BamHI- 






Nuei 




.Keverse lllul i luau- i aaj i vj/\^o/\vjv_t i auu 


AJlul 


U4I 


rr^^T/orri rnrnn atppp atatp, ttpptpppppa apphp 


o dl III 1 1 






Ndel 




p^/^rcp cccgctcg ag-GCcc a a a a aptptttp a a a 


Xhol 


A/1 ") 

042 


r rtM ,, or j PPPPP ATPPP AT ATP APP ATP ATTTPPTTPP 


Ddniri I - 






lNuei 




t? M ;p rC p CCCGCTCGA P-TTTPP A PPPTPP A TTTP A P 
ix e verse v>v^ v^. vjv^ j v^vjavvj* i i i vj v^^vjv^ i vjv^/\ i 1 i vj/a v_ 


XhoT 


A/11 


rr^T-^T/QrH A A A A A A PPT A PP. ATPPTTPTTTP A A ATP A A A AT ATP 


xvpn I 




t> PUArcp AAA PTPP A P-TT A TTPPPPTTP A CCTTCCGCCGC 


Pet T 


043a 


rr^»-x,/oT-H A A A A A A PPT APP PP A A A A PTPP A TPPPPPPTTPP A PPPTPP 


ivpn l 




T-* A A A A A A r**T*/~* A 

Reverse AAAAAACTGGAG- 


Pst 1 




i 1 AA1GG IuLAAlALuAAI Iv^vjCLCvjIGGCj 




U44 


rr^r-iTrarrl PPPPP A TPPP A T A TP CCGTCCG A PT A P A P 


O dl III 11 






Ndel 




Reverse CCCGCTCGAG-ATGCGCTACGGTAGCCA 


Xhol 


046 


Forward AAAGAATTC-ATGTCGGCAATGCTCCCGACAAG 


Eco RI 




Reverse AAACTGCAG-TCACTCGGCGACCCACACCGTGAA 


Pst I 


A/17 
U4 / 


T7r*nx/^rH CGCGG A TPPP ATA TP-PTP A TP A T A P A PPPP 


OdlllXll 






NdeT 




Reverse CCCGCTCGAG-TCCGAAAAAGCCCATTTTG 


Xhol 




Forward AAAGAATTC-ATGCTCAACAAAGGCGAAGAATTGCC 


Eco RI 




Reverse AAACTGCAG-TCAAGATTCGACGGGGATGATGCC 


Pst I 


049 


Forward AAAGAATTC-ATGCGGGCGCAGGCGTTTGATCAGCC 


EcoRI 




Reverse AAACTGCAG-AAGGCGTATCTGAAAAAATGGCAG 


PstI 


050 


Forward CGCGGATCCCATATG-GGCGCGGGCTGG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-AATCGGGCCATCTTCGA 


Xhol 


052 


Forward AAAAAAGAATTC-ATGGCTTTGGTGGCGGAGGAAAC 


EcoRI 




Reverse AAAAAAGTCGAC-TCAGGCGGCGTTTTTCACCTTCCT 


Sail 


052a 


Forward AAAAAAGAATTC-GTGGCGGAGGAAACGGAAATATCCGC 


EcoRI 
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Reverse AAAAAACTGCAG-TTAGCTGTTTTTGGAAACGCCGTCCAACCC Pst I 



073 Forward CGC GGATCCCATATG -TGTATGCCATATAAGAT BamHI- 

Ndel 

Reverse CCCG CTCGAG -CACCGGATTGTCCGAC Xhol 

075 Forward CGC GGATCCCATATG -CCGTCTTACTTCATC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATCACCAATGCCGATTATTT Xhol 

077a Forward AAAAAA GAATTC -GGCGGCATTTTCATCGACACCTTCCT Eco RJ 

Reverse AAAAAACTGCAG-TCAGACGAACATCl'GCACAAACGCAAT Pst I 

080 Forward AAA GAATTC -GCGTCCGGGCTGGTTTGGTTTTACAATTC Eco RI 
Reverse AAA CTGCAG -CTATTCTTCGGATTCTTTTTCGGG Pst I 

081 Forward AAA GAATTC -ATGAAACCACTGGACCTAAATTTCATCTG Eco RI 
Reverse AAA CTGCAG -TCACTTATCCTCCAATGCCTC Pst I 

082 Forward AAA GAATTC -ATGTGGTTGTTGAAGTVGCCTGC Eco RI 
Reverse AAA CTGCAG -TTACGCGGATTCGGCAGTTGG Pst I 

084 Forward AAA GAATTC -TATCACCCAGAATATGAATACGGCTACCG Eco RI 
Reverse AAA CTGCAG -TTATACTTGGGCGCAACATGA Pst I 

085 Forward CGC GGATCCCATATG -GGTAAAGGGCAGGACT BamHI- 

Ndel 

Reverse CCCG CTCGAG -CAAAGCCTTAAACGCTTCG Xhol 

086 Forward AAAAAA GGTACC -TATTTGGCATCAAAAGAAGGCGG Kpn I 
Reverse AAA CTGCAG -TTACTCCACCCGATAACCGCG Pst I 

087 Forward AAA GAATTC -ATGGGCGGTAAAACCTTTATGC Eco RI 
Reverse AAA CTGCAG -TTACGCCGCACACGCAATCGC Pst I 

087a Forward AAAAAA GAATTC -AAGCTATTAGGCGTGCCGATTGTGATTCA Eco RI 

Reverse AAAAAA CTGCAG -TTACGCCTGCAAGATGCCCAGCTTGCC Pst I 

088 Forward AAAAAA GAATTC -ATGTTTTTATGGCTCGCACATTTCAG Eco RI 
Reverse AAAAAA CTGCAG -TCAGCGGATTTTGAGGGTACTCAAACC Pst I 

089 Forward CGC GGATCCCATATG -CCGCCCAAAATCAC BamHl- 

Ndel 

Reverse CCCG CTCGAG -TGCGCATACCAAAGCCA Xhol 

090 Forward CGC GGATCCCATATG -CGCATAGTCGAGCA BamHl- 

Ndel 

Reverse CCCG CTCGAG -AGCAAAACGGCGGTACG Xhol 

091 Forward AAA GAATTC -ATGGAAATACCCGTACCGCCGAGTCC Eco RI 
Reverse AAA CTGCAG -TCAGCGCAGGGGGTAGCCCAAGCC Pst I 

092 Forward AAA GAATTC -ATGTTTTTTATTTCAATCCG Eco RI 
Reverse AAA CTGCAG -TCAAATCTGTTTCGACAATGC Pst I 

093 Forward AAA GAATTC -ATGCAGAATTTTGGCAAAGTGGC Eco RI 
Reverse AAACTGCAG-CTATGGCTCGTCATACCGGGC Pst I 

094 Forward AAA GAATTC -ATGCCGTCACGGAAGCGCATCAACTC Eco RI 
Reverse AAA CTGCAG -TTATCCCGGCCATACCGCCGAACA Pst I 

095 Forward AAA GAATTC -ATGTCCTTTCATTTGAACATGGACGG Eco RI 
Reverse AAA CTGCAG -TCAACGCCGCAGGCACTAACGCCC Pst I 

096 Forward AAAGAATTC-ATGGCTCGTCATACCGGGCAGGG Eco RI 
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Reverse AAA CTGCAG -TCAAAGGAAAAGGCCGTCTGAAAAGCG Pst 1 

097 Forward AAA GAATTC -ATGGACACTTCAAAACAAACACTGTTG Eco Rl 
Reverse AAA CTGCAG -TCAGCCCAAATACCAGAATTTCAG Pst I 

098 Forward AAA GAATTC -GATGAACGCAGCCCAGCATGGATACG Eco RJ 
Reverse AAA CTGCAG -TTACGACATTCTGATTTGGCA Pst I 

102 Forward AAAAAA GAATTC -GGCCTGATGATTTTGGAAGTCAACAC Eco RJ 

Reverse AAAAAACTGCAG-TTATCCTTTAAATACGGGGACGAGTTC Pst I 

105 Forward CGC GGATCCCATATG -TCCGCAAACGAATACG BamHI- 

Ndel 

Reverse CCCG CTCGAG -GTGTTCTGCCAGTTTCAG Xhol 

107 Forward AAAAAA GAATTC - Eco RI 

CTGATGATTTTGGAAGTCAACACCCATTATCC 

Reverse AAAAAA CTGCAG -TTATCCTTTAAATACGGGGACGAGTTC Pst I 

107b Forward AAAAAA GAATTC - E co rj 

GATACCCAAGCCCCCGCCGGCACAAACTACTG 
Reverse AAAAAA CTGCAG - p st I 

TTACGCGTCGCCTTTAAAGTATTTGAGCAGGCTGGAGAC 

108 Forward AAA GAATTC -ATGTTGCCGGGCTTCAACCG Eco RI 
Reverse AAA CTGCAG -TTAGCGGTACAGGTGTTTGAAGCA Pst I 

108a Forward AAA \AAGAATTC-GGTAACACATTCGGCAGCTTAGACGGTGG Eco RI 

Reverse AAA CTGCAG -TTAGCGGTACAGGTGTTTGAAGCA Pst I 

109 Forward AAA GAATTC -ATGTATTATCGCCGGGTTATGGG Eco RI 
Reverse AAA CTGCAG -CTAGCCCAAAGATTTGAAGTGTTC Pst I 

111 Forward CGC GGATCCCATATG -TGTTCGGAACAAACCGC BamHl- 

Ndel 

Reverse CCCG CTCGAG -GCGGAGCAGTTTTTCAAA Xhol 
114 Forward CGC GGATCCCATATG -GCTTCCATCACTTCGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CATCCGCGAAATCGTC Xhol 

117 Forward AAAAAA GGTACC -ATGGTCGAAGAACTGGAACTGCTG Kpn I 
Reverse AAA CTGCAG -TTAAAGCCGGGTAACGCTCAATAC Pst I 

118 Forward AAA GTCGAC ATGTGTGAGTTCAAGGATATTATAAG Sail 
Reverse AAAGCATGC-CTATTTTTTGTTGTAATAATCAAATC Sph I 

121 Forward CGC GGATCCCATATG -GAAACACAGCTTTACAT BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATAATAATATCCCGCGCCC Xhol 

122 Forward CGC GGATCCCATATG -GTCATGATTAAAATCCGCA BamHl- 

Ndcl 

Reverse CCCG CTCGAG -AATCTTGGTAGATTGGATTT Xhol 

125 Forward AAA GAATTC -ATGTCGGGCAATGCCTCCTCTCC Eco RI 
Reverse AAA CTGCAG -TCACGCCGTTTCAAGACG Pst I 

125a Forward AAAAAA GAATTC - ACGGCAGGCAGCACCGCCGCACAGGTTTr Eco RI 
Reverse AAAAAA CTGCAG - p st j 

TTATTTTGCCACGTCGGTTTCTCCGGTGAACAACGC 

126 Forward CGCGGATCCCATATG-CCGTCTGAAACCC BamHI- 
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Ndel 

Reverse CCCG CTCGAG -ATATTCCGCCGAATGCC Xhol 

127 Forward AAA GAATTC -ATGGAAATATGGAATATGTTGGACACTTG Eco RJ 
Reverse AAA CTGCAG -TTAAAGTGTTTCGGAGCCGGC Pst 1 

127a Forward AAAAAA GAATTC -AAGGAACTGATTATGTGTCTGTCGGG Eco RI 

Reverse AAA CTGCAG -TTAAAGTGTTTCGGAGCCGGC Pst I 

128 Forward CGC GGATCCCATATG -ACTGACAACGCACT BamHI- 

Ndel 

Reverse CCCG CTCGAG -GACCGCGTTGTCGAAA Xhol 

130 Forward CGC GGATCCCATATG -AAACAACTCCGCGA BamHI- 

Ndel 

Reverse CCCG CTCGAG -GAATTTTGCACCGGATTG Xhol 

132 Forward AAA GAATTC -ATGGAACCCTTCAAAACCTTAATTTG Eco RI 

Reverse AAAAAA CTGCAG -TCACCATGTCGGCATTTGAAAAAC Pst I 

134 Forward CGC GGATCCCATATG -TCCCAAGAAATCCTC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CAGTTTGACCGAATGTTC Xhol 

135 Forward CGC GGATCCCATATG -AAATACAAAAGAATCGTATT BamHI- 

Ndel 

Reverse CCCG CTCGAG -AAATTCGGTCAGAAGCAGG Xhol 

137 Forward AAAAAA GGTACC -ATGATTACCCATCCCCAATTCGATCC Kpn I 
Reverse AAAAAA CTGCAG -TCAGTGCTGTTTTTTCATGCCGAA Pst I 

137a Forward AAAAAA GAATTC -GGCCGCAAACACGGCATCGGCTTCCT Eco RI 

Reverse AAAAAA CTGCAG -TTAAGCGGGATGACGCGGCAGCATACC Pst I 

138 Forward AAAAAA GAATTC -AACTCAGGCGAAGGAGTGCTTGTGGC Eco RI 
Reverse AAAAAA TCTAGA- TCAGTTTAGGGATAGCAGGCGTAC Xba I 

141 Forward AAA GAATTC -ATGAGCTTCAAAACCGATGCCGAAATCGC Eco RI 
Reverse AAA CTGCAG -TCAGAACAAGCCGTGAATCACGCC Pst I 

142 Forward CGC GGATCCCATATG -CGTGCCGATTTCATG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AAACTGCTGCACATGGG Xhol 

143 Forward AAAAAA GAATTC - Eco RI 

ATGCTCAGTTTCGGCTTTCTCGGCGTTCAGAC 

Reverse AAAAAA CTGCAG -TCAAACCCCGCCGTGTGTTTCTTTAAT Pst I 

144 Forward AAAAAA GAATTC -GGTCTGATCGACGGGCGTGCCGTAAC Eco RI 
Reverse AAAAAA TCTAGA -TCGGCATCGGCCGGCATATGTCCG Xba I 

146 Forward AAAAA AGAATTC - Eco RI 

CGCCAAGTCGTCATTGACCACGACAAAGTC 

Reverse AAAAAA CTGCAG -TTAGGCATCGGCAAATAGGAAACTGGG Pst I 

147 Forward AAAAAA GAATTC - ACTGAGCAATCGGTGGATTTGGAAAC Eco RI 
Reverse AAAAAA TCTAGA -TTAGGTAAAGCTGCGGCCCATTTGCGG Xba I 

148 Forward AAAAAA GAATTC - Eco RI 

ATGGCGTTAAAAACATCAAACTTGGAACACGC 

Reverse AAAAAATCTAGA-TCAGCCCTTCATACAGCCTTCGTTTTG Xba I 

149 Forward CGCGGATCCCATATG-CTGCTTGACAACAAAGT BamHI- 
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Ndel 

Reverse CCCG CTCGAG -AAACTTCACGTTCACGCC Xhol 

150 Forward CGC GGATCCCATATG -CAGAACACAAATCCG BamHI- 

Ndel 

Reverse CCCG CTCGA G -ATAAACATCACGCTGATAGC Xhol 

151 Forward AAAAAAGAATTC- Eco RI 

ATGAAACAAATCCGCAACATCGCCATCATCGC 
Reverse AAAAAACTGCAG-TCAATCCAGCTTTTTAA.AGTGGCGGCG Pst I 

152 Forward AAAAAA GAATTC - Eco RJ 

ATGAAAAACAAAACCAAAGTCTGGGACCTCCC 
Reverse AAAAAACJGCAG-TCAGGACAGGAGCAGGATGGCGGC Pst I 

153 Forward AAAAAAGAATTC- ATGGCGTTTGCTTACGGTATGAC Eco Rl 
Reverse AAAAAA CTGC AG -TC AGTC ATGTTTTTC CGTTTC A TT Pst I 

153a Forward AAAAAAGAAJTC-CGGACTTCGGTATCGGTTCCCCAGCATTG Eco RJ 
Reverse AAAAAA CTGCAG - Pst i 

TTACGCCGACGAAATACTCAGACTTTTCGG 

154 Forward CGC GGATCCCATATG -ACTGACAACAGCCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TCGGCTTCCTTTCGGG Xhol 

155 Forward AAAAAAGAATTC- ATGAAAATCGGTATCCCACGCGAGTC Eco RJ 
Reverse AAAAAACJGCAG-TTACCCTTTCTTAAACATATTCAGCAT Pst I 

156 Forward AAAAAA GAATTC -GCACAGCAAAACGGTTTTGAAGC EcoRI 
Reverse AAAAAA CTGCAG -TC AAGC AGCCGCGACAAAC A CCCC Pst I 

157 Forward CGC GGATCCCATATG -AGGAACGAGGAAAAAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -AAAACACAATATCCCCGC Xhol 

158 Forward AAAAAAGAATTC-GCGGAGCAGTTGGCGATGGCAAATTCTGC EcoRJ 
Reverse AAAAAAJCJAGA-TTATCCACAGAGATTGTTTCCCAGTTC Xbal 

160 Forward CGC GGATCCCATATG -GACATTCTGGACAAAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTTTGCCCGCCTTCTTT Xhol 

163 Forward AAAAAAGGTACC-ACCGTGCCGGATCAGGTGCAGATGTG Kpn I 
Reverse AAAAAATCJAGA-TTACTCTGCCAATTCCACCTGCTCGTG Xbal 

163a Forward AAAAAAGAATTC-CGGCTGGTGCAGATAATGAGCCAGAC EcoRI 
Reverse AAAAAATCJAGA-TTACTCTGCCAATTCCACCTGCTCGTG Xbal 

164 Forward CGC GGATCCCATATG -AACCGGACTTATGCC BamHI- 

Ndel 

Reverse CCCG CTCGA G -TTTGTTTCCGTCAAACTGC Xhol 

165 Forward CGC GGATCCGCTAGC -GCTGAAGCGACAGACG BamHI- 

Nhei 

Reverse CCCG CTCGAG -AATATCCAATACTTTCGCG Xhol 
206 Forward CGC GGATCCCATATG -AAACACCGCCAACCGA BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTCTGTAAAAAAAGTATGTGC Xhol 
209 Forward CGC GGATCCCATATG -CTGCGGCATTTAGGA BamHI- 
Ndel 



WO 99/57280 



146 



PCT/US99/09346 







PPCGPTPGAG-TACCCCTGAAGGCAAP 

V < V^ V P V^ 1 V. VJ \J 1 i L V-' V - V. V^ 1 VJ l VI * VJ V_J \ - 1 V-i V V. 


Xhol 


1 1 1 

z I 1 




A A A A A AG A A TTP- A TGTTGPGGGTTGPTPtPTGP 


Frn Rl 

LIU rvi 




Reverse 


a a AAA aptpp at, pt ATprTr,rr,r, ATTr,r,r a tth a a a 

A^A^AAAAV^ 1 UL/\U"L i A i L^C 1 vJCUCrA 1 l CvjCA I I vj AA A 


Pet T 


212 


Forward 


LutuuA 1 LLtA 1 A I U-uAtAA I CI CCr I A i GO 


baranl- 








iNaei 




Reverse 


CCCOC 1 LuAu-Auuvju 1 J AOA 1 tt I 1 tt 


yViioi 


215 


Forward 


CGCGGA 1 LLLA 1 A 1 O-GC A 1 GO J 1 uOU 1 CO 1 


rj n „ -TJT 








iNQei 




Reverse 


c^r^r^r^c^Tcr^ \ n pata tpttttpt a tc* a t a a a Tr 
CCCOC 1 CO AO-C A 1AILI 1 1 1 0 1 A 1 C A 1 AAA. i C 


AilO l 


216 


Forward 


CGCGGA 1 CCCA 1 A 1 G-GCAA 1 uul AGAAAACG 


T~> „ T T I 

BamHl- 








Ndei 




Reverse 


PrPPPTPf AP TATA ATPPPTPPPPPP 

CCCOC 1 CGAG- 1 ACAA 1 CCCr 1 GCCGCC 


Xhol 


217 


Forward 


CGCGGA 1 CCCA 1 A I G-GCGG A 1 G ACGG 1 Gl G 


n T T T 

BamHI- 








Ndel 




Reverse 


CCCCrC 1 CGAO- ACCCCOAA 1 A 1 CCr AA 1 CC 


Anol 


218 


Forward 


CGCGGA 1 CCCA 1 A 1 G-G 1 CGCGG 1 CGA 1 C 


BamHl- 








Nael 




Reverse 


r^c s c^r^n*Tc*r y a p t a a ptp a tat a a tpptpp 
CCCCrC 1 COAO- 1 AAC 1 L A 1 AuAA 1 CC I CC 


AilOi 


219 


Forward 


CGCOOA 1 CCCrC 1 AOC-ACGGC AAOG 1 1 AACr 


Damrli- 








iNnei 




T? pvprQP 
rvc v 


PPCGCTPGAG-TTTAAAPPATPTPPTPAAAAP 


Xhol 




L7 r\ t-i i / o t*/i 

rur w<ii u 


rnrnn a tctp a t a tg-g a a ttp a ggp a pp a a gt a 


13 d 1 1 U 1 J 








nuci 




TD f»VPTCf* 


PPPGPTPGAG-GGPTTCrPGPGTGTP 


Xhol 


ZZ J 


rui w ai kx 


PPJPGG ATPPP AT ATG-GAPG A GTTGAPP A APP 

V-- VJ v. VJ VJ / V L V_/V_/V_//T. 1 i \ 1 VJ VJ ^vV^ VJ /A VJ 1 L VJ rvV. V^ iTV/VV. V_. 


LJ dLJJJL 11 












"D pvprQP 


PPPGPTPGAG-GTTCAGAAAGPGGGAP 

V/V/V'VJv/ 1 V_/ VJ iV VJ VJ 1 1 V^ iv VJ iuVJl VJ V^ VJ VJ VJ A v^ 


Xhol 

yvJ ivj l 


99^ 


rui WalU 


A A AGA ATTP-PTTPtPGATTATPPtTGPGPAPGPPt 


LvLU XVI 




Reverse 


AAA PTPPA TP A A A A TPPP A A A A PPPPP A T 
AAAC 1 CrC Avj- I LAAAA 1 CCC AAAACCrOCivjA 1 


Oof T 

rSt 1 


228 


Forward 


CGCGGA 1 CCCA 1 A 1 G- 1 CGC AAGAAGCC AAAC AG 


BamHi- 








iNaei 




IvCVCl aC 


PPPGPTPG A Pt-TTTGGPGGP ATPTTTP A T 

v^ V^ VJ v^ 1 V/VjAvJ" III VJVJV-' VJ VJ V A 1LJ 1 1 V, A I 


YVinT 


99Q 


rorwaiu 


PGPGGATPPP ATATG-P A AG AGGTTTTPPPP 

V_^ VJ V^VJVJ/A. I v V v/\ 1 r\ 1 Vj-UAAvJAvJVl Mil V.J v. v v 


D am U| 

Ddnirii- 








Ndel 




Reverse 


CCCGCTCGAG-ACACAATATAGCGGATGAAC 


Xhol 


9^0 
-1 _> 


T^nrward 

1 Ul VV <4l vl 


CGCGGATCCCATATG-CATCCGGGTGCPGAC 

V/ VJ v^' \_J VJ / V L V^ V- V - k i. ^ \ i. VJ V^ a 1 1 V^ V- VJ VJ VJ A VJ V^ V^ VJ 1 V V- 


LJ uilJJ. 11 








Ndel 




Reverse 


CCCGCTCGAG-AAGTTTGGCGGCTTCGG 


Xhol 


232 


Forward 


AAAAAAGAATTC-ATGTACGCTAAAAAAGGCGGTTTGGG 


EcoRI 




Reverse 


AAAAAACTGCAG-TCAAGGTTTTTTCCTGATTGCCGCCGC 


PstI 


232a 


Forward 


AAAAAAGAATTC-GCCAAGGCTGCCGATACACAAATTGA 


EcoRI 




Reverse 


AAAAAACTGCAG-TTAAACATTGTCGTTGCCGCCCAGATG 


PstI 


233 


Forward 


CGCGGATCCCATATG-GCGGACAAACCCAAG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GACGGCATTGAGCAG 


Xhol 


234 


Forward 


CGCGGATCCCATATG-GCCGTTTCACTGACCG 


BamHI- 
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Ndel 




Reverse 


GCCCAAGCTT-ACGGTTGGATTGCCATG 


HinH TIT 

I 1 11 JU 111 




Th /^t-\i / Q TV I 
I VJI W dJ VI 


PGPGGATPPPATATG-GPPTGPPA AGTTPA A A 

V^ VJ VJ VJ / \ l v_/ .r^i l i v l VJ VJ v_/ V^ 1 VJ \_ V_- rVj\ v_J 1 1 V_, /A./v / \ 


D QtY1 UT 








Ndel 




R e verse 


CCCGCTCGAG-TTTGGGCTGCTCTTC 


Xhol 

yvii vji 




F Ui W dl U 


PGPGGATPPP AT ATG-GPGPGTTTPPtPPTT 

VJ v^ vj vj ii i v_ v_. r\ i i vj VJ v_ VJ v.^ VJ ill v,^ VJ V, V. 1 1 


oailTrll- 








Ndel 




Reverse 

A V W T VI JV 


CCCGCTCGAG-ATGGGTCGCGCGCCGT 




z j>o 


rui VV aJ U 


mrnri ATrrGGT a gp- a a pggtttgg a Tncrcn 

V-VJvVJVJA 1 V-- V_^ VJ V. 1 /v VJ V, r\ r\ V_ VJ VJ 111 VJVJ/-V i VJ v_ v.- V^ vJ 


ij aniri 1 - 








KThpT 

1N11C1 




R ever^p 

rvt v vi ov 


CCCGCTCGAG-TTTGTCTAAGTTrCTGATATG 

V-^ V-/ V_* \_J 1 \s VJ £ IVJ AAA. > — J 1 V^- A / Vi k VJ 1 A V_^ V^ J. V_J /V 1 / V 1 VJ 


YVinT 


ZJ7 


rorwdru 


rrnCiA attptapatatp ptpp a pp at a a a ppt a ttp 

v^v^vJvJ/AJA 1 1 vl AUrt 1 / \ 1 U v 1 v^ v_ r\ V_ v_ r\ I /ajAjt-vJVJ 1 n 1 J vJ 


JfcCOKl- 








MHpT 




R pverse 

J\L V vl o v 


CCCGCTCGAG-TGGTGAAGAGPGGTTTAG 

v^ v^ v i v — ' i v.- v_j l \ v j a v_i v_j a v_i J ui vj rvvi V 1 V 1 A X J il. V ) 


AJlOl 


Zn-U 


T7 t*\ i / q t*rl 

rorwdru. 


CCtCGCx A TPPP ATA TP-H A PPTTPP A PP A TTTP 
v^vJv^ VJ vJr\ 1 v^ v_ /A I r\ L vJ-vJ/Av^ vJ J 1 v J VJ /\ w vJ /V 1 i 1L 


bamHl- 








MHpT 




R pvpr^e 

l\v V o V 


CCCGCTCGAG-AAACGCCATTACCCGATG 


yVJlOi 


1A\ 
Z*+ I 


V* /~\ r\ i / o t*/~1 

rui WalU 


PPPPA ATTPTAPAT ATP-PP A AP APPTPP A APT 

V^v^vJvJ/Vr\ 1 1 V 1 Av r\ 1 r\ 1 VJ~V^v^/\Z\\^/\V_^\Jf 1 V_ v^/v/\v^ I 


tCOKi- 












Reverse 


CCCGCTCGAG-GAATGCGCCTGTAATTAATC 




94? 
z, t +z. 




PGPGG ATPPP AT ATG-ATPPtPP A A APTTPTTP 

v^vJVjvJVJ/a 1 V> WiT. I y \ 1 VJ / v 1 VJ VJ V, V^ 1 1 VJ 1 1 V_J 


D om LIl 








NdpT 




Reverse 


GCCCAAGCTT-ACCGATACGGTCGCAG 


HindTTI 

1 llllvJ-llJ 


1A% 


rurw ai u 


PPPPG ATPPP ATA TP- A PP A TTTTTTPP A TPPTPP 


oarnxii- 








NdeT 

L V Uvl 




Reverse 

X VV * vi 


CCCGCTCGAG-CGACTTGGTTACCGCG 


YhoT 


OA A 


1 UI W dl U. 


PPPPtGATPPPATATP-PPPTPTPt a apppp 

V-'VJ V-'VJVJ/A 1 \s v_' V / V 1 /\ 1 VJ V/VjVJ 1 V 1 VJ/Aj^VJ V- V^ V- 


tiamni- 








NdpT 




R p vpt^p 

ivv V vl Ov 


CCCGCTCGAG-TTTTTTCGGTAGGGGATTT 

V • V V/ VJ V^ A VJ ilVJ A A A A A A V^ VJ VJ A XA VJ VJ VJ VJ 111 


YVirJ 


Z*+U 


rur wdiLi 


PPPPP A TPPP ATA TP-P A P A TPPriP A ClTCir 

V^VJ V, VJ VJ/A 1 W /A. i Al VJ*Vj/\V^/\ 1 VvVJvJVvAvJ 1 UL 


hsamril- 








NHpT 




R p verse 


CCCGCTCGAG-CCCGCGCTGPTGGAPt 

V> V_^ V^ VJ V-- A V^ VJ 1 V VJ V^ V^ V^ VJ V — ' VJ V_' A VJ V_y A VJ VJ i V V 1 


YVir*T 


947 
Z 4 * / 


r oiwd.ru 


PPPPPATPPP AT ATP PTPPP A Tm A r,TT A C 

v^vJLVJVjrV 1 V. V_ L/A. 1 S\ 1 VJ~VJ 1 V^VJ VJ/\ 1 v^vJ/\vj 1 1 AL 


BamHI- 








1NUC1 




R p verse 

I\V V Vi ov 


CCCGCTCGAG- AAGTGTTPTGTTTGPGP A 

VjV/V/VJ V A V^ VJ i V VJ 1 Vi V V_J A VJ A A 1 VJ X 1 1 V 1 V VJ V_- i \ 


AJlOl 


94R 
ZHO 


r UI WalU 


PGPGGATPPP AT ATP PPP A AArAftA APA PT 


rj „ wt TTT 

BamHl- 












Reverse 


CCCGCTCGAG-CTCATCATTATTGCTAACA 


Xhol 


249 


Forward 


CGCGGATCCCATATG-AAGAATAATGATTGCTTC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTCCCGACCTCCGAC 


Xhol 


251 


Forward 


CGCGGATCCCATATG-CGTGCTGCGGTAGT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TACGAAAGCCGGTCGTG 


Xhol 


253 


Forward 


AAAAAAGAATTC-ATGATTGACAGGAACCGTATGCTGCG 


EcoRI 




Reverse 


AAAAAACTGCAG-TTATTGGTCTTTCAAACGCCCTTCCTG 


PstI 
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253a Forward AAAAAAGAATTC-AAAATCCTTTTGAAAACAAGCGAAAACGG Eco RI 



Reverse AAAAA.A. CTGCAG -TTATTGGTCTTTCAAACGCCCTTCCTG Pst I 

254 Forward AAAAAA GAATTC -ATGTATACAGGCGAACGCTTCAATAC Eco Rl 
Reverse AAAAAA TCTAGA -TCAGATTACGTAACCGTACACGCTGAC Xba I 

255 Forward CGC GGATCCCATATG -GCCGCGTTGCGTTAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATCCGCAATACCGACCAG Xhol 

256 Forward CGC GGATCCGCTAGC -TTTTAACACCGCCGGAC BamHI- 

Nhel 

Reverse CCCG CTCGAG -ACGCCTGTTTGTGCGG Xhol 

257 Forward CGC GGATCCCATATG -GCGGTTTCTTTCCTG BamHI- 

Ndel 

Reverse CCCGCTCGAG-GCGCGTGAATATCGCG Xhol 



258 Forward AAAAAAGAATTC-GATTATTTCTGGTGGATTGTTGCGTTCAG Eco RI 



Reverse AAAAAA CTGCAG -CTACGCATAAGTTTTTACCGTTTTTGG Pst I 

258a Forward AAAAAA GAATTC -GCGAAGGCGGTGGC^ CAAGGCGA Eco RI 

Reverse AAAAAA CTGCAG -CTACGCATAAGTTTTTACCGTTTTTGG Pst I 

259 Forward CGC GGATCCCATATG -GAAGAGCTGCCTCCG BamHl- 

Ndel 

Reverse CCCG CTCGAG -GGCTTTTCCGGCGTTT Xhol 

260 Forward CGC GGATCCCATATG -GGTGCGGGTATGGT BamHI- 

Ndel 

Reverse CCCG CTCGAG -AACAGGGCGACACCCT Xhol 

261 Forward AAAAAA GAATTC -CAAGATACAGCTCGGGCATTCGC Eco RI 
Reverse AAAAAA CTGCAG -TCAAACCAACAAGCCTTGGTCACT Pst I 

263 Forward CGC GGATCCCATATG -GCACGTTTAACCGTA BamHI- 

Ndel 

Reverse CCCG CTCGAG -GGCGTAAGCCTGCAATT Xhol 

264 Forward AAAAAA GGTACC -GCCGACGCAGTGGTCAAGGCAGAA Kpn I 
Reverse AAA CTGCAG -TCAGCCGGCGGTCAATACCGCCCG Pst I 

265 Forward AAAAAA GAATTC -GCGGAGGTCAAGAGAAGGTGTTTG Eco RI 
Reverse AAAAAA CTGCAG -TTACGAATACGTCGTCAAAATGGG Pst I 

266 Forward AAA GAATTC -CTCATCTTTGCCAACGCCCCCTTC Eco RI 
Reverse AAA CTGCAG -CTATTCCCTGTTGCGCGTGTGCCA Pst I 

267 Forward AAA GAATTC -TTCTTCCGATTCGATGTTAATCG Eco RJ 
Reverse AAA CTGCAG -TTAGTAAAAACCTTTCTGCTTGGC Pst I 

269 Forward AAA GAATTC -TGCAAACCTTGCGCCACGTGCCC Eco RI 
Reverse AAA CTGCAG -TTACGAAGACCGCAACGAAAGGCAGAG Pst I 

269a Forward AAAAAA GAATTC -GACTTTATCCAAAACACGGCTTCGCC Eco RI 

Reverse AAA CTGCAG -TTACGAAGACCGCAACGAAAGGCAGAG Pst I 

270 Forward AAA GAATTC -GCCGTCAAGCTCGTTTTGTTGCAATG Eco RI 
Reverse AAA CTGCAG -TTATTCGGCGGTAAATGCCGTCTG Pst I 

271 Forward CGC GGATCCCATATG -CCTGTGTGCAGCTCGAC BamHI- 

Ndel 

Reverse CCCGCTCGAG-TCCCAGCCCCGTGGAG Xhol 
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272 Forward AAA GAATTC -ATGACCCCAAAGGAAGAACTGTTCGC Eco RJ 
Reverse AAA CTGCAG -TCAGAGCAGTTCCAAATCGGGGCT Pst I 

273 Forward AAA GAATTC -ATGAGTCTTCAGGCGGTATTTATATACCC Eco RJ 
Reverse AAA CTGCAG -TTACGCGTAAGAAAAAACTGC Pst 1 

274 Forward CGC GGATCCCATATG -ACAGATTT GGTTACGGAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTGCTTTCAGTATTATTGAA Xhol 

276 Forward AAAAAA GAATTC - EcoRI 

ATGATTTTGCCGTCGTCCATCACGATGATGCG 
Reverse AAAAAA CTGCAG -CTACACCACCATCGGCGAATTTATGGC Pst I 

277 Forward AAAAAA GAATTC - ATGCCCCGCTTTGAGGACAAGCTCGTAGG Eco RI 
Reverse AAAAAA CTGCAG -TCATAAGCCATGCTTACCTTCCAACAA Pst I 

277a Forward AAAAAA GAATTC -GGGGCGGCGGCTGGGTTGGACGTAGG Eco RI 
Reverse AAAAAA CTGCAG -TCATAAGCCATGCTTACCTTCCAACAA Pst I 

278 Forward AAAAAA GGTACC -GTCAAAGTTGTATTAATCGGGCCTTTGCC Kpn I 
Reverse AAAAAA CTGCAG -TCATTCAACCATATCAAATCTGCC Pst I 

278a Forward AAAAAA GAATTC -AAAACTCTCCTAATTCGTCATAGTCG Eco RI 

Reverse AAAAAA CTGCAG -TCATTCAACCATATCAAATCTGCC Pst I 

279 Forward CGC GGATCCCATATG -TTGCCTGCAATCACGATT BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTAGAAGCGGGCGGCAA Xhol 

280 Forward AAAAAA GGTACC -GCCCCCCTGCCGGTTGTAACCAG Kpn I 
Reverse AAAAAA CTGCAG -TTATTGCTTCATCGCGTTGGTCAAGGC Pst I 

281 Forward AAAAAA GAATTC -GCACCCGTCGGCGTATTCCTCGTCATGCG Eco RI 
Reverse AAAAAA TCTAGA -GGTCAGAATGCCGCCTTCTTTGCCGAG Xbal 

281a Forward AAAAAA GAATTC -TCCTACCACATCGAAATTCCTTCCGG EcoRI 
Reverse AAAAAA TCTAGA -GGTCAGAATGCCGCCTTCTTTGCCGAG Xbal 

282 Forward AAAAAA GAATTC -CTTTACCTTGACCTGACCAACGGGCACAG Eco RI 
Reverse AAAAAA CTGCAG -TCAACCTGCCAGTTGCGGGAATATCGT Pst I 

283 Forward CGC GGATCCCATATG -GCCGTCTTTACTTGGAAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -ACGGCAGTATTTGTTTACG Xhol 

284 Forward CGC GGATCCCATATG -TTTGCCTGCAAAAGAATCG BamHI- 

Ndel 

Reverse CCCG CTCGAG -CCGACTTTGCAAAAACTG Xhol 

286 Forward CGC GGATCCCATATG -GCCGACCTTTCCGAAAA BamHI- 

Ndel 

Reverse CCCG CTCGAG -GAAGCGCGTTCCCAAG Xhol 

287 Forward CCG GAATTC T AGCTAGC -CTTTCAGCCTGCGGG EcoRl- 

Nhel 

Reverse CCCG CTCGAG -ATCCTGCTCTTTTTTGCC Xhol 

288 Forward CGC GGATCCCATATG -CACACCGGACAGG BamHI- 

Ndel 

Reverse CCCG CTCGAG -CGTATCAAAGACTTGCGT Xhol 
290 Forward CGCGGATCCCATATG-GCGGTTTGGGGCGGA BamHI- 
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Ndel 

Reverse CCCG CTCGAG -TCGGCGCGGCGGGC Xhol 
292 Forward CGC GGATCCCATATG -TGCGGGCAAACGCCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTGATTTTTGCGGATGATTT Xhol 

294 Forward AAAAAA GAATTC -GTCTGGTCG ATTCGGGTTGTC A G A A C Eco RJ 
Reverse AAAAAACTGCAG-TTACCAGCTGATATAAAACATCGCTTT Pst I 

295 Forward CGC GGATCCCATATG -AACCGGCCGGCCTCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CGATATTTGATTCCGTTGC Xhol 

297 Forward AAAAAA GAATTC -GCATACATTGCTTCGACAGAGAG Eco RI 
Reverse AAAAAA CTGCAG -TCAATCCGATTGCGACACGGT Pst I 

298 Forward AAAAAA G AATTC -CTGATTGCCGTGTGGTTC AGCC A A A A CCC Eco RI 
Reverse AAAAAA CTGCAG -TCATGGCTGTGTAC f TGATGGTTGCGT Pst I 

299 Forward CGC GGATCCGCTAGC -CTACCTGTCGCCTCCG BamHI- 

Nhel 

Reverse CCCG CTCGAG -TTGCCTGATTGCAGCGG Xhol 
302 Forward AAAAAAGAATTC-ATGAGTCAAACCGATACGCAACG Eco RJ 

Reverse AAAAAACTGCAG-TTAAGGTGCGGGATAGAATGTGGGCGC Pst I 

305 Forward AAAAAAGGTACC-GAATTTTTACCGATTTCCAGCACCGGA Kpn I 
Reverse AAAAAACTGCAG-TCATTCCCAACTTATCCAGCCTGACAG Pst I 

305a Forward AAAAAA GGTACC -TCCCGTTCGGGCAGTACGATTATGGG Kpn I 
Reverse AAAAAACTGCAG-TTACAAACCGACATCATGCAGGGTGAA Pst I 

306 Forward CGC GGATCCCATATG -TTTATGAACAAATTTTCCC BamHl- 

Ndel 

Reverse CCCG CTCGAG -CCGCATCGGCAGAC Xhol 
308 Forward CGC GGATCCCATATG -TTAaATCGGGTATTTTATC BamHl- 

Ndel 

Reverse CCCG CTCGAG -ATCCGCCATTCCCTGC Xhol 

311 Forward AAAAAA GGTACC -ATGTTCAGTTTTGGCTGGGTGTTT Kpn I 
Reverse AAACTGCAG-ATGTTCATATTCCCTGCCTTCGGC Pst I 

312 Forward AAAAAAGGTACC-ATGAGTATCCCATCCGGCGAAATT Kpn I 
Reverse AAACTGCAG-TCAGTTTTTCATCGATTGAACCGG Pst I 

313 Forward AAAAAAGAATTC-ATGGACGACCCGCGCACCTACGGATC Eco RI 
Reverse AAAAAACTGCAG-TCAGCGGCTGCCGCCGATTTTGCT Pst I 

401 Forward CGC GGATCCCATATG -AAGGCGGCAACACAGC BamHI- 

Ndel 

Reverse CCCGCTCGAG-CCTTACGTTTTTCAAAGCC Xhol 

402 Forward AAAAAAGjAATTC-GTGCCTCAGGCATTTTCATTTACCCTTGC Eco RI 
Reverse AAAAAA TCTAG A -TT AAATCCCTCTGCCGT A TTTGT A TTC Xba I 

402a Forward AAAAAAGAATTC-AGGCTGATTGAAAACAAACACGG Eco RI 

Reverse AAAAAA TCTAGA -TT AAATCCCTCTGCCGT A TTTGT A TTr Xba I 
406 Forward CGC GG ATCCC ATATG -TGCGGG AC ACTG A C A G BamHI- 

Ndel 

Reverse CCCGCTCGAG-AGGTTGTCCTTGTCTATG Xhol 
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501 


Forward 


CGCGGATCCCATATG-GCAGGCGGAGATGGC 


RamHI 
DalllTlj 








Ndel 




Reverse 


CCCGCTCGAG-GGTGTGATGTTCACCC 


Xhol 


502 


Forward 


CGCGGATCCCATATG-GTAGACGCGCTTAAGCA 


DomUT 

Ddjiin l - 








Ndel 




Reverse 


CCCGCTCGAG-AGCTGCATGGCGGCG 


Xhol 


503 


Forward 


CG CGG ATCCC AT ATG-TGTTCGGGGA A A GGCG 


d dill n l - 








Ndel 




Reverse 


CCCGCTCGAG-CCGCGCATTCCTCGCA 


Xhol 


504 


Forward 


CGCGG ATCCC ATATG-AGCG AT ATTG A A GTG A CG 


DalTiJi J- 








Ndel 




Reverse 


GCCCAAGCTT-TGATTCAAGTCCTTGCCG 


HinHTTT 

JL 111 J. kill I 


505 


Forward 


CGCGG ATCCC AT ATG-TTTCGTTT A C A A TTC A GG 


oairixii- 








Ndel 




Reverse 


CCCGCTCGAG-CGGCGTTTTATAGCGG 


YhnT 


510 


Forward 


CGCGGATCCCATATG-CCTTCGCGGACAC 


oanirll- 








Ndel 




Reverse 


CCCGCTCGAG-GCGCACTGGCAGCG 


XhnI 


512 


Forward 


CGCGGATCCCATATG-GGACATGAAGTA APGHT 

vvjvvjvjj v x V- v- V^ l v i / v l VJ \J\J/a\_//t, J. vJ iu" VJ 1 j\ 1 v vJ V_J 1 


r> , irn n| 








Ndel 




Reverse 


CCCGCTCGAG-AGGAATAGCCTTTGACG 


XhnT 


515 


Forward 

L \Jl TT til U. 


CGCGGATCCCATATG-GAGGA A AT AGCCTTCn A 

\ ± v^ v^- v^ v a l VJ V J r\. VJ vJ /vri_Jv 1 AVJvVj 1 1 V^VJ/i 


oairuiJ- 








Ndel 




Reverse 


CCCGCTCGAG-AAATGCCGCAAAGCATC 


Xhol 


516 


Forward 


CGCGGATCCCATATG-TGTACGTTGATGTTGTGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTGCGGGCGGCATC 


Xhol 


517 


Forward 


CGCGGATCCCATATG-GGTAAAGGTGTGGAAATA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-GTGCGCCCAGCCGT 


Xhol 


518 


Forward 


AAAGAATTC-GCTTTTTTACTGCTCCGACCGGAAGG 


Eco RI 




Reverse 


AAACTGCAG-TCAAATTTCAGACTCTGCCAC 




519 


Forward 


CGCGGATCCCATATG-TTCAAATrrTTTGTrOTrA 


Jt>amriJ- 








Ndel 




Reverse 


CCCGCTCGAG-TTTGGCGGTTTTGCTGC 


XhnT 


520 


Forward 


CGCGGATCCC AT ATG-CCTGCGCTTCTTTC A 

v — ' ^— * ^ — VJ VJ i a. i v_> v v^j V X i L. 1 V.' V-- x vJ V^ VJ Vw' A A Vv All V^ i\ 


DoITlrll- 








Ndel 




Reverse 


CCCGCTCGAG-ATATTTACATTTCAGTCGGC 


Xhol 


521 


Forward 


CGCGGATCCCATATG-GCCAAAATCTATACCTGC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CATACGCCCCAGTTCC 


Xhol 


522 


Forward 


CGCGGATCCCATATG-ACTGAGCCGAAACAC 


BamHI- 








Ndel 




Reverse 


GCCCAAGCTT-TTCTGATTTCAAATCGGCA 


Hindlll 


523 


Forward 


CGCGGATCCCATATG-GCTCTGCTTTCCGCG 


BamHI- 








Ndel 
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Reverse CCCG CTCGAG -AGGGTGTGTGATAATAAGAAG Xhol 

525 Forward CGC GGATCCCATATG -GCCGAAATGGTTCAAATC BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCCCGTGCATATCATAAA Xhol 

527 Forward AAA GAATTC -TTCCCTCAATGTTGCCGTTTTCG Eco RI 

Reverse AAA CTGCAG -TTATGCTAAACTCGAAACAAATTC Pst I 

529 Forward CGC GGATCCGCTAGC -TGCTCCGGCAGCAAAAC BamHI- 

Nhel 

Reverse GCCC AAGCTT -ACGCAGTTCGGAATGGAG Hindlll 

530 Forward CGC GGATCCCATATG -AGTGCGAGCGCGG BamHI- 

Ndel 

Reverse CCCG CTCGAG -ACGACCGACTGATTCCG Xhol 

531 Forward AAAAAA GAATTC -TATGCCGCCGCCTACCAAATCTACGG Eco RI 
Reverse AAAAAACTGCAG-TTAAAACAGCGCCGTGCCGACGACAAG Pst I 

532 Forward AAAAAAGj^ATTC-ATGAGCGGTCAGTTGGGCAAAGGTGC Eco RI 
Reverse AAAAAA CTGCAG -TCAGTGTTCCAAGTC 37CGGTATCAAA Pst I 

532a Forward AAAAAA GAATTC -TTGGGTGTCGCGTTTG AGCCGG A A GT Eco RI 

Reverse AAAAAACTGCAG-TCAGTGTTCCAAGTGGTCGGTATCAAA Pst I 

535 Forward AAA GAATTC -ATGCCCTTTCCCGTTTTCAGAC Eco RI 

Reverse AAA CTGCAG -TCAGACGACCCCGCCTTCCCC Pst I 

537 Forward CGC GGATCCCATATG -CATACCCAAAACCAATCC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATCCTGCAAATAAAGGGTT Xhol 

538 Forward CGC GGATCCCATATG -GTCGAGCTGGTCAAAGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TGGCATTTCGGTTTCGTC Xhol 

539 Forward CGC GGATCCGCTAGC -GAGGATTTGCAGGAAA BamHI- 

Nhel 

Reverse CCCG CTCGAG -TACCAATGTCGGCAAATC Xhol 

542 Forward AAA GAATTC -ATGCCGTCTGAAACCGTGTC Eco RI 
Reverse AAA CTGCAG -TTACCGCGAACCGGTCAGGAT Pst I 

543 Forward AAAAAA GAATTC -GCCTTCGATGGCGACGTTGTAGGTAC Eco RI 
Reverse AAAAA ATCTAGA - Xba I 

TTAATGAAGAAGAACATATTGGAATTTTGG 

543a Forward AAAAAA GAATTC -GGCAAAACTCGTCATGAATTTGC Eco RI 

Reverse AAAAAA TCTAGA - Xba I 
TTAATGAAGAAGAACATATTGGAATTTTGG 

544 Forward AAAOAATTC-GCGCCCGCCTTCTCCCTGCCCGACCTGCACGG Eco RI 
Reverse AAA CTGCAG -CTATTGCGCCACGCGCGTATCGAT Pst I 

544a Forward AAAAAA GAATTC - Eco RI 
GCAAATGACTATAAAAACAAAAACTTCCAAGTACTTGC 

Reverse AAACTGCAG-CTATTGCGCCACGCGCGTATCGAT Pst I 

547 Forward AAA GAATT C-ATGTTCGTAGATAACGG ATTTA ATA A A AP Eco RI 
Reverse AAACTGCAG-TTAACAACAAAAAACAAACCGCTT Pst I 

548 Forward AAAGAATTC-GCCTGCAAACCTCAAGACAACAGTGCGGC Eco RI 
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Reverse AAA CTGCAG -TCAGAGCAGGGTCCTTACATCGGC Pst I 

550 Forward AAAAAA GTCGAC - Sal I 

ATGATAACGGACAGGTTTCATCTCTTTCATTTTCC 

Reverse AAA CTGCAG -TTACGCAAACGCTGCAAAATCCCC Pst I 

550a Forward AAAAAA GAATTC -GTAAATCACGCCTTTGGAGTCGCAAACGG Eco RJ 

Reverse AAA CTGCAG -TTACGCAAACGCTGCAAAATCCCC Pst I 

552 Forward AAAAAA GAATTC -TTGGCGCGTTGGCTGGATAC Eco RI 

Reverse AAA CTGCAG -TTATTTCTGATGCCTTTTCCCAAC Pst I 

554 Forward CGC GGA TCCCATATG -TCGCCCGCGCCCAAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CTGCCCTGTCAGACAC Xhol 

556 Forward AAA GAATTC -GCGGGCGGTTTTGTTTGGACATCCCG Eco RJ 
Reverse AAA CTGCAG -TTAACGGTGCGGACGTTTCTGACC Pst I 

557 Forward CGC GGATCCCATATG -TGCGGTTTCCACCTGAA BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTCCGCCTTCAGAAAGG Xhol 

558 Forward AAA GAATTC -GAGCTTTATATGTTTCAACAGGGGACGGC EcoRl 
Reverse AAA CTGCAG -CTAAACAATGCCGTCTGAAAGTGGAGA Pst I 

558a Forward AAAAAA GAATTC -ATTAGATTCTATCGCCATAAACAGACGGG Eco RI 
Reverse AAAAAA CTGCAG -CTAAACAATGCCGTCTGAAAGTGGAGA Pst I 

560 Forward AAAAAA GAATTC - Eco RI 

TCGCCTTTCCGGGACGGGGCGCACAAGATGGC 
Reverse AAAAAA CTGCAG -TCATGCGGTTTCAGACGGCATTTTGGC Pst I 

561 Forward CCG GAATTC TA CATATG -ATACTGCCAGCCCGT EcoRI- 

Ndel 

Reverse CCCG CTCGAG -TTTCAAGCTTTCTTCAGATG Xhol 

562 Forward CGC GGATCCCATATG -GCAAGCCCGTCGAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AGACCAACTCCAACTCGT Xhol 

565 Forward CGC GGATCCCATATG -AAGTCGAGCGCGAAATAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -GGCATTGATCGGCGGC Xhol 

566 Forward CGC GGATCCCATATG -GTCGGTGGCGAAGAGG BamHI- 

Ndel 

Reverse CCCG CTCGAG -CGCATGGGCGAAGTCA Xhol 

567 Forward CCG GAATTC TA CATATG -AGTGCGAACATCCTTG EcoRI- 

Ndel 

Reverse CCCG CTCGAG -TTTCCCCGACACCCTCG Xhol 

568 Forward CGC GGATCCCATATG -CTCAGGGTCAGACC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CGGCGCGGCGTTCAG Xhol 

569 Forward AAAAAA GAATTC -CTGATTGCCTTGTGGGAATATGCCCG Eco RI 
Reverse AAAAAA CTGCAG -TTATGCATAGACGCTGATAACGGCAAT Pst I 

570 Forward CGC GGATCCCATATG -GACACCTTCCAAAAAATCG BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCGGGCGTTCATTTCTTT Xhol 
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571 Forward AAAAAA GAATTC - Eco RJ 

ATGGGTATTGCCGGCGCCGTAAATGTTTTGAACCC 
Reverse AAAAAACTGCAG-TTATGGCCGACGCGCGGCTACCTGACG Pst 1 

572 Forward CGC GGATCCCATATG -GCGCAAAAAGGCAAAACC BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCGCAGTGTGCCGATA Xhol 

573 Forward CGC GGATCCCATATG -CCCTGTTTGTGCCG BamHI- 

Ndel 

Reverse CCCG CTCGAG -GACGGTGTCATTTCGCC Xhol 

574 Forward CGC GGATCCCATATG -TGGTTTGCCGCCCGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -AACTTCGATTTTATTCGGG Xhol 

575 Forward CGC GGATCCCATATG -GTTTCGGGCGAGG BamHI- 

Ndel 

Reverse CCCG CTCGAG -CATTCCGAATCTGAACAG Xhol 

576 Forward CGC GGATCCCATATG -GCCGCCCCCGCATCT BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATTTACTTTTTTGATGTCGAC Xhol 

577 Forward CGC GGATCCCATATG -GAAAGGAACGGTGTATTT BamHI- 

Ndel 

Reverse CCCG CTCGAG -AGGCTGTTTGGTAGATTCG Xhol 

578 Forward CGC GGATCCCATATG -AGAAGGTTCGTACAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCCAACGCCTCCACG Xhol 

579 Forward CGC GGATCCCATATG -AGATTGGGCGTTTCCAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -AGAATTGATGATGTGTATGT Xhol 

580 Forward CGC GGATCCCATATG -AGGCAGACTTCGCCGA BamHI- 

Ndel 

Reverse CCCG CTCGAG -CACTTCCCCCGAAGTG Xhol 

581 Forward CGC GGATCCCATATG -CACTTCGCCCAGC BamHl- 

Ndel 

Reverse CCCG CTCGAG -CGCCGTTTGGCTTTGG Xhol 

582 Forward AAAAAA GAATTC -TTTGGAGAGACCGCGCTGCAATGCGC Eco RI 
Reverse AAAAAA TCTAGA -TCAGATGCCGTCCCAGTCGTTGAA Xba I 

583 Forward AAAAAAGAATTC- ACTGCCGGCAATCGACTGCATAATCG Eco RI 
Reverse AAAAAACTGCAG-TTAACGGAGGTCAATATGATGAAATTG Pst I 

584 Forward AAAAAA GAATTC - Eco RJ 

GCGGCTGAAGCATTGAATTACAATATTGTC 
Reverse AAAAAACTGCAG-TCAGAACTGAACCGTCCCATTGACGCT Pst I 

585 Forward AAAAAAGGJACC-TCTTTCTGGCTGGTGCAGAACACCCTTGC Eco RI 
Reverse AAAAAA CTGCAG -TCAGTTCGCACTTTTTTCTGTTTTGGA Pst I 

586 Forward CGC GGATCCCATATG -GCAGCCCATCTCG BamHI- 

Ndel 

Reverse CCC GCTCGAG -TTTCAGCGAATCAAGTTTC Xhol 

587 Forward CGCGGATCCCATATG-GACCTGCCCTTGACGA BamHI- 
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Ndel 

Reverse CCCG CTCGAG -AAATGTATGCTGTACGCC Xhol 

588 Forward AAAAAAGAATTC-GCCGTCCTGACTTCCTATCAAGAACCAGG Eco RI 
Reverse AAAAAACTGCAG-TTATTTGTTTTTGGGCAGTTTCACTTC Pst I 

589 Forward AAAAAA GAATTC - E co rj 

ATGCAACAAAAAATCCGTTTCCAAATCGAAGG 
Reverse AAAAAA CTGCAG -CTAATCGATTTTTACCCGTTTCAGGCG Pst I 

590 Forward AAAAAA GAATTC -ATGAAAAAACCTTTGATTTCAGTTGCGGG Eco RJ 
Reverse AAAAAA CTGCAG -TTACTGCTGCGGCTCTGAAACCAT Pst I 

591 Forward AAAAAA GAATTC -CACTACATCGTTGCCAGATTGTGCGG EcoRJ 
Reverse AAAAAA CTGCAG -CTAACCGAGCAGCCGGGTAACGTCGTT Pst I 

592a Forward AAAAAAGAATTC-CGCGATTACACCGCCAAGCTGA7VAATGGG Eco Rl 

Reverse AAAAAA CTGCAG -TTACCAAACGTCGGATTTGATACG Pst I 

593 Forward CGC GGATCCGCTAGC -CTTGAACTGAACGGACTC BamHI- 

Nhel 

Reverse CCCG CTCGAG -GCGGAAGCGGACGATT Xhol 
594a Forward AAAAAAGAATTC-GGTAAGTTCGCCGTTCAGGCCTTTCA Eco RJ 

Reverse AAAAAACTGCAG-TTACGCCGCCGTTTCCTGACACTCGCG Pst I 

595 Forward AAAAAA GAATTC -TGCC AGCCGCCGGAGGCGG AG A A A GC Eco Rl 
Reverse AAAAAACTGCAG-TTATTTCAAGCCGAGTATGCCGCG Pst I 

596 Forward CGC GGATCCCATATG -TCCCAACAATACGTC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ACGCGTTACCGGTTTGT Xhol 

597 Forward CGC GGATCCCATATG -CTGCTTCATGTCAGC BamHI- 

Ndel 

Reverse GCCCAAGCTT-ACGTATCCAGCTCGAAG Hindlll 

601 Forward CGC GGATCCCATATG -ATATGTTCCCAACCGGCAAT BamHI- 

Ndel 

Reverse CCCG CTCGAG -AAAACAATCCTCAGGCAC Xhol 

602 Forward CGC GGATCCGCTAGC -TTGCTCCATCAATGC BamHI- 

Nhel 

Reverse CCCG CTCGAG -ATGCAGCTGCTAAAAGCG Xhol 

603 Forward AAAAAA GAATTC -CTGTCCTCGCGTAGGCGGGGACGGGG Eco RI 
Reverse AAAAAACTGCAG-CTACAAGATGCCGGCAAGTTCGGC Pst I 

604 Forward CGC GGATCCGCTAGC -CCCGAAGCGCACTT BamHl- 

Nhel 

Reverse CCCG CTCGAG -GACGGCATCTGCACGG Xhol 
606a Forward AAAAAAGAATTC-CGCGAATACCGCGCCGATGCGGGCGC Eco RI 
Reverse AAAAAA CTGC AG -TT AAAGCG ATTTG AGGCGGGGG ATAGG Pst I 

607 Forward AAAAAA GAATTC - ATGCTGCTCGACCTCAACCGCTTTTG Eco RI 
Reverse AAAAAACTGCAG-TCAGACGGCCTTATGCGATCTGAC Pstl 

608 Forward AAAAAAGAATTC- ATGTCCGCCCTCCTCCCCATCATCAACCG Eco Rl 
Reverse AAAAAACTGCAG-TTAGTCTATCCAAATGTCGCGTTC Pstl 

609 Forward CGC GGATCCCATATG -GTTGTGGATAGACTCG BamHI- 
. Ndel 
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Reverse CCC GCTCGAG -CTGGATTATGATGTCTGTC Xhol 

610 Forward CGC GGATCCCATATG -ATTGGAGGGCTTATGCA BamHI- 

Ndel 

Reverse CCCG CTCGAG -ACGCTTCAACATCTTTGCC Xhol 

611 Forward CGC GGATCCCATATG -CCGTCTCAAAACGGG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AACGACTTTGAACGCGCAA Xhol 

613 Forward CGC GGATCCCATATG -TCGCGTTCGAGCCG3 BamHI- 

Ndel 

Reverse CCCG CTCGAG -AGCCTGTAAAATAAGCGGC Xhol 

614 Forward CGC GGATCCCATATG -TCCGTCGTGAGCGGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CCATACTGCGGCGTTC Xhol 
616 Forward AAAAAA GAATTC -ATGTCAAACACAATCAAAATGGTTGTCGG Eco RI 

Reverse AAAAAA TCTAGA -TTAGTCCGGGCuGCAGGCAGCTCG Xbal 
619a Forward AAAAAA GAATTC -GGGCTTCTCGCCGCC T °GCTTGC Eco RI 

Reverse AAAAAA CTGCAG -TCATTTTTTGTGTTTTAAAACGAGATA Pst I 
622 Forward CGC GGATCCCATATG -GCCGCCCTGCCTAAAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTGTCCAAATGATAAATCTG Xhol 

624 Forward CGC GGATCCCATATG -TCCCCGCGCTTTTACCG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AGATTCGGGCCTGCGC Xhol 

625 Forward CGC GGATCCCATATG -TTTGCAACCAGGAAAATG BamHI- 

Ndel 

Reverse CCCG CTCGAG -CGGCAAAATTACCGCCTT Xhol 
627a Forward AAAAAA GAATTC -AAAGCAGGCGAGGCAGGCGCGCTGGG Eco RI 

Reverse AAAAAA CTGCAG - Pst I 

TTACGAATGAAACAGGGTACCCGTCATCAAGGC 
628 Forward AAAAAA GGTACC -GCCTTACAAACATGGATTTTGCGTTC Kpn I 

Reverse AAAAAA CTGCAG -CTACGCACCTGAAGCGCTGGCAAA Pst I 

629a Forward AAAAAA GAATTC -GCCACCTTTATCGCGTATGAAAACGA Eco RI 

Reverse AAAAA ACTGCAG -TTACAACACCGCCGTCCGGTTCAAACC Pst I 
630a Forward AAAAAA GAATTC -GCGGCTTTGGGTATTTCTTTCGG Eco RI 

Reverse AAAAAA CTGCAG -TTAGGAGACTTCGCCAATGGAGCCGGG Pst I 
635 Forward AAAAAA GAATTC - EcoRI 
ATGACCCAGCGACGGGTCGGCAAGCAAAACCG 

Reverse AAAAAA CTGCAG -TTAATCCACTATAATCCTGTTGCT Pst I 

638 Forward AAAAAA GAATTC -ATGATTGGCGAAAAGTTTATCGTAGTTGG Eco RI 
Reverse AAAAAA CTGCAG -TCACGAACCGATTATGCTGATCGG Pst I 

639 Forward CGC GGATCCCATATG -ATGCTTTATTTTGTTCG BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATCGCGGCTGCCGAC Xhol 
642 Forward CGC GGATCCCATATG -CGGTATCCGCCGCAAT BamHI- 

Ndel 

Reverse CCCGCTCGAG-AGGATTGCGGGGCATTA Xhol 
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643 Forward CGC GGATCCCATATG -GCTTCGCCGTCGGCAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AACCGAAAAACAGACCGC Xhol 

644 Forward AAAAAA GAATTC - Eco RI 

ATGCCGTCTGAAAGGTCGGCGGATTGTTGCCC 

Reverse AAAAAA TCTAGA -CTACCCGCAATATCGGCAGTCCAATAT Pst I 

645 Forward AAAAAA GAATTC -GTGGAACAGAGCAACACGTTAAATCG Eco RI 
Reverse AAAAAA CTGCAG -CTACGAGGAAACCGAAGACCAGGCCGC Pst I 

647 Forward AAAAAA GAATTC - ATGCAAAGGCTCGCCGCAGACGG Eco RI 
Reverse AAAAAA CTGCAG -TTAGATTATCAGGGATATCCGGTAGAA Pst I 

648 Forward AAAAAA GAATTC - Eco RI 

ATGAACAGGCGCGACGCGCGGATCGAACG 

Reverse AAAAAA CTGCAG -TCAAGCTGTGTGCTGATTGAATGCGAC Pst I 

649 Forward AAAAAA GAATTC -GGTACGTCAGAACCCGCCCACCG Eco RI 
Reverse AAAAAA CTGCAG -TTAACGGCGGAAACTGCCGCCGTC Pst I 

650 Forward AAAAAA GAATTC - ATGTCCAAACTCAAAACCATCGC Eco RI 
Reverse AAAAAA CTGCAG -TCAGACGGCATGGCGGTCTGTTTT Pst I 

652 Forward AAAAAA GGTACC - Kpn I 

GCTGCCGAAGACTCAGGCCTGCCGCTTTACCG 

Reverse AAAAAA CTGCAG -TTATTTGCCCAGTTGGTAGAATGCGGC Pst I 

653 Forward AAAAAA GAATTC -GCGGCTTTGCCGGTAATTTTCATCGG Eco RI 
Reverse AAAAAA CTGCAG -CTATGCCGGTCTGGTTGCCGGCGGCGA Pst I 

656a Forward AAAAAA GAATTC -CGGCCGACGTCGTTGCGTCCTAAGTC Eco RI 

Reverse AAAAAA CTGCAG -CTACGATTTCGGCGATTTCCACATCGT Pst I 

657 Forward AAAAAA GAATTC -GCAGAATTTGCCGACCGCCATTTGTGCGC Eco RI 
Reverse AAAAAA CTGCAG -TTATAGGGACTGATGCAGTTTTTTTGC Pst I 

658 Forward CGC GGATCCCATATG -GTGTCCGGAATTGTG BamHI- 

Ndel 

Reverse CCC GCTCGAG -GGCAGAATGTTTACCGTT Xhol 

661 Forward AAAAAA GAATTC - Eco RI 
ATGCACATCGGCGGCTATTTTATCGACAACCC 

Reverse AAAAAA CTGCAG -TCACGACGTGTCTGTTCGCCGTCGGGC Pst I 

663 Forward CGC GGATCCCATATG -TGTATCGAGATGAAATT BamHI- 

Ndel 

Reverse CCCG CTCGAG -GTAAAAATCGGGGCTGC Xhol 

664 Forward CGC GGATCCCATATG -GCGGCTGGCGCGGT BamHI- 

Ndel 

Reverse CCCG CTCGAG -AAATCGAGTTTTACACCAC Xhol 

665 Forward AAAAAA GAATTC -ATGAAATGGGACGAAACGCGCTTCGG Eco RI 
Reverse AAAAA ACTGCAG -TCAATCCAAAATTTTGCCGACGATTTC Pst I 

666 Forward AAAAAA GAATTC -AACTCAGGCGAAGGAGTGCTTGTGGC Eco RI 
Reverse AAAAA ATCTAGA -TCAGTTTAGGGATAGCAGGCGTAC Xha I 

667 Forward AAAAAA GAATTC - Eco RI 

CCGCATCCGTTTGATTTCCATTTCGTATTCGTCCG 

Reverse AAAAAA CTGCAG -TTAATGACACAATAGGCGCAAGTC Pst I 
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669 


forward 


AAAAAAGAATTC-ATGCGCCGCATCATTAAAAAACACCAGCC 


Fro RT 




IxCVCI DC 


A A A A A ACTGC AC-TT AC ACT ATCCCTTTC ATCTCCCC 

-rtJ^rvz-vrVrrV^ I VJv rVVJ - 1 1 AVV^/WJ Inl V_xV^Vj 111 VJ A 1 VJ I V, vJ vJ v.^ 


Pc-t T 
r St 1 


o /ua 


Forward 


AAAAAAVjAA J 1 L-AAAAALUL 1 1 LvjvjVjLvj 111 Lu 1 LI 1L 


bco RI 




ivcversc 


A A A A A A CTCC A C 

t\s\f\J\r\J\\-s 1 Vjv_./AvJ- 


rst 1 






TTAGGAGrTTTTGGAArGrrTTrrTrTArTnnr 

1 1 AVJVjnVJv/ 1111 VJ VJ /v/ \ VJ V^ VJ 1 V^ VJ VJ /A V_ 1 VJVJV^ 




671 


Forward 


CGCGGATCCCATATG-ACCAGCAGGGTAAC 


RamHT- 

Uall 11 1 1 








Ndel 




Reverse 


CCCGCTCGAG-AGCAACTATAAAAACGCAAG 


Xhol 


672 


Forward 


CGCGGATCCCATATG-AGGAAAATCCGCACC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-ACGGGATAGGCGGTTG 


Xhol 


673 


Forward 


AAAAAAGAATTC-ATGGATATTGAAACCTTCCTTGCAGG 


Eco RI 




Reverse 


AAAAAACTGCAG-CTACAAACCCAGCTCGCGCAGGAA 


Pst I 


674 


Forward 


AAAAAAGAATTC-ATGAAAACAGCCCGUCGCCGTTCCCG 


Eco RJ 




P pupr^p 


AAAAAACTGCAG-TfAA CGGCOTTTfinnrTrfVrrnfif; 

iiiljTaTiLri v_v i VJ v/il\j i v>iT-rA.\^v_i VJ v> VJ III VJ VJ VJ V_, 1 V, VJ I V^VJVJVJ 


Pet T 
r SI 1 


675 


Forward 

1 1 TV Ul VI 


CGCGGATCCCATATG-AACACCATCGCCCC 


RcimT4T 
Ddiiini - 




Reverse 


CCCGCTCGAG-TTCTTCGTCTTCAAACTGT 


Ndel 
Xhol 


677a 


Forward 


AAAAAAGAATTC-AGACGGCATTCCCGATCAGTCGATTTTGA 


Eco RI 




Reverse 


AAAAAACTGCAG-TTACGTATGCGCGAAATrGArrGrrGr 

■* *-* *-* ' -i vj vi i-vJ i a i 1 \_J x J i 1 VJvVJ vVJiui/i 1 V. VJ / \ V^ V VJ V- V^ VJ V.- 


Pst T 

r ol 1 


680 


Forward 


CGCGGATCCGCTAGC-ACGAAGGGCAGTTCGG 










1NI1C1 




Reverse 

X VW ▼ VI uV 


CCCGCTCGAG-CATCAAAAACCTGCCGC 


yvllUl 


681 


Forward 


AAAAAAGAATTC-ATGACGACGCCGATGGCAATCAGTGC 


Fro RI 




P Avprcp 


A A A A A ACTGCAO-TTAGrriTfTTrrf^rA A A A A AP A CC 

/vrv^vrv.rv.rvv^ l VJ V^.rAVJ l l /A V, VJ 1V_.11 v^ V ^ VJ V- /A /viA^rV /A .'A r\ VJ v.- 


Pet T 
r SI I 


Oo;S 


Unnt tor/1 

rorwara 


LvjLvjvj A 1 LLLA 1 A I Vj- 1 uLAuLALALLuLrALAA 


n T TT 

BamHI- 








iNaei 




rVCVCI oC 


CCCC1CTCC A CC A CTTTTTTTCCCC ATA CC 
v^v^v^vJV-/ 1 v^vJ/AVJ vJ/AvJ 1 i 1 1 1 I 1 v^v^vJv^/v i .tAv^vJ 


Anoi 


Oo4 


Forward 


LLrLvjOA 1 LLLA 1 A 1 vj- 1 LLLLr 1 AL 1 C_r 1 LLAAACj 


BamHI- 








iNaei 




IxCVCIoC 


CCCCCTCC A C-CTCC A CC A TCTCTTCCC 

v>L-vv VJ V-/ 1 vUaU*v> 1 v^ VJ/A v., v.. /a 1 v, 1 u 1 1 VJV^VJ 


Xhol 




Forward 


LvjLvjvjAI LLLA IAIvj-IvjI 1 1 (jL 1 1 AA 1 AA iAAALATT 


BamHI- 








iNaei 




j\cvciac 


CCCGCTCG AG-CTTTTTCCCCCCCCC A 

V^V^VxVJV^ 1 vvjAvj v 11111 V^V^V^v^VJV^V^VJV_, A 


Xnol 


UOO 


Forward 


LvjLvjvjA 1 LLLA 1 A 1 Lr- 1 vjLvjvjLvjvj 1 1 LvjvjAACj 


BamHI- 








INaei 




JTvCVCI ov 


GrGGrTrGAG-GATTGrGATTPTnATnA AC 

V^\_V'VJV/ 1 vVJAU V/A 1 1 V^ V^ VJ/A 1 i L 1 VJ/A. I VJ/A^rV VJ 


Anoi 


687 


Forward 


CGCGGATCCCATATG-TGCGACAGCAAAGTCCA 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CTGCGCGGCTTTTTGTT 


Xhol 


690 


Forward 


CGCGGATCCCATATG-TGTTCTCCGAGCAAAGAC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TATTCGCCCCGTGTTTGG 


Xhol 


691 


Forward 


CGCGGATCCCATATG-GCCACGGCTTATATCCC 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTTGAGGCAGGAAGAAAG 


Xhol 
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694 Forward CGC GGATCCCATATG -TTGGTTTCCGCATCCGG BamHI- 

Ndel 

Reverse CCCG CTCGAG -TCTGCGTCGGTGCGGT Xhol 

695 Forward CGC GGATCCCATATG -TTGCCTCAAACTCGTCCG BamHI- 

Ndel 

Reverse CCCG CTCGAG -TCGTTTGCGCACGGCT Xhol 

696 Forward CGC GGATCCCATATG -TTGGGTTGCCGGCAGG BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTGATTGCCGCAATGATG Xhol 
700a Forward AAAAAAO^ATTC-GCATCGACAGACGGTGTGTCGTGGAC Eco RI 
Reverse AAAAAACTGCAG-TTACGCTACCGGCACGACTTCCAAACC Pst I 

701 Forward CGC GGATCCCATATG -AAGACTTGTTTGGATACTTC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TGCCGAC AACAGCCTC Xhol 

702 Forward AAAAAAGAATTC-ATGCCGTGTTCCAAAGCCAGTTGGATTTC Eco RJ 
Reverse AAAAAACTGCAG-TTAACCCCATTCCACCCGGAGAACCGA Pst I 

703 Forward CGC GGATCCGCTAGC -CAAACGCTGGCAACCG BamHI- 

Nhel 

Reverse CCCG CTCGAG -TTTTGCAGGTTTGATGTTTG Xhol 
704a Forward AAAAAA GAATTC -GCTTCTACCGGTACGCTGGCGCG Eco RI 

Reverse AAAAAA CTGCAG - p st ] 

TTAGTTTTGCCGGATAATATGGCGGGTGCG 

707 Forward CGC GGATCCGCTAGC -GAAATTATTAACGATGCAGA BamHI- 

Nhel 

Reverse CCCG CTCGAG -GAAACTGTAATTCAAGTTGA Xhol 

708 Forward CGC GGATCCGCTAGC -CCTTTTAAGCCATCCAAAA BamHI- 

Nhel 

Reverse CCCG CTCGAG -TTGACCGGTGAGGACG Xhol 
710 Forward CGC GGATCCCATATG -GAAACCCACGAAAAAATC BamHI- 

Ndel 

Reverse CCCG CTCGAG -AACGGTTTCGGTCAG Xhol 
714 Forward CGC GGATCCCATATG -AGCTATCAAGACATCTT BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCGGTAGGTAAATCGGAT Xhol 
716 Forward CGC GGATCCCATATG -GCCAACAAACCGGCAAG BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTAGAACCGCATTTGCC Xhol 
718 Forward CGC GGATCCC ATATG -GAGCCG ATAATGGC A A A BamHI- 

Ndel 

Reverse CCCGCTCGAG-GGCGCGGGCATGGTCTTGTCC Xhol 
720 Forward CGC GGATCCCATATG -AGCGGATGGCATACC BamHI- 

Ndel 

Reverse CCCGCTCGAG-TTTTGCATAGCTGTTGACCA Xhol 
723 Forward CGC GGATCCCATATG -CGACCCAAGCCCC BamHI- 

Ndel 

Reverse CCCGCTCGAG-AATGCGAATCCGCCGCC Xhol 
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/ Zr J 


Forward CGCGGATCCCATATG-GTGCGCACGGTTAAA 


T> « T T T 

BamHl- 








i\uei 




rvcvci be 


CCGGCTCG Afi TTDPTT A TPPTT A APrr.TTA 
LLLLJL 1 LLt/\vj- I i UL 1 Ini 1 1 AAvjvjLt 1 1 A 


Anoj 


1 £.\J 


1 yji W11U 


CGGGG A TPPP ATA Tfl. A fP A TPT A TTTP A a a a AT 
v^ vj v_. vj vj /\ i v_,\^\_.rv i /\ 1 VJ-/-YV. v^/"Y J LI /\ 1 1 1 LAAAAnL 


n ~ T TT 

BamHI- 








KIH^T 

rNQei 




xvcversc 


LLLLiL I LLf/\L7-L7LLLTA 1 Lr I 1 1 ALrLvj 1 LL 


An oi 


119. 
1 Lo 


I \Jl WdlU 


PGPGGATPPP ATATP-TTTTPPPTPPP A APPnP 
vjv_/\j i \^v_^v^ a rv i a i vj- liii vjvjl i vj vj vj J\J \ L Lj Lr Lt 


BamHI- 








PMQcl 




JxCVCIdC 


gggggtggag ptpapa a a nnTrnrnr 

LLLUL 1 LVJ/AvJ VJ i UAUAAAuU 1 L VJ L VJ L 


AjIO j 


790 


r urwdiu 


PPPPPATPPP AT ATP TPtP A PP ATH A TTPPPr a 
L/UL/UUA 1 LLLA 1 /\ 1 Lr- 1 ULALLA 1 VJ A i 1 LLLLA 


BamHI- 








IN Gel 




ivcvcrSc 


pppp A A pptt TTTrTTrnrTTTr.nrT a "rr^ 

uLLLaAUL 1 1-1 1 1 vj 1 LLjLt 111 LrvJLr 1 AIL 


Hinalii 


7^ 1 


x UI Wdl U 


cgcgg atppppt a gc-gccCtTGccgg a pp 

V^ VJ V_. VJ VJ / V 1 vVyUV^ 1 AvJu"VJ\^V/VJ 1 VJ L L VJ VJ r\ VJ VJ 


BamHI- 








in nei 




xvcVciSc 


rrrnrTrnAft Apppppppppr^Ar^ 


Xnol 


I'M 


Th/m-iI/QTm 

a Ul Wdi u 


PPPP A A TTPT A P A T A TP TPP AAA PPTHTTTTT a a n a a 
L L VJVJAVA I 1L1 ALA 1 /\ 1 vj- 1 LLtAAALL 1 Lj 1 ' 1 AA'jAA 


EcoRI- 








iNaei 




ixeverse 


GGGGGTCCl A P PTTPTT A TPTTTTTT A TfTTTr 
LLLLiL 1 LLlALJ-L 11L11A1L1 lllllAlLlllL 


Xnol 


I'XX 

130 


/'vfw/nTYi 

rui wdiu 


PPPPPATPPP AT ATP PPPTPPnnrr;^^ A A 

v^ vjv^ vjvj/A i LLL/\ 1 A 1 vJ"ULL 1 LLLjvjLvjvjL AA 


BamHI- 








iNaei 




reverse 


LLLLiL 1 LLrALi- 1 LLtL 1 1 LrLL* 1 LL 111 AL 


AJlOl 




r urwdiu 


CGCGG A TPPP ATA TP GCCC ATA PTT A CCinnT A t 

LLTLLTLLrV 1 LLLA 1 f\ 1 LT-LTLLLjA 1 AL 1 1 ALVjVjL 1 A 1 


BamHI- 








iNaei 




ixe verse 


CCCGCTCC A P TTTP A P A TTTTP. A A TP A A at at 
LLLLtL 1 LLtAvj- 1 i 1 LtALj A 1 1 1 1 LtAA 1 L AAAUAvj 


Xnol 


7^ 


i WdlU 


PPPPPATPPP AT ATP A A PP A pp A pppr;r;Tr^ a 

LLILVjvJ/\ 1 LLL/A.1 r\ 1 U _ AAUL AULAUULaJU 1 LA 


BarnHI- 








iNuei 




ivcvcrse 


CCCGCTCG A Pi ATTTPPPT A prm A 
LLLLiL 1 Lvj/ALr-/\ 1 1 1 LLLr 1 ALjLLLtALtLtlt 


Xnol 


7^7 


i v7l waiu 


PPPPPATPPP AT A TP-PA PP A PP A PP/"; a c a rr, 

V-^vj v_^ vjvj/V l V^V^\_/rV 1 A 1 VJ-L rVLL/\LLr/\LLJLTAL ALvJ 


BamHI- 








iNuei 




IvCVClaC 


CCCGCTCG AG gtcgtcccccccic^c A 

LLLLJL 1 Lvj/\Lt-vj 1 LLt 1 L vjLLtvjLvjLtvj A 


Xnol 




P'nrvx/nrH 
i ui w aivi 


PGPGPATPPPATATP-PP A A A A A A APPPA AP A 
v^vjv^vjvjrt. 1 v^v^v^av l A I U ULAAAAAAALLUAALA 


"T~» x TT 

BamHI- 








iNUCl 






CCCGCTCG A P-P A A P A PTTTPTPr: ATA A tt 
LLLvjl 1 LLT/\Lj-vj/\/\Li/\vj 1 1 1 Lt i LLtALtAA 1 1 


xnol 


740 


Forward 


PGPGGATPPP AT ATP-PPP A ATPPPPPPP A AP 

^vJV-'VJ VJ-rx 1 \_-V^\_/^ 1 / V JL VJ~VJLL/VrV 1 LL VJLLL VJ/Lr\VJ 


n T TT 

BamHI- 












ivcver ac 


GCGGGTGGAG A A APPPPPP A A A A T A PTr 1 
LLLVJL 1 LL/\LJ /\/WVLLtLvjLL A7VAA 1 ALT 1 LJ 


xnol 


741 

/Hi 


Forward 


CGCGGATCCCATATG-TGCAGCAGCGGAGGG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-TTGCTTGGCGGCAAGGC 


Xhol 


743 


Forward 


CGCGGATCCCATATG-GACGGTGTTGTGCCTGTT 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CTTACGGATCAAATTGACG 


Xhol 


745 


Forward 


CGCGGATCCCATATG-TTTTGGCAACTGACCG 


BamHI- 








Ndel 




Reverse 


CCCGCTCGAG-CAAATCAGATGCCTTTAGG 


Xhol 


746 


Forward 


CGCGGATCCCATATG-TCCGAAAACAAACAAAAC 


BamHI- 
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Reverse CCCG CTCGAG -TTCATTCGTTACCTGACC 
747 Forward CCG GAATTC TA GCTAGC -CTGACCCCTTGGG 

Reverse GCCC AAGCTT -TTTTGATTTTAATTGACTATAGAAC 

749 Forward CGC GGATCCCATATG -TGCCAGCCGCCG 

Reverse CCCG CTCGAG -TTTCAAGCCGAGTATGC 

750 Forward CGC GGATCCCATATG -TGTTCGCCCGAACCTG 

Reverse CCCG CTCGAG -CTTTTTCCCCGCCGCAA 

758 Forward CGC GGATCCCATATG -AACAATCTGACCGTGTT 

Reverse CCCG CTCGAG -TGGCTCAATCCTTTCTGC 

759 Forward CGC GGATCCGCTAGC -CGCTTCACACACACCAC 

Reverse CCCG CTCGAG -CCAGTTGTAGCCTATTTTG 

763 Forward CGC GGATCCCATATG -CTGCCTGAAGCATGGCG 

Reverse CCCG CTCGAG -TTCCGCAAATACCGTTTCC 

764 Forward CGC GGATCCCATATG -TTTTTCTCCGCCCTGA 

Reverse CCCG CTCGAG -TCGCTCCCTAAAGCTTTC 

765 Forward CGC GGATCCCATATG -TTAAGATGCCGTCCG 

Reverse CCCG CTCGAG -ACGCCGACGTTTTTTATTAA 

767 Forward CGC GGATCCCATATG -CTGACGGAAGGGGAAG 

Reverse CCCG CTCGAG -TTTCTGTACAGCAGGGG 

768 Forward CGC GGATCCCATATG -GCCCCGCAAAAACCCG 

Reverse CCCG CTCGAG -TTTCATCCCTTTTTTGAGC 

770 Forward CGC GGATCCCATATG -TGCGGCAGCGGCGAA 

Reverse CCCG CTCGAG -GCGTTTGTCGAGATTTTC 

771 Forward CGC GGATCCCATATG -TCCGTATATCGCACCTTC 

Reverse CCCG CTCGAG -CGGTTCTTTAGGTTTGAG 

772 Forward CGC GGATCCCATATG -TTTGCGGCGTTGGTGG 

Reverse CCCG CTCGAG -CAATGCCGACATCAAACG 
774 Forward CGC GGATCCCATATG -TCCGTTTCACCCGTTCC 

Reverse CCCG CTCGAG -TCGTTTGCGCACGGCT 
790 Forward CGC GGATCCCATATG -GCAAGAAGGTCAAAAAC 



Ndel 
Xhol 
EcoRI- 
Nhel 
Hindlll 
BamHI 
Ndel 
Xhol 
BamHI 
Ndel 
Xhol 
BamHI 
Ndel 
Xhol 
BamHI 
Nhel 
Xhol 
BamHI 
Ndel 
Xhol 
BamHI 
Ndel 
Xhol 
BamHI 
Ndel 
Xhol 
BamHI 
Ndel 
Xhol 

BamHI 

Ndel 

Xhol 

BamHI 

Ndel 

Xhol 

BamHI 

Ndel 

Xhol 

BamHI 

Ndel 

Xhol 

BamHI 

Ndel 

Xhol 

BamHI 

Ndel 
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Reverse CCCG CTCGAG -GGCGTTGTTCGGATTTCG Xhol 

900 Forward CGC GGATCCCATATG -CCGTCTGAAATGCCG BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATATGGAAAAGTCTGTTGTC Xhol 

901 Forward CGC GGATCCCATATG -CCCGATTTTTCGATG BamHI- 

Ndel 

Reverse CCCG CTCGAG -AAAATGGAACAATACCAGG Xhol 

902 Forward. CCG GAATTC TA CATATG -TTGCACTTTCAAAGGATAATC EcoRI- 

2 Ndel 
Reverse CCCG CTCGAG -AAAAATGTACAATGGCGTAC Xhol 

903 Forward CCG GAATTC TA GCTAGC -CAGCGTCAGCAGCACAT EcoRJ- 

Nhel 

Reverse CCCG CTCGAG -GAAACTGTAATTCAAGTTGAA Xhol 

904 Forward AAAAAA GGTACC -ATGATGCAGCACAA1 CGTTTC Kpn I 
Reverse AAA CTGCAG -TTAATATCGATAGGTTATATG Pst I 

904a Forward AAAAAA GAATTC- CGGCTCGGCATTGTGCAGATGTTGCA Eco RI 
Reverse AAA CTGCAG -TTAATATCGATAGGTTATATG Pst I 

905 Forward CGC GGATCCCATATG -AACAAAATATACCGCATC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CCACTGATAACCGACAGAT Xhol 

907 Forward CGC GGATCCCATATG -GGCGCGCAACGTGAG BamHI- 

Ndel 

Reverse CCC GCTCGAG -ACGCCACTGCCAGCG Xhol 

908 Forward AAA GAATTC -GCAGAGTTAGTAGGCGTTAATAAAAATAC Eco RI 
Reverse AAA CTGCAG -TTAATATGGTTTTGTCGTTCG Pst I 

909 Forward CGC GGATCCCATATG -TGCGCGTGGGAAACTTAT BamHI- 

Ndel 

Reverse CCCG CTCGAG -TCGGTTTTGAAACTTTGGTTTT Xhol 

910 Forward AAA GAATTC -GCATTTGCCGGCGACTCTGCCGAGCG Eco RI 
Reverse AAA CTGCAG -TCAGCGATCGAGCTGCTCTTT Pst I 

911 Forward AAA GAATTC -GCTTTCCGCGTGGCCGGCGGTGC Eco RI 
Reverse AAAAAA CTGCAG -GTCGACTTATTCGGCGGCTTTTTCCGC Pst I 

912 Forward AAAAAA GAATTC - Eco RI 

CAAATCCGTCAAAACGCCACTCAAGTATTGAG 
Reverse AAAAAA CTGCAG -TTACAGTCCGTCCACGCCTTTCGC Pst 1 

913 Forward CGC GGATCCCATATG -GAAACCCGCCCCGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -AGGTTGTGTTCCAGGTTG Xhol 

915 Forward CGC GGATCCCATATG -TGCCGGCAGGCGGAA BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTTGAAAATATAGGTATCAGG Xhol 

914 Forward AAAGAATTC-GACAGAATCGGCGATTTGGAAGCACG Eco RI 
Reverse AAACTGCAG-CTATATGCGCGGCAGGACGCTCAACGG Pst I 

916 Forward CGC GGATCCCATATG -GCAATGATGGCGGCTG BamHl- 

Ndel 

Reverse CCCGCTCGAG-TTTGGCGGCATCTTTCAT Xhol 
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917 Forward AAAAAA GAATTC -CCTGCCGAAAAACCGGCACCGGC Eco RI 

Reverse AAAAAA CTGCAG -TTATTTCCCCGCCTTCACATCCTG Pst I 

919 Forward CGC GGATCCCATATG -TGCCAAAGCAAGAGCATC BamHI- 

Ndel 

Reverse CCCG CTCGAG -CGGGCGGTATTCGGG Xhol 

920 Forward CGC GGATCCCATATG -CACCGCGTCTGGGTC BamHI- 

Ndel 

Reverse CCCG CTCGAG -ATGGTGCGAATGACCGA Xhol 

921 Forward AAAAAA GAATTC -TTGACGGAAATCCCCGTGAATCC Eco RI 
Reverse AAAAAACTGCAG-TCATTTCAAGGGCTGCATCTTCAT Pst I 

922 Forward. CGC GGATCCGCTAGC -TGTACGGCGATGGAGGC BamHI- 

2 Nhel 
Reverse CCCG CTCGAG -CAATCCCGGGCCGCC Xhol 

923 Forward CGC GGATCCCATATG -TGTTACGCAATATTGTCCC BamHI- 

Nhel 

Reverse CCCG CTCGAG -GGACAAGGCGACGAAG Xhol 

925 Forward CGC GGATCCCATATG -AAACAAATGCTTTTAGCCG BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCCGTTGCATTTGATTTC Xhol 

926 Forward CGC GGATCCCATATG -TGCGCGCAATTACCTC BamHI- 

Ndel 

Reverse CCC GCTCGAG -TCTCGTGCGCGCCG Xhol 

927 Forward CGC GGATCCCATATG -TGCAGCCCCGCAGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -GTTTTTTGCTGACGTAGT Xhol 
929a Forward AAAAAA GAATTC -CGCGGTTTGCTCAAAACAGGGCTGGG Eco RI 

Reverse AAAAAA TCTAGA -TTAAGAAAGACGGAAACTACTGCC Xba I 

931 Forward AAAAAA GAATTC -GCAACCCATGTTTTGATGGAAAC Eco RI 

Reverse AAAAAACTGCAG-TTACTGCCCGACAACAACGCGACG Pst I 

935 Forward AAAAAA GAATTC - E co rj 

GCGGATGCGCCCGCGATTTTGGATGACAAGGC 
Reverse AAAAAA CTGCAG -TCAAAACCGCCAATCCGCCGACAC Pst I 

936 Forward CGC GGATCCCATATG -GCCGCCGTCGGCGC BamHI- 

Ndel 

Reverse CCCG CTCGAG -GCGTTGGACGTAGTTTTG Xhol 

937 Forward AAAAAAGAATTC-CCGGTTTACATTCAAACCGGCGCAAC Eco RI 
Reverse AAAAAA CTGCAG -TTAAAATGTATGCTGTACGCCAAA Pst I 

939a Forward AAAAAAGAATTC-GGTTCGGCAGCTGTGATGAAACC Eco RI 

Reverse AAAAAACTGCAG-TTAACGCAAACCTTGGATAAAGTTGGC Pst I 
950 Forward CGCGGATCCCATATG-GCCAACAAACCGGCAAG BamHl- 

Ndel 

Reverse CCCGCTCGAG-TTTAGAACCGCATTTGCC Xhol 
953 Forward CGCGGATCCCATATG-GCCACCTACAAAGTGGAC BamHI- 

Ndel 

Reverse CCCG CTCGAG -TTGTTTGGCTGCCTCGAT Xhol 
957 Forward CGCGG ATCCC AT ATG-TTTTGGCTGGG A A GGGG BamHI- 
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Ndel 




Reverse CCCGCTCGAG-GTGAGAAAGGTCGCGC 


Xhol 


958 


Forward CGCGGATCCCATATG-GCCGATGCCGTTGCG 


BamHI- 






Ndel 




Reverse GCCCAAGCTT-GGGTCGTTTGTTGCGTC 


Hindlll 


959 


Forward CGCGGATCCCATATG-CACCACGACGGACACG 


BamHI- 






Ndel 




Reverse LCCGC 1 LGAu-G 1 LG I CuLGuLuGGA 


Xnol 


961 


Forward CGCGGATCCCATATG-GCCACAAGCGACGACG 


t~\ T TT 

BamHJ- 






Ndel 




Reverse CCCGCTCGAG-CCACTCGTAA i 1 CjACGC 


VI. „ T 

Xhol 


972 


Forward AAAAAAGAATTC- 


Eco RI 




TTGACTAACAGGGGGGGAGCGAAATTAAAAAC 






Reverse AAAAAATCTAGA-TTAAAAATAATCATAATCTACATTTTG 


Xba I 


973 


Forward AAAAAAGAATTC- ATGGACGGCGCACAACCGAAAAC 


Eco RI 




Reverse AAAAAACTGCAG-TTACTTCACGCGGGTC Z CCATCAGCGT 


PstI 


982 


Forward CGCGGATCCCATATG-GCAGCAAAAGACGTAC 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CATCATGCCGCCCATCC 


Xhol 


983 


Forward CGCGGATCCCATATG-TTAGCTGTTGCAACAACAC 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-GAACCGGTAGCCTACG 


Xhol 


987 


Forward CGCGGATCCCATATG-CCCCCACTGGAAGAAC 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TAATAAACCTTCTATGGGC 


Xhol 


988 


Forward CGCGGATCCCATATG-TCTTTAAATTTACGGGAAAAAG 


BamHI- 






Ndel 




Reverse GCCCAAGCTT-TGATTTGCCTTTCCGTTTT 


Hindlll 


989 


Forward CCGGAATTCTACATATG-GTCCACGCATCCGGCTA 


EcoRI- 






Ndel 




Reverse CCCGCTCGAG-TTTGAATTTGTAGGTGTATTGC 


Xhol 


990 


Forward. CGCGGATCCGCTAGC-TTCAGAGCTCAGCTT 


BamHI- 




2 


Nhel 




Reverse CCCGCTCGAG-AAACAGCCATTTGAGCGA 


Xhol 


992 


Forward CGCGGATCCCATATG-GACGCGCCCGCCCG 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-CCAAATGCCCAACCATTC 


Xhol 


993 


r Orward CuCLjLjA i LCL.A 1 A 1 Lr-Ol_-AA 1 ut 1 OA 1 1 U AAA 1 (- A 


T~> TTT 






Ndel 




Reverse CCCGCTCGAG-GAACACATCGCGCCCG 


Xhol 


996 


Forward CGCGGATCCCATATG-TGCGGCAGAAAATCCGC 


BamHI- 






Ndel 




Reverse CCCGCTCGAG-TCTAAACCCCTGTTTTCTC 


Xhol 


997 


Forward CCGGAATTCTAGCTAGC-CGGCACGCCGACGTT 


EcoRJ- 






Nhel 




Reverse CCCGCTCGAG-GACGGCATCGCTCAGG 


Xhol 
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Underlined sequences indicate restriction recognition sites. 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1>: 

gOOl . seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG GTGTCGGCGA ACGAGGTGTC 

51 CGGCAGGGCT TGCGCCCGGA TGGTGCTGGT CATCTGCCAG ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGC TCGGGTACGC ATACTTTACC GGTTTGGGCG 

151 ATTTTGCCGA GGTCGTTGCG CAGCAAATCG ACAATCATCA CGTTTTCGGC 

201 GCGGTTTTTC GGGTCGGTTT GTAACTCGGC GGCGCGGCGT TCGTCTTGTC 

2 51 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCTGAAG CGATGTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 

351 CGCGGATTGC CCGGCTTCAT CGGGCAGGTG GGACAATACG G CAT AG 

This corresponds to the amino acid sequence <SEQ ID 2; ORF 001 .ng>: 

gOOl .pep 

1 MLPQGKAARR VSANEVSGRA CA RMVLVICQ TLPKRDTLNG SGTHTLPVWA 

51 ILPRSLRSKS TI ITFSARFF GSVCNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEAMLRKSS GEKHSVHADC PASSGRWDNT A* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 3>: 

mOOl . seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG ATGTCGGCGA ACGAGGTGTG 

51 CGGcAssCTT ss . GCTTGGA yGGTGCTGGT CATCTGCCAA ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGT TCGGGTACGC ATACTGTGCC GGTTTGGGCG 

151 ATTTTGCCGA GATCGTTACG CAGCAAATCG ACAATCATCA CGTTTTCGGC 

201 GCGGTTTTTC GGGTCTGCTT GCAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCCGAAC CGATTTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 

351 CGCGGATTGC CCCTCCGCAT CGGGCAGGTG GGACAAGACG GCATAG 

This corresponds to the amino acid sequence <SEQ ID 4; ORF 00 1>: 

mOOl .pep 

1 MLPQGKAARR MSANEVCGXL X AWXVLVICQ TLPKRDTLNG SGTHTVPVWA 

51 ILPRSLRSKS TI ITFSARFF GSACNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEPILRKSS GEKHSVHADC PSASGRWDKT A* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 5>: 

aOOl . seq 

1 ATGCTGCCGC AGGGGAAGGC GGCGCGGAGG ATGTCGGCGA ACGAGGTGTG 

51 CGGCAAGGCT TGGGCTTGGA TGGTGCTGGT CATCTGCCAA ACGCTGCCGA 

101 AACGCGATAC TTTAAACGGT TCGGGTACGC ATACTGTGCC GGTTTGGGCG 

151 ATTTTGCCGA GGTCGTTACG CAGCAAATCG ACAATCATCA CGTTTTCGGC 

201 GCGGTTTTTC GGGTCTGCTT GCAACTCGGC GGCGCGGCGT TCGTCTTGTC 

251 CGTCGCCCAA AATCGGCGCG GTGCCTTTCA TCGGTTCGGT GCTGATGGTG 

301 CCGTCCGAAC CGATTTTGAG GAAGAGTTCG GGCGAGAAAC ACAGCGTCCA 

351 CGCGGATTGC CCTTGTGCAT CGGGCAGGTG GGACAAAACG GCATAG 

This corresponds to the amino acid sequence <SEQ ID 6; ORF 001.a>: 

aOOl .pep 

1 MLPQGKAARR MSANEVCGKA W AWMVLVICQ TLPKRDTLNG SGTHTVPVWA 

51 ILPRSLRSKS TIITFSARFF GSACNSAARR SSCPSPKIGA VPFIGSVLMV 

101 PSEPILRKSS GEKHSVHADC PCASGRWDKT A* 

mOOl/aOOl 96.2% identity over a 131 aa overlap 

10 20 30 40 50 60 

mOOl . pep MLPQGKAARRMSANEVCGXLXAWXVLVICQTLPKRDTLNGSGTHTVPVWAILPRSLRSKS 
I I t I I I I I I I M I I I I I I II II 1 I I I M I I I I I I I I I I I t II I I I I | I | I ! | | | | | 
a 001 .pep M L PQG KAA RRMSANEVCG KAW A WM V L V I C QT L PKR DT L N G S G T H T V P V W A I L P R S L R S Pi S 
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10 20 30 40 50 60 

70 80 90 100 110 120 

mOOl .pep TIITFSARFFGSACNSAARRSSCPSPKIGAVPFIGSVLMVPSEPILRKSSGEKHSVHADC 
! I I I I I I I I I I f I M I I ! ! I I II I I I I i I I I I i I i If I I I i I! t I I I I I I I I i I I I I I I I 
aOOl . pep T 1 1 T FS AR FFG S ACN S AARR S SC PS PK I G AV P F I G S V LMV ?S E P I LRKS S G £ KH S V H ADC 

70 80 90 100 110 120 

130 

mOOl.pep PSASGRWDKTAX 
I I ! I I I I I I I i 
aOOl.pep PCASGRWDKTAX 
130 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 001 shows 89.3% identity over a 1 3 1 aa overlap with predicted ORF (ORF 00 1 .ng) 
from TV. gonorrhoeae: 

mOOl/gOOl 

10 20 30 40 50 60 

mOOl .pep MLPQGKAARRMSANEVCGXLXAWXVLVICQTLPKRDTLNGSGTHTVPVWAILPRSLRSKS 
I I I ! I I I I I I : \ I I I I I i M I I I M I M I ! I i I I I I I II : I I i I I I I I ! i I I I I 
gOOl ML PQGKAARR V S AN E V S G RA.C ARMV LV I C QT L PKRDT LN G S GT HT LPV Vff A I L PR S LR S K S 

10 20 30 40 50 60 

70 80 90 100 110 120 

mOOl .pep TIITFSARFFG3ACNSAARRSSCPSPKIGAVPFIGSVLMYPSEPILRKSSGEKHSVHADC 
I I I M I ! II ! I I : I M i I I 1 II I I M I I II I M I I I I 1 I I i I ! : I I 1 I I t I I I I I I I I I 
gOOl TIITFSARFFGSVCNSAARRSSCPSPKIGAVPFIGSVLMVPSEAMLRKSSGEKHSVHADC 

70 80 90 100 110 120 

130 

mOOl.pep PSASGRWDKTAX 
I : : I I I I I : I 1 I 
gOOl PASSGRWDNTAX 
130 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 7>: 

g003 . seq 

1 ATGGTCGTAT TCGTGGCTGA AGGCGTATTC GGTCGCGCTG TTTTGGGTCA 

51 CTTGGTATTG CTCTTCGGTC AGGGTG3GTT TGAGTTCGGC GTCACTCGGT 

101 TTTTTATACG TTGCCGCGTC GAAGCCTTTG CCTTGCGGTG CGGCTTTGGT 

151 TTTGCCCGGC AGCGGTTCGT CGGCTTTGCG GATGTCGATG TGGCAGTAGC 

201 CGTTGGGGTT TTTAATCAGG TAGTCCTGAT GGTATTCCTC GGCGTCGTAG 

251 AAGTTTTTCA GCGGTTCGTT TTCAACAACG AGGGGCAGTT GGTATTTTTG 

301 CTGCTCGCGT TTGAGGGCGG CGGCGATGAC GGCTTTTTCG GCGGGGTCGG 

351 TGTAGTACAC GCCGCTGCGG TATTGCGTGC CGGTGTCGTT ACCCTGTTTG 

4 01 TTGAGGCTGG TCGGATCAAC GACGCGGAAA TAATATTGCA GGATGTCGTC 

4 51 CAGgCTGagt TTGTCGGCAT CGTaggtcac tTTGACGGTC TCGGCATGAC 

501 CCGTATGGCG GTaggacact tctTCgcanc TcGGGtTTTC CGTGt tGCCG 

551 TTGGCgttac cGGATACCGC gtcaACCACG CCGTcgatgc gttggaAATa 

601 ggCTTCCAAg ccccaaaagc agccgccggc gaagtaaatg gtgcccgtgt 

651 tcatgattGC TGa 

This corresponds to the amino acid sequence <SEQ ID 8; ORF 003. ng>: 

g003 .pep 

1 MVVFVAEGVF GRAVLGHLVL LFGQGAFEFG VTRFFIRCRV EAFALRCGFG 

51 FARQRFVGFA DVDV AVAVGV FNQWLMVFL GVVEVFQRFV FNNEGQLVFL 

101 LLAFEGGGDD GFFGGVG VVH AAAVLRAGW TLFV EAGRIN DAE1ILQDVV 
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151 QAEFVGI VGH FDGLGMTRMA VGHFFVRVFR VAVGVTGYRV NHAVDALEIG 
201 FQAPKAAAGE VNGARVHDC 

The following partial DNA sequence was identified in A 7 , meningitidis <SEQ ID 9>: 



m003 . seq 








i 


ATGGTCGTAT 


TCGTGGCTGA 


AGGCATATTC 


51 


CTTGsTATTG 


CTCTTCGGTC 


AGGGTGCGTT 


101 


TTTTTATACG 


TTGCCGCGTC 


GAAGCCTTTG 


151 


TTTGCCCGGC 


AGCGGTTCGT 


CAGCkTTGCG 


201 


CGTTGGGGTT 


TTTAATCAAG 


TAGTCCTGAT 


251 


AAGTTTTtCA 


GCGGCTCGTT 


TTCAACAACG 


301 


CTGCTCGCGT 


TTGAGGGCGk 


CGGCGATGAC 


351 


TGTAGTACAC 


GCCGCTGCGG 


TATTGCGTAC 


401 


TTGAGGCTGG 


TCGGATCAAC 


GACGCGGAAG 


451 


TAGGCTGAGT 


TTGTCGGCAT 


CGTAGGTCAC 


501 


CCGTATGGCG 


GTAGGACACG 


TCTTCATAGC 


551 


TTGGCGTAGC 


CGGATACCGC 


GTCAACCACG 


601 


GGCTTCCAAG 


CCCCAGAAGC 


AGCg . CCGGC 


651 


TCATGATTTT 


TGA 





TGAGTTCGGC GTCACTCGGT 
CCTTGCGGGG CGGTCTTGGT 



kCGGGGTCGG 



CGTGTTGCCG 



This corresponds to the amino acid sequence <SEQ ID 10; ORF 003>: 

m003.pep Length*. 221 

1 MWFVAEGIF GRAVLGNLXL LFGQGAFEFG VTRFFIRCRV EAFALRGGLG 

51 FARQRFVSXA DVDV AVAVGV FNQVVLMVFL GIVEVFQRLV FNNEGQLVFL 

101 LLAFEGXGDD GFFXGVG VVH AAAVLRTGVV ALFV EAGRIN DAEEILQDVV 

151 * AEFVGIVGH FDGFGVARMA VGHVFIARIF RVAVGVAGYR VNHAVDALEI 

2 01 GFQ^PEAAXG EVNGARVHDF * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1>: 



a003 . seq 






I 


ATGGTCGTAT 


TCGTGGCTGA 


51 


CTTGGTATTG 


CTCTTCGGTC 


101 


TTTTTATACG 


TTGCCGCGTC 


151 


TTTGCCCGGC 


AGCGGTTCGT 


201 


CGTTGGGGTT 


TTTAATCAAG 


251 


AAGTTTTTCA 


GCGGCTCGTT 


301 


CTGCTCGCGT 


TTGAGGGCGG 


351 


TGTAGTACAC 


GCCGCTGCGG 


401 


TTGAGGCTGG 


TCGGATCAAC 


451 


TAGGCTGAGT 


TTGTCGGCAT 


501 


CCGTATGGCG 


GTAGGACACG 


551 


TTGGCGTAGC 


CGGATACCGC 


601 


GGCTTCCAAG 


CCCCAGAAGC 


651 


TCATGATTTT 


TGA 



AGGCATATTC 
AGGGTGCGTT 
GAAGCCTTTG 
CGGCTTTGCG 
TAGTCCTGAT 
TTCAACAACG 
CGGCGATGAC 
TATTGCGTAC 
GACGCGGAAG 
CGTAGGTCAC 
TCTTCATAGC 
GTCAACCACG 
AGCCGCCGGC 



GGTCGCGCTG 
TGAGTTCGGC 
CCTTGCGGTG 
GATATCGATG 
GGTATTCCTC 
AGGGGCAGTT 
GGCTTTTTCG 
CGGTGTCGTT 
AAATATTGCA 
TTTGACGGTT 
TCGGATTTTT 
CCGTCGATGC 
GAGGTAGATG 



TTTTGGGTAA 
GTCACTCGGT 
CGGTCTTGGT 
TGGCAGTAGC 
GGCATCGTAG 
GGTATTTTTG 
GCGGGGTCGG 
GCCCTGTTTG 
GGATGTCGTC 
TCGGCGTGGC 
CGTGTTGCCG 
GTTGGAAATA 
GTGCGCGTGT 



This corresponds to the amino acid sequence <SEQ ID 12; ORF 003. a>: 

a003 .pep 

MWFVAEGIF GRAVLGNLVL LFGQGAFEFG VTRFFIRCRV EAFALRCGLG 

FARQRFVGFA DID VAVAVGV FNQVVLMVFL GIVEVFQRLV FNNEGQLVFL 

LLAFEGGGDD GFFGGVGWH AAAVLRTGVV ALFVEAGRIN DAEEILQDVV 



1 

51 
101 
151 
201 



* AEFVGIVGH FDGFGVARMA VGHVFIARIF RVAVGVAGYR VNHAVDALEI 
GFQAPEAAAG EVDGARVHDF * 



m003/a003 95.9% identity over a 220 aa overlap 



10 20 30 40 50 60 

m003 . pep MWFVAEGIFGRAVLGNLXLLFGQGAFEFGVTRFFIRCRVEAFALRGGLGFARQRFVSXA 
I I t I I I f I I I I I I i i I 1 I I I I II t i i ! f 1 I I I I I I I I I ! I i I I I I I I I M I I I I I : I 
a003 MWFVAEGI FGRAVLGNLVLLFGQGAFE FGVTRFFIRCRVEAFALRCGLG FARQRFVGFA 

10 20 30 40 50 60 
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168 



70 80 90 100 110 120 

m003 . pep dvdvavavgvfnqvvlmvflgivevfqblvfnnegqlvflllafegxgddgffxgvgvvh 

I : 1 I I I 1 I 1 II I \ I ( I I ! 11 I ! I I I I I I M i I I ! t i ( [ I I I M I I I Mill (Mill 
a003 DIDVAVAVGVFNQVVLMVFLGIVEVFQRLVFNNEGQLVFLLLAFEGGGDDGFFGGVGWH 
70 80 90 100 110 120 

130 140 150 160 170 130 

m00 3.pep AAAVLRTGWALFVEAGRINDAEEILQDVVXAEF\'GIVGHFDGFGVARMAVGHVFIARIF 

I I f I I I I 1 I i I f I (I I I I I I I I I I I I I I I I I I I I M I I f I I II I I I f I I I I ! I I ! I If I ! 
a 00 3 AAA V LRT G VV A L FVE AG R I N DAE E I LQ D W X A E FV G I VG H F DG FG V A RMA V G H V F I A R I F 

130 140 150 160 170 180 

190 200 210 220 

mOQ 3 . pep RVAVGVAGYRVNHAVDALEIGFQAPEAAXGEVNGARVHDFX 

II I I f I I I I I II I I I I II M I I I I I I ! I I I I : I I M I I ! I 
a 0 03 RVAVGVAGYRVNHAVDALEIGFQAPEAAAGEVDGARVHDFX 

190 200 210 220 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted QRF from N. gonorrhoeae 

ORF 003 shows 88.6% identity over a 219 aa overlap with ? nredicted ORF (ORF 003.ng) 
from N. gonorrhoeae: 

m003/g003 

10 20 30 40 50 60 

m003 . Deo MVVFVAEGIFGRAVLGNLXLLFGQGAFEFGVTRFFIRCRVEAFALRGGLGFARQRFVSXA 
M I I I I i I : I I I M 1 I : I I I I I I I I I I I I I M I I I I ! I I i ! i M I 1 : f II I I I I 1 : I 
g003 MWFVAEGVFGRAVLGHLVLLFGQGAFEFGVTRFFIRCRVEAFALRCGFG FARQRFVGFA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m003 . pep DVDVAVAVGVFNQVVLMVFLGIVEVFQRLVFNNEGQLVFLLLAFEGXGDDGFFXGVGWH 

I I M I I I II I t M I I I I I I I I : I I I I M : I i I I I I t i I I I I I i I I I I I I I I I Ml 
g00 3 DVDVAVAVGVFNQVVLMVFLGWEVFQRFVFNNEGQLVFLLLAFEGGGDDGFFGGVGWH 
70 80 90 100 110 120 

130 140 150 160 170 ISO 

m003 . pep aaavlrtgwalfveagrindaeeilqdwxaefvgivghfdgfgvarmavghvfiarif 

I I M i I : ! I I : I I I I M i I I M I I I I ( i I II I I I I II 1 I U : I :: I I I I I I i : i • I 
g003 AAAVLRAGWTLFVEAGRINDAEIILODWQAEFVGIVGHFDGLGMTRI4AVGKFFV-RVF 
130 140 150 160 170 180 

190 200 210 220 

m003.pep RVAVGVAGYRVNHAVDALEIGFQAPEAAXGEVNGARVHDFX 
I I 1 I I I : I I I I I I I I I I I I I I I I I I : I I I I 1 I M I I I I 

g 003 rvavgvtgyrvnhavdaleigfqapkaaagevngarvhdc 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 13>: 

g004 . seq 

1 ATGgtagAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GCGCCCATGC CAACAagtga gccaAAtgtT CGGCGGCAGG GCCTacgatT 

101 TCCGCGCCGA TAAagcggcc gGTGgctTTT tcgGCataca ggcgcaTatg 

151 gCCTTTGTTT ACCAgcatca cgcggctgcg accttgaTTT TTGAACGATA 

201 CTTCGCCgaT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

2 51 TATTTCAAAC CGACAAAGCC GATTTGCgga ctggtaaACA CCACGCCAAT 

301 GGTgctgcgg cGCAAACCGC TGCCGATATt cgGgtagcgg ccccgcgtta 

351 ttgcccggca atcttacctt ggtcggcggc ttcatGCAGC AGGGGCagtt 

401 ggttggacgc gtcgcccgca ataAAGATAT GCGGAATgct ggtCTGCATg 

4 51 gtCAGCGGAT CGGCAACGGG tacgccgcgc gcgtcrttgT CGATATTGAT 

501 GTTTTCCAAA CCGATATtgT CAACGTTCGG ACGGCaACCT ACGGCTGCCA 
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551 ACATATATTC GGCAACAAA. ACGCCTTTTT CGCCATCCTG CTCCCAATGG 

601 ACTtCtACAT TGCCGTCTGC GTCGAGTTTG ACCTCGGTTT TAGCATCCAG 

651 ATGCAGTTTC AATtctTCTC CGAACACGGC TTTCGCCTCG TCTGAAACAA 

701 CGGGGTCGGA AATGCCGCCG ATGATTCCGC CCAAACCGAA AATTTCAACT 

751 TTCACACCCA AACGGTGCAA TGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 14; ORF 004. ng>: 

g004 . pep 

1 MVERHIQHLR NGHLHLMRPC QQVSQM FGGR AYDFRADKAA GGFFGIQAHM 

51 AFVYQHHAAA TLI FERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGKHHAN 

101 GAAAQTAADI RVAAPRYCPA ILPWSAASCS RGSW LDASPA IKICGMLVCM 

151 VSGSATGTPR ASLSILMFSK PILSTFGRRP TAANIYSATN TPFSPSCSQW 

201 TSTLPSASSL TSVLASRCSF NSSPNTAFAS SETTGSEMPP MIPPKPKIST 

251 FTPKRCNA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 15>: 

m004 . seq 

1 ATGGTAGAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GTGCCCAAGC CAACAGGTGC GCCAAATGTT CGGCGGCAGG GCCTACGATT 

101 TCCGCGCCGA TAAAGCGGCC GGTGGCTTTT TCGGCATACA GGCGCATATG 

151 GCCTTTGTTC ACCAGCATCA CGCGGCTGCG GCCTTGGTTT TTGAACGATA 

201 CTTCGCCGAT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

2 51 TATTTCAGAC CGACAAAGCC GATTTGCGGA CTGGTAAACA CCACGCCGAT 

3 01 GGTGCTGCGC CGCAAACCGC CGCCGATATT CGGGTAGCGG CCGCGTTATC 

3 51 GCCGGCAATC TTGCCTTGGT CGGCAGCTTC ATGCAGCAGA GGCAGTTGGT 

4 01 TGGACGCATC GCCTGCGATG AAGATATGCG GAATACTGGT CTGCATGGTC 
4 51 AGCGGGTCGG CAACAGGTAC GCCGCGCGCA TCTTTTTCGA TATTGATATT 
501 TTCCAAACCG ATATTGTCAA CGTTCGGACG GCGGCCCACG GCTGCCAGCA 
551 TATATTCGGC AACAAATACG CCTTTTTCGC CATCCTGCTC CCAATGGACT 
601 TCTACATTGC CGTCTGCATC GAGTTTGACC TCGGTTTTAG CATCCAGATG 
651 CAGTTTCAAT TCTTCGCCGA ACACGGCGTT CGCCTCGTCT GAAACGACGG 
701 GGTCGGAAAT GCCGCCGATG ATTCCGCCCA AACCGAAAAT TTCAACTTTC 
7 51 ACGCCCAAAC GGTGCAATGC CTGA 

This corresponds to the amino acid sequence <SEQ ID 16; ORF 004>: 

m004 . pep 

1 MVERHIQHLR NGHLHLMCPS QQVRQMFGGR AYDFRADKAA GGFFGIQAHM 

51 AFVHQHHAAA ALV FERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGKHHAD 

101 GAAPQTAADI RVAAALSPAI LPWSAASCSR GSW LDAS PAM KICGILVCMV 

151 SGSATGTPRA SFSILIFSKP ILSTFGRRPT AASI YSATNT PFSPSCSQWT 

201 STLPSASSLT SVLASRCSFN SSPNTAFASS ETTGSEMPPM IPPKPKISTF 

251 TPKRCNA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 7>: 

a004 . seq 

1 ATGGTAGAAC GGCATATCCA GCATTTGCGG AACGGTCATC TTCATTTGAT 

51 GTGCCCAAGC CAACAGGTGC GCCAAATGTT CGGCGGCCGG ACCTACGATT 

101 TCTGCGCCGA TGAAGCGGCC GGTGGCTTTT TCGGCATACA GGCGCATATG 

151 GCCTTTGTTT ACCAGCATCA CGCGGCTGCG GCCTTGGTTT TTGAACGATA 

201 CTTCGCCGAT GACAAATTCG TCGGCTTGGT ATTGCGCGGC AACCTGCGCG 

2 51 TATTTCAAAC CGACAAAGCC GATTTGCGGA CTGGTGAACA CTACGCCGAT 
301 GGTGCTGCGG CGCAAACCGC CGCCGATATT CGGGTAGCGG CCGCGTTATC 

3 51 GCCGGCAATC TTGCCTTGGT CGGCGGCTTC ATGCAGCAGG GGCAGTTGGT 

4 01 TGGACGCGTC GCCCGCAATA AAGATATGCG GAATACTGGT CTGCATAGTC 

4 51 AGCGGATCGG CAACGGGTAC GCCGCGCGCA TCTTTTTCGA TATTGATGTT 
501 TTCCAAACCG ATATTGTCAA CGTTCGGACG GCGGCCTACG GCTGCCAGCA 

5 51 TATATTCGGC AACAAATACG CCTTTTTCGC CATCCTGCTC CCAATGGACT 
601 TCTACATTGC CGTCTGCGTC GAGTTTGGCC TCGGTTTTAG CATCCAAATG 
651 CAGTTTCAAT TCTTCACCGA ACACGGCTTT CGCCTCGTCT GAAACGACGG 
701 GGTCGGAAAT GCCGCCGATG ATGCCACCCA AACCGAAAAT TTCAACTTTC 
7 51 ACGCCCAAAC GGTGCAATGC CTGA 
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T\is corresponds to the amino acid sequence <SEQ ID 18; ORF 004. a>: 

a004 .pep 

1 MVERHIQHLR NGHLHLMCPS QQVRQMFGGR TYDFCADEAA GGFFGIQAHM 

SI AFVYQHHAAA ALVFERYFAD DKFVGLVLRG NLRVFQTDKA DLRTGEHYAD 

101 GAAAQTAADI RVAAALSPAI LPWSAASCSR GSWL DA5PAI KICGILVCIV 

151 SGSATGTPRA 5FSILMFSK? ILSTFGRRPT AASIYSATNT PFSPSCSQWT 

201 STLPSASSLA SVLASKCSFK SSPNTAFASS ETTGSEMPPM MPPKPKISTF 

251 TPKRCNA" 

m004/a004 94.9% identity over a 257 aa overlap 

10 20 30 40 50 60 

m004 . pep MVERHIQHLRNGHLHLMCPSQQVRQMFGGRAYDFRADKAAGGFFGIQAHMAFVHQHHAAA 
I i I I I I I I II I i i i i I t I I I I I I j I I I I I I : 1 I I I I : I I I I I I I I I I I I I t i : I I I I I I 
a 004 MVERHIQHLRNGHLHLMCPSQQVRQMFGGRTYDFCADEAAGGFFGIQAHMAFVYOHHAAA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m004 . pep ALVFERYFADDKFVGLVLRGNLRVFQTDKADLRTGKHHADGAAPQTAADIRVAAALSPAI 
! I I I I ! I i 11 H i 1 II I I I I I I \ \ I 1 ! M 1 i \ I I i : I : I I H I I Hi III III M III I 
a00 4 ALVFERYFADDKFVGLVLRGNLRVFQTDKADLRTGEHYADGAAAQTAADI RVAAALSPAI 

70 80 90 100 110 120 

130 140 150 160 170 180 

m004 .pep LPWSAASCSRGSWLDASPAMKICGILVCMVSGSATGTPRASFSILI FSKP ILSTFGRRPT 

! I I J I I I I I i I ! II I I I I I : I I I i I! I I : I f I I I I I I! I I i ! H i : I 1 I H ! I I H i I I I 
a00 4 LPWSAASCSRGSWLDASPAIKICGILVCIVSGSATGTPRASFSILMFSKPILSTFGRRPT 

130 140 150 160 170 180 

190 200 210 220 230 240 

m004 .pep AASIYSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFNSSPNTAFASSETTGSEMPPM 
I I i I I I I I I I I I I I I ! I I I II I I I I I I ! I : I I t I I : I I I i I I II I I I i I H I I I I! I I I I 
a 004 AASIYSATNTPFSPSCSQWTSTLPSASSLASVLASKCSFNSSPNTAFASSETTGSEMPPM 

190 200 210 220 230 240 

250 

m004 .pep I PPKPKISTFTPKRCNAX 

: I I I ( ! I I M I I I I I I I I 
a004 MP PK PKI STFT PKRCNAX 

250 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 004 shows 93.4% identity over a 258 aa overlap with a predicted ORF (ORF 004. ng) 
from N. gonorrhoeae; 

m004/g004 

10 20 30 40 50 60 

m004 .pep MVERHIQHLRNGHLHLMCPSQQVRQMFGGRAYDFRADKAAGGFFGIQAHMAFVHQHHAAA 
I I I I I I I I I II I I I I I I 1 III i II I I I I I I I I I 1 ! I I I I I I I M I I I I M : I I I II I 
g 004 MVERHIQHLRNGHLHLMRPCQQVSQMFGGRAYDFRADKAAGGFFGIQAHMAFVYQHHAAA 

10 20 30 40 50 60 

70 80 90 100 110 119 

m004 . pep ALVFERY FADDKFVGLVLRGN LRV FQTDKADLRTGKHHADGAAPQTAAD I RVAAA- LS PA 

: I : I I I I I I I I I I I ! I H I I I II I ! I I I I I I I I I I i! I I : I I I I I I I I I I I I I I j 

g004 TLIFERY FADDKFVGLVLRGN LRV FQT DKADLRTGKHHANGAAAQTAADIRVAAPRYCPA 

70 80 90 100 110 120 

120 130 140 150 160 170 179 

m004 .pep ILPWSAASCSRGSWLDASPAMKICGILVCMVSGSATGTPRASFSILIFSKPILSTFGRRP 
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t I t 1 I I I I 1 ) I ! 1 i M I I II : I I I 1 : I I t t I 1 I i M I 1 i 11 I : I I I : H M M 1 I 1 I I W 
g004 I LPWSAASCSRGSWLDASPAIKICGMLVCMVSGSATGTPRASLS I LMFSKPI LSTFGRRP 

130 140 150 160 170 180 

180 190 200 210 220 230 239 

m004 . pep TAASIYSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFHS3PNTAFASSETTGSEMPP 
I I ! : I I I I 1 I I I I i I M I I I I I II I I I I I I I I I I ! I I I I I I I If I I I I I i I I I I I II I I I 
q004 TAAN I YSATNTPFSPSCSQWTSTLPSASSLTSVLASRCSFNSS PNTAFASSETTGSEMPP 

190 200 210 220 230 240 

240 250 
m004.pep MIPPKPKISTFTPKRCNAX 

i I ; I I I I I I I I M i I I I I 
g004 MIPPKPKISTFTPKRCNA 
250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 19>: 

g005 . seq 

1 ATGGGGATGG ACAATATTGA TATGTTCATG CCTGAACAAG AGGAAATCCA 

51 ATCAATGTGG AAAGAAATTT TACTGAATTA CGGTATTTTC CTGCTCGAAC 

101 TGCTTACCGT GTTCGGCGCA ATTGCGCTGA TTGTGTTGGC TATCGTACAG 

151 AGTAAGAAAC AGTCGGAAAG CGGCAGTGTC GTACTGACAG ATTTTTCGGA 

201 AAATTATAAA AAACAGCGGC AATCGTTTGA AACATTCTTT TTAAGCGAGG 

251 AAGAGACAAA ACATCAGGAA AAAAAAGAAA AGAAAAAGGA AAAGGCGGAA 

301 GCCAAAGCAG AGAAAAAGCG TTTGAAGGAG GGCGGGGAGA AATCTGCCGA 

351 AACGCAAAAA TCCCGCCTTT TTGTGTTGGA TTTTGACGGC GATTTGTATG 

4 01 CACACGCCGT AGAATCCTTG CGTCATGAGA TTACGGCGGT GCTTTTGATT 

4 51 GCCAAGCCTG AAGATGAGGT TCTGCTCAGA TTGGAAAGTC CGGGCGGCGT 

501 GGTTCACGGT TACGGTTTGG CGGCTTCGCA GCTTAGGCGT TTGCGCGAAC 

551 GCAATATTCC GCTGAccgcc gccgTCGATA AGGTCGCGGC AAGCGgcggc 

601 tatatgatgg cgtgtgtgGC GGATAAAATT GTTTCCGCt c cgtttgcggt 

651 catcggttcg gtgggtgtgg tgGcggaagt gcCGAATATC CAccgCctGT 

7 01 TGAAAAAACA TGATATTGAT GTGGATGTGA TGACGGCGGG CGAATTTAAG 

7 51 CGCACGGTTA CTTTTATGGG TGAAAATACG GAAAAGGGCA AACAGAAATT 

801 CCGGCAGGAA CTGGAGGAAA CGCATCAGTT GTTCAAGCAG TTTGTCAGTG 

851 AAAACCGCCC CGGGTTGGAT ATTGAAAAAA TAGCGACGGG CGAGCATTGG 

901 TTCGGCCGGC AGGCGTTGGC GTTGAACTTG ATTGACGAGA TTTCGACCAG 

951 TGATGATTTG TTGTTGAAAG CGTTTGAAAA CAAACAGGtt aTCGAAGTGA 

1001 AATATCAGGA GAAGCGAAGC CTGATCCAGC GCATTGGTTT GCAGGCGGAA 

1051 GCTTCCGTTG AAAAGTTGTT TGCCAAACTT GTCAACCGGC GAGCGGATGT 

1101 GATGTAG 

This corresponds to the amino acid sequence <SEQ ID 20; ORF 005.ng>: 

g005 .pep 

1 MGMDNIDMFM PEQEEIQSMW KEILLNYG IF LLELLTVFGA IALIV LAIVQ 

51 SKKQSESGSV VLTDFSENYK KQRQSFETFF LSEEETKHQE KKEKKKEKAE 

101 AKAEKKRLKE GGEKSAETQK SRLFVLDFDG DLYAHAVESL RHEITAVLLI 

151 AKPEDEVLLR LESPGGWHG YGLAASQLRR LRERNIPLTV AVDKVAASGG 

201 YMMACVADK I V5APFAVIGS VGVVAE VPNI HRLLKKHDID VDVMTAGEFK 

2 51 RTVTFMGENT EKGKQKFRQE LEETHQLFKQ FVSENRPGLD IEKIATGEKW 

301 FGRQALALNL IDEISTSDDL LLKAFENKQV IEVKYQEKRS LIQRIGLQAE 

351 ASVEKLFAKL VNRRADVM* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 21>: 

m005 . seq 

1 ATGGACAATA TTGACATGTT CATGCCTGAA CAAGAGGAAA TCCAATCAAT 

51 GTGGAAAGAA ATTTTACTGA ATTACGGTAT TTTCCTGCTC GAACTGCTTA 

101 CCGTGTTCGG CGCAATTGCG CTGATTGTGT TGGCTATCGT ACAGAGTAAG 

151 AAACAGTCGG AwAGCGGCAG TGTCGTACTG ACGGATTTTT CGGAAAATTA 

201 TAAAAAACAG CGGCAATCGT TTGAAGCATT CTTTTTAAGC GGGGAAGAGG 

2 51 CACAACATCA GGAAAAAGAG GAAAAGAAAA AGGAAAAGGC GGAAGCC AAA 
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301 GCAGAGAAAA A. CGTTTGAA GGAGGGTGGG GAGAAATCTG CCGAAACGCA 

351 nAAATCACGC CTTTTTGTGT TGGANNNNNN NNNNNNNNNN NNNNNNNNNN 

4 01 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

4 51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

5 51 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNGCGAGCGG CGGTTATATG 
601 ATGGCGTGTG TGGCGGATAA AATTGCTTCC GCTCCGTTTG CGATTGTCGG 
651 TTCGGTGGGT GTGGTGGCGG AAGTACCGAA TATCCACCGC CTGTTGAAAA 
701 AACATGATAT TGATGTGGAT GTGATGACGG CGGGCGAATT taagCGCA^G 

7 51 GTTACTTTTA TGGGTGAAAA TACGGAAAAG GGCAAACAGA AATTCCGACA 
801 GGAACTGGAG GAAACGCATC AGTTGTTCAA GCAGTTTGTC AGCGAGAACC 

8 51 GCCCTCAATT GGATATTGAG GAAGTGGCAA CGGGCGAGCA TTGGTTCGGT 
901 CGGCAGGCGT TGGCGTTGAA CTTGATTGAC GAGATTTCGA CTAGTGATGA 
951 TTTGTTGTTG AAAGCGTTTG AAAACAAACA GGTTATCGAA GTGAAATA T C 

1001 AGGAGAAGCA AAGCCTGATC CAGCGCATTG GTTTGCAGGC GGAAGCTTCT 

1051 GTTGAAAAGT TGTTTGCCAA ACTTGTCAAC CGGCGGGCGG ATGTGATGT A 

1101 G 



This corresponds to the amino acid sequence <SEQ ID 22; ORF 005>: 

m005 .pep 

1 MDNIDMFMPE QEEIQSMWKE ILLNYG IFLL ELLT VFGAIA_LIVLATVO.sk 

51 KQSXSGSWL TDFSENYKKQ RQSFEAFFLS GEEA V - V ^K^ ^KKKEKAEAK 

101 AEKXRLKEGG EKSAETXKSR LFVLXXXXXX XXXXXXXXXX XXXXXXXXXX 

151 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXASGGYM 

201 MACVADK IAS APFAIVGSVG VVAEV PNIHR LLKKKDIDVD VMTAGEFKRT 

251 VTFMGENTEK GKQKFRQELE ETKQLFKQFV SENRPQLDIE EVATGEHWFG 

301 RQALALNLID EISTSDDLLL KAFENKQVIE VKYQEKQSLI QRIGLQAEAS 

351 VEKLFAKLVN RRADVM* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 23>: 

a005.seq 

1 ATGGACAATA TTGACATGTT CATGCCTGAA CAAGAGGAAA TCCAATCAAT 

51 GTGGAAAGAA ATTTTACTGA ATTACGGTAT TTTCCTGCTC GAACTGCTTA 

101 CCGTGTTCGG CGCAATTGCG CTGATTGTGT TGGCTATCGT ACAGAGTAAG 

151 AAACAGTCGG AAAGCGGCAG TGTCGTACTG ACGGATTTTT CGGAAAATTA 

201 TAAAAAACAG CGGCAATCGT TTGAAGCATT CTTTTTAAGC GGGGAAGAGG 

2 51 CAAAACATCA GGAAAAAGAG GAAAAGAAAA AGGAAAAGGC GGAAGCCAAA 

301 GCAGAGAAAA AGCGTTTGAA GGAGGGTGGG GAGAAATCTT CCGAAACGCA 

351 AAAATCCCGC CTTTTTGTGT TGGATTTTGA CGGCGATTTG TATGC^CACG 

4 01 CCGTAGAATC CTTGCGTCAT GAGATTACGG CGGTGCTTTT GATTGCCAAG 

4 51 CCTGAAGATG AGGTTCTGCT TAGATTGGAA AGTCCGGGCG GCGTGGTTCA 

501 CGGTTACGGT TTGGCGGCTT CGCAGCTTAG GCGTTTGCGC GAACGCAATA 

551 TTCCGCTGAC CGTCGCCGTC GATAAGGTGG CGGCGAGCGG TGGTTATATG* 

601 ATGGCGTGTG TGGCGGATAA AATTGTTTCC GCTCCGTTTG CGATTGTCGG 

651 TTCGGTGGGT GTTGTAGCGG AAGTACCGAA TATCCACCGC CTGTTGAAAA 

7 01 AACATGATAT TGATGTGGAT GTGATGACGG CGGGCGAATT TAAGCGCACG 

7 51 GTTACTTTTA TGGGTGAAAA TACGGAAAAG GGCAAACAGA AATTCCGACA 

801 GGAACTGGAG GAAACGCATC AGTTGTTCAA GCAGTTTGTC AGCGAGAACC 

851 GCCCTCAATT GGATATTGAG GAAGTGGCAA CGGGCGAGCA t TG Gt T CGGT 

901 CGGCAGGCGT TGGCGTTGAA CTTGATTGAC GAGATTTCGA CCAGTGATGA 

951 TTTGTTGTTG AAAGCGTTTG AAAACAAACA GGTTATCGAA GTGAAATATC 

1001 AGGAGAAGCA AAGCCTGATC CAGCGCATTG GTTTGCAGGC GGAAGCTTCT 

1051 GTTGAAAAGT TGTTTGCCAA ACTTGTCAAC CGGCGGGCGG A T GTGATGTA 

1101 G 



This corresponds to the amino acid sequence <SEQ ID 24; ORF 005. a>: 

a005.pep 

1 MDNIDMFMPE QEEIQSMWKE ILLNYG IFLL ELLTVFGAIA LI VLATVO.SK 

51 KQSESGSVVL TDFSENYKKQ RQSFEAFFLS GEEAKHQEKE EKKKEKAEAK 

101 AEKKRLKEGG EKSSETQKSR LFVLDFDGDL YAHAVESLRH EIT A VLLIAK 

151 PEDEVLLRLE SPGGVVHGYG LAASQLRRLR ERNIPLTVAV DKVAASGG V M 

201 MACVADK IVS APFAIVGSVG WAEV PNTHP LLKKKDIDVD VMTAGEFKRT 

251 VTFMGENTEK GKQKFRQELE ETHQLFKQFV SENRPQLDIE EVATGEHWFG 
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301 RQALALNLID EISTSDDL__ KAFENKQVIE VKYQEKQSLI QRIGLQAEAS 
351 VEKLFAKLVN RRADVM* 



moos/aoos 79.2% identity over a 366 aa overlap 



10 20 30 40 50 60 

m005. pep MDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSXSGSWL 
I I I I I I I I I I I 1 I I I I i I If I I I i I I I I I I I I I I I I ! I I I I I I I I I I I i I I ! ! MM!! 
a00 5 MDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSESGSVVL 
10 20 30 40 50 60 



70 80 90 100 110 120 

m005. pep TDFSENYKKQRQSFEAFFLSGEEAQHQEKEEKKKEKAEAKAEKXRLKEGGEKSAETXKSR 
I M I II M I I II I II 11 I M M M M I I I M M II M I I I I M IMIMIIhll Ml 
a005 TDFSENYKKQRQSFEAFFLSGEEAKHQEKEEKKKEKAEAKAEKKRLKEGGEKSSETQKSR 

70 80 90 100 110 120 



130 140 150 160 170 180 

m005 . pep LFVLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 
MM : 

a005 lfvldfdgdlyahaveslrheitavlliakpedevllrl.es PGGWHGYGLAASQLRRLR 

130 140 150 160 170 180 



190 200 210 220 230 240 

mOOS.pep XXXXXXXXXXXXXXASGGYMMACVADKIASAPFAIVGSVGVVAEVPNIHRLLKKHDIDVD 

( M M M M M i i ! : 1 M M M M M i M M M I M M M M M I i 
a00 5 ERNIPLTVAVDKVAASGGYMMACVADKIVSAPFAIVGSVGVVAEVPNIHRLLKKHDIDVD 

190 200 210 220 230 240 



250 260 270 230 290 300 

m005 , pep VMTAGEFKRTVTFMGENTEKGKQKFRQELEETHQLFKQFV3ENRPQLDIEEVATGEHWFG 
M I M I I II I M I I M I I M I II M II I I I I I I M ! I II I M I I I M II I M M M t M I 
a 005 VMTAGEFKRTVTFMGENTEKGKQKFRQELEETHQLFKQFVSENRPQLDIEEVATGEHWFG 

250 260 270 280 290 300 



310 320 330 340 350 360 

m005 . pep RQALALNLI DEI ST SDDLLLKAFENKQV I EVKYQEKQSL I QRIGLQAE AS VEKLFAKLVN 

I I I I I M I II I M I M I II M I II M M II M M M I I II M I I M M I M M I M M M 
a 005 RQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKQSLIQRIGLQAEASVEKLFAKLVN 

310 320 330 340 350 360 



m00 5.pep RRADVMX 
II I I I I I 

a005 RRADVMX 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 005 shows 77.0% identity over a 366 aa overlap with a predicted ORF (ORF 005. ng) 
from N. gonorrhoeae: 

m005/g005 

10 20 30 40 50 

m005.pep MDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSXSGSV 
I I I i I I M II i II M I II I I i I I I M M M I I II M II I II I I M I II M M I MM 
g00 5 MGMDNIDMFMPEQEEIQSMWKEILLNYGIFLLELLTVFGAIALIVLAIVQSKKQSESGSV 
10 20 30 40 50 60 



60 70 80 90 100 110 

m005 . pep VLTDFSENYKKQRQSFEAFFLSGEEAQHQEKEEKKKEKAEAKAEKKRLKEGGEKSAETXK 

M M M M M M M I t I M M I I I : : M I t : I I I I I I 1 I I H I t I I I I! 1 I I I I I I I ( 
g005 VLTDFSENYKKQRQSFETFFLSEEETKHQEKKEKKKEKAEAKAEKKRLKEGGEKSAETQK 
70 80 90 100 110 120 
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120 130 140 150 160 170 

mOO 5 . pep SRLTVLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 

Mill! : 
g 005 SRLFVLDFDGDLYAHAVESLRHE ITAVLLIAKPEDEVLLRLESPGGVVHGYGLAASQLRR 

130 14C 150 160 170 180 



180 190 200 210 220 230 

m005 . pep X XX X X XX X XXXXX XX X A S GG Y MMAC VA DK I A S A P FA I V G 5 VG V V AE V P N I H R L L KKK D I D 

: I I I I I I ! ! I I I I I i : I I I ! I : : I I i M i I I I I II If I I f I I i I I 

g 005 LRERNIPLTVAVDKVAASGGYMMACVADKIVSAPFAVIGSVGVVAEVPNIHRLLKKHDID 
190 200 210 220 230 240 



240 250 260 270 280 290 

m005 . pep VDVMTAGEFKRTVTFMGENTEKGKQKFRQELEETHQLFKQFVSENRPQLDIEEVATGEHW 
I I I I M 1 I I I I I I I I I i I I II II I I ! i 11 i I ! I I I I ii I M 1 i i I 1 I I I II : : I I I I I I 
gOOS VDVMTAGEFKRTVTFMGENTEKGKQKFRQELEETHQLFKQFVSENRPGLDIEKIATGEHW 

250 260 270 280 290 300 



300 310 320 330 340 350 

m005.pep FGRQALALNLIDEISTSDDLLLKAFENKQVIEVKYQEKQSLIQRIGLQAEASVEKLFAKL 
I I II I I I M I I! ( I I M I I II 1 t I I M ! I I I I I ! M I I : t I I ! 1 M M i I I I I I I M 1 II 
g 005 FGRQALALNLI DEISTSDDLLLKAFENKQVIEVKYQEKRSLIQRIGLQAEASVEKLFAKL 

310 320 330 340 350 360 



360 

mO 0 5 . pep VNRRADVMX 
I I I f I I I I I 
g 005 VNRRADVMX 



The following partial DNA sequence was identified in K gonorrhoeae <SEQ ID 25>: 

g0O6. seq 

1 ATGCTGCTGG TGCTggaatt ttggttCGGc gtGtCGGCGG TGGGCatact 

51 tgCGTTGTTT TTATGGCtt t TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAACAAC AGCTTGGAAC gcgACAACCA CTTTATCCGA 

151 AAAGGCGACG AGCGGCAGCT GTACCGCCAT TACGGACTGG TTTCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCG 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

301 GGCTACGGCA GCGCGGGGCA TATTTATTCG GTCGGCACTT ATCTGTGGAT 

351 GTTTGCCATG AGTTTGGACG ATGTGCCGCG ATTGGTCGAA CAATATTCCA 

4 01 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGGAACG GAACATCAAA 

4 51 GCCGGAACTT GA 



This corresponds to the amino acid sequence <SEQ ID 26; ORF 0O6.ng>: 

g006 -pep 

1 MLLVLEFWFG VSAVGILALF LWLLPRFAAI SENLYFRLNN SLERDNHFIR 

51 KGDERQLYRH YGLVSRLRVL I 5NREAFGYL CVGAAMGILF GFAFVM MTLK 

101 GYGSAGHIYS VGTYLWMFAM SLDDVPRLVE QYSNLKDIGQ RIEWSERNIK 

151 AGT+ 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 27>: 

m006 . seq 

1 ATGCTGCTGG TGCTGGAATT TTGGGTCGGC GTGTCGGCGG TGGGCATACT 

51 TGCGTTGTTT TTATGGCTTT TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAACAAC AGCTTGGAAC GCGACAACCA CTTTATCCGA 

151 AAAGGCGACC GGCGGCAGCT GTACCGCCAT TACGGACTGC TTGCGCGCCT 

201 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCA 

251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 

3 01 GGCTACAGCA GCGCGGGGCA TGTCTATTCG GTCGGCACTT ATCTGTGGAT 
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3 51 GTTTGCCATG AGTTTGGACG ACGTGCCGCG ATTGGTCGAA CAATATTCCA 

4 01 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGGAACG GAACATCAAA 
4 51 GCCGGAACTTGA 

This corresponds to the amino acid sequence <SEQ ID 28; ORF 006>: 

jn00 6 .pep 

1 MLLVLEFWVG VSAVGILALF LWLLPRFAAI SENLYFRLNN SLERDNHFIR 
51 KGDRRQLYRH YGLLARLRVL I5NREAFGYL CVGTAMGILF GFAFVM MTLK 
101 GYSSAGHVYS VGTYLWMFAM SLDDVPRLVE QYSNLKDIGQ RIEWSERNIK 
151 AGT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 29>: 

a006 . seq 

i ATGCTGCTGG TGCTGGAATT TTGGGTCGGC GTGTCGGCGG TGGGCATACT 

51 TGCGTTGTTT TTATGGCTTT TGCCACGTTT TGCCGCCATC AGCGAAAACC 

101 TGTATTTCCG CCTGAAGAAC AGCTTGGAAC GCGACAACCA CTTTATCCGA 

151 AAAGGCGACG AGCGGCAGCT GGACCGCCAT TACGGACTGC TTGCGCGCCT 

2 01 GCGTGTGCTG ATTTCCAACC GCGAAGCCTT CGGCTATCTC TGCGTCGGCA 
251 CGGCGATGGG TATTTTGTTC GGCTTTGCTT TTGTGATGAT GACGCTCAAA 
301 GGCTACAGCA GCGCGGGGCA TGTCTATTCG GTCGGCACTT ATCTGTGGAT 

3 51 GTTTGCCATA AGTTTGGACG ACGTGCCGCG ATTGGTCGAA CAATATTCCA 

4 01 ATTTGAAAGA CATCGGACAA CGGATAGAGT GGTCGAAACG GAACATCAAA 
4 51 GCCGGAACTT GA 

This corresponds to the amino acid sequence <SEQ ID 30; ORF 006. a>: 

a006 .pep 

1 MLLVLEFWVG VSAVGILALF LWLLPRFAAI SENLYFRLKN SLERDNHFIR 
51 KGDERQLDRH YGLLARLRVL ISNREAFGY L CVGTAMGILF GFAFVM MTLK 
101 GYSSAGHVYS VGTYLWMFAI SLDDVPRLVE QYSNLKDIGQ RIEWSKRNIK 
151 AGT* 

mooe/a006 96.7% identity over a 1 53 aa overlap 

10 20 30 40 50 60 

m00 6.pep MLLVLEFWVGVSAVGILALFLWLLPRFAAI SENLYFRLNNSLERDNHFIRKGDRRQLYRH 

II I I I I I I I I I I I I I I I I I I I I 1 1 I I I I I I I I I I i i I I : ! I I j I I I I I ! I M I : I I I I ! 
a 006 MLLVLEFWVGVSAVGILALFLWLLPRFAAI SENLYFRLKNSLERDNHFIRKGDERQLDRH 

10 20 30 40 50 60 

70 80 90 100 110 120 

mOOo.pep YGLLARLRVLISNREAFGYLCVGTAMG I LFGFAFVMMTLKGYSSAGHVYS VGTYLWMFAM 

I I I ! I I M I ! M I i I I I I I I I t I M I i i I i I I : ! II I I I ! I ! i. M I. I I I ; ! I I I I I i I : 
a 006 YGLLARLRVLISNREAFGYLCVGTAMG I LFGFAFVMMTLKGYSSAGHVYS VGTYLWMFAI 

70 80 90 100 110 120 

130 140 150 

rn006 . pep SLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
.1 I I M I I I I I I I I I I I I I I l I 1 I I : I I I I I I I I 
a 006 SLDDVPRLVEQYSNLKDIGQRIEWSKRNIKAGTX 

130 140 150 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 006 shows 95.4% identity over a 153 aa overlap with a predicted ORF (ORF 006.ng) 
from N. gonorrhoeae: 

m006/g006 

10 20 30 40 50 60 

m0 0 6 . pep MLLVLE FWVGVSAVG I LALFLWLLPR FAAIS ENL Y FRLNNSLERDNKF I RKGDRRQL YRH 
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IIIIIIII IIIIIIMIIIIIIIMilllllllillMllllllliMlllhllilll 

g 00 6 MLLVLEFWFGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERDNKFIRKGDERQLYRH 
10 20 30 40 50 60 

70 80 90 100 110 120 

mOOo pep YGLIiARLRVLISNREAFGYLCTGTAMGILFGFAFVmTLKGYSSAGHVYSVGTYLWMFAM 

I I I - I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I : I I I I : I I I I II I I I I I I 
g 00 6 YGLVSRLRVLISNREAFGYLCVGAAMGILFGFAFVMMTLKGYGSAGHIYSVGTYLWMFAM 
70 80 90 100 110 120 

130 140 150 

m006 .oep SLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 

IIIMMIIIMMIIIIIIIIIIMIIIIIII 

gll3 SLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGT 
130 140 150 



The following partial DNA sequence was identified m N. gonorrhoeae <SEQ ID 31>: 

g006-l . seq 

1 ATGTGGAAAA TGTTGAAACA CATAGCCAAA ACCC" T3CA AGCGATTGAT 

51 TGGCACATTT TCCCCGGTCG GACTGGAAAA CCTT1 ^GATG CTGGGGTATC 

101 CGGTGTTTGG CGGCTGGGCG ATTAATGCCG TGATTGCGGG GAGGGTGTGG 

151 CAGGCGTTGC TGTACGCTTT GGTTGTATTT TTGATGTGGC TGGTCGGTGC 

201 GGCACGGCGG ATTGCCGATA CGCGCACGTT TACGCGGATT TATACCGAAA 

2 51 TCGCCGTGCC GGTTGTGTTG GAACAACGGC AGCGGCAAGT CCCGCATTCA 

301 GCGGTAACTG CACGGGTTGC CCTGTCGCGT GAATTTGTCA GCTTTTTTGA 

351 AGAACACCTG CCGATTGCCG CGACATCCGT CGTATCCATA TTCGGCGCGT 

4 01 GCATCATGCT GCTGGTGCTG GAATTTTGGG TCGGCGTGTC GGCGGTGGGC 

4 51 ATACTTGCGT TGTTTTTATG GCTTTTGCCA CGTTTTGCCG CCATCAGCGA 

5C1 AAACCTGTAT TTCCGCCTGA ACAACAGCTT GGAACGCGAC AACCACTTTA 

551 TCCGAAAAGG CGACGAGCGG CAGCTGTACC GCCATTACGG ACTGGTTTCG 

601 CGCCTGCGTG TGCTGATTTC CAACCGCGAA GCCTTCGGCT ATCTCTGCGT 

651 CGGCGCGGCG ATGGGTATTT TGTTCGGCTT TGCTTTTGTG ATGATGACGC 

7 01 TCAAAGGCTA CGGCAGCGCG GGGCATATTT ATTCGGTCGG CACTTATCTG 

7 51 TGGATGTTTG CCATGAGTTT GGACGATGTG CCGCGATTGG TCGAACAATA 

8 01 TTCCAATTTG AAAGACATCG GACAACGGAT AGAGTGGTCG GAACGGAACA 
8 51 TCAAAGCCGG AACTTGA 

This corresponds to the amino acid sequence <SEQ ID 32; ORF 006-l.ng>: 

g006-l .pep 

1 MWKMLKHIAK THRKRLIGTF SPVGLENLLM LGYPVFGGWA INAVIAGRVW 

51 Q ALLYALVVF LMWLVGAA RR IADTRTFTRI YTEIAVPVVL EQRQRQVPHS 

101 AVTARVALSR EFVSFFEEHL PIA ATSVVSI FGACIMLLVL EF WVGVSAVG 

151 1LALFLWLL P RFAAISENLY FRLNNSLERD NHFIRKGDER QLYRHYGLVS 

201 RLRVLISNRE AFGY LCVGAA MGILFGFAFV M MTLKGYGSA GHIYSVGTYL 

251 WMFAMSLDDV PRLVEQYSNL KDIGQRIEWS ERNIKAGT* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 33- 

m006-l . seq 

1 ATGTGGAAAA TGTTGAAACA CATAGCCCAA ACCCACCGCA AGCGATTGAT 

51 TGGCACATTT TCCCTGGTCG GACTGGAAAA CCTTTTGATG CTGGTGTATC 

101 CGGTGTTTGG CGGCCGGGCG ATCAATGCCG TGATTGCGGG GGAGGTGTGG 

151 CAGGCGTTGC TGTACGCTTT GGTTGTGCTT TTGATGTGGC TGGTCGGTGC 

201 GGTGCGGCGG ATTGCCGATA CGCGCACGTT TACGCGGATT TATACCGAAA 

251 TCGCCGTGCC GGTCGTGTTG GAACAGCGGC AGCGACAAGT CCCGCATTCG 

301 GCGGTAACTG CGCGGGTTGC CCTGTCGCGT GAGTTTGTCA GCTTTTTTGA 

351 AGAACACCTG CCGATTGCCG CGACATCCGT CGTATCCATA TTCGGCGCGT 

4 01 GCATCATGCT GCTGGTGCTG GAATTTTGGG TCGGCGTGTC GGCGGTGGGC 

4 51 ATACTTGCGT TGTTTTTATG GCTTTTGCCA CGTTTTGCCG CCATCAGCGA 

501 AAACCTGTAT TTCCGCCTGA ACAACAGCTT GGAACGCGAC AACCACTTTA 

551 TCCGAAAAGG CGACCGGCGG CAGCTGTACC GCCATTACGG ACTGCTTGCG 

60 i CGCCTGCGTG TGCTGATTTC CAACCGCGAA GCCTTCGGO ATCTCTGCGT 
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651 CGGCACGGCG ATGGGTAT . . TGTTCGGCTT TGCTTTTGTG ATGATGACGC 

701 TCAAAGGCTA CAGCAGCGCG GGGCATGTCT ATTCGGTCGG CACTTATCTG 

7 51 TGGATGTTTG CCATGAGTTT GGACGACGTG CCGCGATTGG TCGAACAATA 
801 TTCCAATTTG AAAGACATCG GACAACGGAT AGAGTGGTCG GAACGGAACA 

8 51 TCAAAGCCGG AACTTGA 



This corresponds to the amino acid sequence <SEQ ID 34; ORF 006- 1>: 

m006-l . pep 

1 MWKMLKHIAQ TKRKRLIGTF SLVGLENLLM LVYPVFGGRA INAVIAGEVW 

51 Q ALLYALWL LMWLVGAV RK IADTRTFTRI YTEIAVPVVL EQRQRQVPHS 

101 AVTARVALSR EFVSFFEEHL PIA ATSWSI FGACIMLLVL E FWVGVSAVG 

151 ILALFLWLL P RFAAISENLY FRLNNSLERD NHFIRKGDRR QLYRHYGLLA 

201 RLRVLISNRE AFGY LCVGTA MGIIFGFAFV M MTLKGYSSA GHVYSVGTYL 

2 51 WMFAMSLDDV PRLVEQYSNL KDIGQRIEWS ERNIKAGT* 

moo6-i/g006-i 95.5% identity in 288 aa overlap 



10 20 30 40 50 60 

m00 6~l .pep MWKMLKHIAQTHRKRLIGTFSLVGLENLLMLVYPVFGGRAINAVIAGEVWQALLYALWL 
I I I I i i ! I I : I I I ! I I I M I ( MMIIill I I M 1 i I ! II 1 I M : I I I I I 1 I i If I : 
g00 6-l MWKMLKHIAKTHRKRLIGTFSPVGLENLLMLGYPVFGGWAINAVIAGRVWQALLYALWF 

10 20 30 40 50 60 



70 80 90 100 110 120 

m00 6-l .pep LMWLVGAVRRIADTRTFTRIYTEIAVPVVLEQRQRQVPHSAVTARVALSREFVSFFEEHL 
I I I I I t I : I i i I I M I I I I I i I I I I I I M I I I I I I I I I t I j I I I I i I f M I I M I II I II 
g006-l LMWLVGAARRIADTRTFTRIYTEIAVPVVLEQRQRQVPHSAVTARVALSREFVSFFEEHL 

70 30 90 100 110 120 



130 140 150 160 170 160 

m006-l .pep PIAATSWSIFGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLNNSLERD 
I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I ! I I I I I 
g 00 6-1 PIAATSWSI FGAC I MLLVLEFWVGVSAVG I LALFLWLLPRFAA I SEN LY FRLNNSLERD 

130 140 150 160 170 180 



190 200 210 220 230 240 

m00 6-l .pep NHFIRKGDRRQLYRHYGLLARLRVLISNREAFGYLCVGTAMGILFGFAFVMMTLKGYSSA 
t I ! I i I i I : ! I I I M I M : : M I ! I { \[ I 11 I I I I I M : M If i I I 1 I M ( ! I I I I I : i ( 
g00 6-l NHFIRKGDERQLYRHYGLV3RLRVLISNREAFGYLCVGAAMGILFGFAFVMMTLKGYGSA 

190 200 210 220 230 240 



250 260 210 280 289 

m006-l .pep GHVYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
I I : i I I I I ! II 11 I 1 II I II I i I I I ) II M M I It 1 It \ \ I I I I I I I I I 
g00 6-l GHIYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
250 260 270 280 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 35>: 

a006-l.seq (partial) 

1 . . AGCCAAAACC ACCGCAAGCG ATTGATTGGC ACATTTTTTC TGGTCGGACT 

51 GGAAAACCTT TTGATGCTGG TGTATCCGGT GTTTGGCGGC TGGGCGATTA 

101 ATGCCGTGAT TGCGGGGCAG GCGTGGCAGG CGTTGCTGTA CGCTTTGGTT 

151 GTGCTTTTGA TGTGGCTGGT CGGTGCGGCG CGGCGGATTG CCGATACGCG 

201 CACGTTTACG CGGATTTATA CCGAAATCGC CGTGCCGGTT GTGTTGGAAC 

251 AGCGGCAGCG GCAAGTCCCG CATTCGGCGG TAACTGCGCG GGTTGCCCTG 

301 TCGCGTGAGT TTGTCAGCTT TTTTGAAGAA CACCTGCCGA TTGCCGCGAC 

3 51 ATCCGTCGTA TCCATATTCG GCGCGTGCAT CATGCTGCTG GTGCTGGAAT 

4 01 TTTGGGTCGG CGTGTCGGCG GTGGGCATAC TTGCGTTGTT TTTATGGCTT 
4 51 TTGCCACGTT TTGCCGCCAT CAGCGAAAAC CTGTATTTCC GCCTGAAGAA 
501 CAGCTTGGAA CGCGACAACC ACTTTATCCG AAAAGGCGAC GAGCGGCAGC 
551 TGGACCGCCA TTACGGACTG CTTGCGCGCC TGCGTGTGCT GATTTCCAAC 
601 CGCGAAGCCT TCGGCTATCT CTGCGTCGGC ACGGCGATGG GTATTTTGTT 
651 CGGCTTTGCT TTTGTGATGA TGACGCTCAA AGGCTACAGC AGCGCGGGGC 
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7 01 ATGTCTATTC GGTCGGCACT TATCTGTGGA TGTTTGCCAT AAGTTTGGAC 
7 51 GACGTGCCGC GATTGGTCGA ACAATATTCC AATTTGAAAG ACATCGGACA 
801 ACGGATAGAG TGGTCGAAAC GGAACATCAA AGCCGGAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 36; ORF 006- La>: 

aOOo-I.pep {partial) 

1 . . SQNHRKRLIG TFF1VGLENL LMLVYPVFGG WAINAVIAGQ AW QALLYALV 

51 VLLMWLVGAA RRIADTRTFT RIYTEIAVPV VLEQRQRQVP HSAVTARVAL 

101 SRSFVSFFEE HLPIA ATSW 51 FGACIMLL VL EF WVGVSA VGILALFLWL 

151 LPRFAAISEN LYFRLKNSLE RDNHFIRKGD ERQLDRHYGL LARLRVLISN 

201 REAFGY LCVG TAMGILFGFA PVH .MTLKGYS SAGHVYSVGT YLWMFAISLD 

251 DVPRLVEQYS NLKDIGQRIE WSKRNIKAGT * 

a006-l/m006-l 95.7% identity in 280 aa overlap 

10 20 30 40 50 

a006-l .pep SQNHRKRLIGTFFLVGLENLLMLVYPVFGGWAINAVIAGQAWQALLYALWL 
: I : I I I I I I I I I I I I I i I I II I ( I I I I I I I I I I ! I I I : : I I I I I I I I I I I 
m 00 6-l MWKMLKHIAQTHRKRLIGTFSLVGLENLLMLVYPVFGGRAINAVIAGEVWQALLYALWL 
10 20 30 40 50 60 



60 70 80 90 100 110 

a006-l .pep LMWLVGAARRIADTRT FTRIYTEIAVPVVLEQRQRQVPKSAVTARVALSREFVSFFEEHL 
I I I ! I I I : i I I I i I i II I I I I i I I I i I I I I I I I I i ! I I ! i I I I I I i I I i I I I I I I I I ! I I 
m006-l LMWLVGAVRRIADTRTFTRIYTEIAVPVVLEQRQRQVPHSAVTARVAL.SREFVSFFEEKL 
70 80 90 100 110 120 



120 130 140 150 160 170 

a006-l .pep PIAATSWSIFGACIMLLVLEFWVGVSAVGILALFLWLLPRFAAISENLYFRLKNSLERD 
I I I I I I I I I I I I I II I I I I i I I I I I I I I I M I 1 I I I t I I I t I I ! I I I I i ! I II : I I I f I I 
m 006-l PI AAT S VV S I FG AC IMLLVLE FW VG V S A VG ILALFLWLLPR F AA ISENLYFRLNNSLERD 

130 140 150 160 170 180 



180 190 200 210 220 230 

a006-i.pep NHFIRKGDERQLDRHYGLLARLRVLISNREAFGYLCVGTAMGILFG FAFVMMTLKGYSSA 
I 11 | II I t : 1 11 ! 1 I I I II i I II M I II II 1 I 1 ! II I i II 1 i 1 i I II M I M I I I I 1 I I 
m 006-l NHFIRKGDRRQLYRHYGLLARLRVLISNREAFGYLCVGTAMGILFG FAFVMMTLKGYSSA 

190 200 210 220 230 240 



240 250 260 270 280 

a006-l .pep GHVYSVGTYLWMFAISLDDVPRLVEQYSNLKDIGQRIEWSKRNIKAGTX 
I I ! | t \ i I I t U ! \ : 1 I I ! I I M I I I M I M 1 I 1 I 1 i I I I : t I I 1 I I \ t 
m006-l GHVYSVGTYLWMFAMSLDDVPRLVEQYSNLKDIGQRIEWSERNIKAGTX 
250 260 270 280 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 37>; 

g007 . seq 

1 atgaACACAA CCCGACTGCC GACCGCCTTC ATCTTGTGCT GCCTCTGcgC 

51 CGCcGCTTCT GCCGccgaca acAGCatcat gaCaAAAGGG CAAAAAGTGT 

101 ACGAATCcAa ctGCATCGCC TGCCACGGCA AGAAAGGGGA AGGGCGCGGC 

151 ACTGCGtTTC CTccgctTTT CCggtcgGac tgtattatga acaAACCGCa 

201 cgTCCtgctg cacagcatgg tcaaaggcAt cgacgggaca ttcaaagtgg 

251 agcggcaaaa cctacgacgg atttatgCcc gcaaccgcca tcagcgATGC 

301 GGACATTGCC GCCGTCGCCA CTTATATCAT GAACGCCTTT GA 

This corresponds to the amino acid sequence <SEQ ID 38; ORF 007. ng>: 

g007 .pep 

1 MNTTRLPTAF ILCCLCAAAS A ADNSXMTKG QKVYESMCIA CHGVKGEGRG 

51 TAFPPLFRSD CIMNKPHVLL HSMVKGIDGT FKVERQNLRR IYARNRHQRC 

101 GHCRRRHLYH ERL V 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 39>: 
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m007 . seq 

i ATGAACACAA CCCGACTGCC 

51 CGCCGCTTCT GCCGCCGACA 

101 ACGAATCCAA CTGCGTCGCC 

151 ACCATGTTTC CGCCGCTCTA 

201 GGTGCTGCTG CACAGCA7GG 

2 51 AACGGCAAAA CCTACAACGG 

301 GGACATTGCC GCCGTCGCCA 



GACCGCCCTC GTCTTGGGCT GCTTCTGCGC 
ACAGCATCAT GACAAAAGGG CAAAAAGTGT 
TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 
CCGCTCCGAC TTCATCATGA AAAAACCGCA 
TCAAAGGCAT CAACGGTACA ATCAAAGTC. 
ATTCATGCCC GCAACCGCCA TCAGCGATGC 
CTTATATCAT GAACGCCTTT GA 



This corresponds to the amino acid sequence <SEQ ID 40; ORF 007>: 

m007 . pep 

1 MNTTRLPTAL VLGCFCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 

51 TMFPPLYR5D FIMKKPQVLL KSMVKGINGT IKVXRQNLQR IHARNRHQRC 

101 GHCRRRKLYH ERL* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 41>: 

a007 . seq 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCCTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 

151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 

201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTC . 

2 51 .AACGGCAAAA CCTACAACGG ATTCATGCCC GCCACTGCCA TCAGCGATGC 

301 GGACATTGCC GCCGTCGCCA CTTATATCAT GAACGCCTTT GA 

This corresponds to the amino acid sequence <SEQ ID 42; ORF 007. a>: 

a007 .pep 

1 MNTTRLPTAL VLGCLCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 

51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVXRQNLQR IHARHCHQRC 

101 GHCRRRHLYH ERL* 



m0O7/a007 97.3% identity over a 1 13 aa overlap 



10 20 30 40 50 60 

m007 . pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
I M t I It I \ I I I I I : I I M i t I \ \ III II I I II I I t I i M I i t i I ! I M I 1 ! I t I i \ \ M 
a 007 MNTTRLPTALVLGCLCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 

10 20 30 40 50 60 



70 80 90 100 110 

m007 .pep FIMKKPQVLLHSMVKGINGTIKVXRQNLQRIHARNRHQRCGHCRRRHLYHERLX 
! ! I I I I I II II I I I I I i I I I I I I i I M I I I I I I ! : I I I I ! I I I I M I I I I ! M 
a007 FIMKKPQVLLHSMVKGINGTIKVXRQNLQRIHARHCHQRCGHCRRRHLYHERLX 
70 80 90 100 110 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 007 shows 86.7% identity over a 1 13 aa overlap with a predicted ORF (ORF 007.ng) 
from N. gonorrhoeae: 

m007/g007 

10 20 30 40 50 60 

m007 . pep MNTTRLPTALVLGCFCAAASAADNSIMTKGQKVYESNC^ACHGKKGEGRGTMFPPLYRSD 
I I I i I ! I I I : : I I : t I I I I I I I I I ! I I I I I I I M I I I : I I I I I II I II I ! 1111:111 
g007 MNTTRLPTAFILCCLCAAASAADNSIMTKGQKVYESNCIACHGKKGEGRGTAFPPLFRSD 

10 20 30 40 50 60 

70 80 90 100 110 

m007 . pep FIMKKPQVLLHSMVKGINGTIKVXRQNLQRIHARNRHQRCGHCRRRHLYHERLX 
I I : I I : I I I I I I I I I I : I I : I I I I I I : I I : II I I II I I I I I ! M I I If I I I 
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q 0 0 1 C I MN K P H V L L H S M V KG I DG T FKVERQN LRR I Y ARM R H QRCG HC RRR K L Y HE R L 

70 80 90 100 110 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 43>: 

g007-l.seq (partial) 

1 ATGAACACAA CCCGACTGCC GACCGCCTTC ATCTTGTGCT GCCTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCATCGCC TGCCACGGCA AGAAAGGGGA AGGGCGCGGC 

151 ACTGCGTTTC CTCCGCTTTT CCGGTCGGAC TATATTATGA ACAAACCGCA 

2 01 CGTCCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 

2 51 ACGGCAAAAC CTACAACGGA TTCATGCCCG CAACCGCCAT CAGCGATGCG 
301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 

3 51 CGGAAGCGTT ACCGAAAAAG ACGTAAAACA GGCAAAAGGC AAAAAAAAC . 

This corresponds to the amino acid sequence <SEQ ID 44; ORF 007-1 .ng>: 

g007-l.pep (partial) 

1 MNTTRLPTAF ILCCLCAAAS A ADNSIMTKG QKVYESNCIA CHGKKGEGRG 

51 TAFPPLFRSD YIMNKFHVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 

101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKG KKN . . . 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 45>: 

m007-l . seq 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCTTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 

151 ACCATGTTTC CGCCGCTCTA CCGCTCCGAC TTCATCATGA AAAAACCGCA 

2 01 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 

2 51 ACGGCAAAAC CTACAACGGA TTCATGCCCG CAACCGCCAT CAGCGATGCG 
301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 

3 51 CGGAAGCGTT ACCGAAAAAG ACGTAAAACA GGCAAAAAGC AAAAAAAACT 
401 AA 

This corresponds to the amino acid sequence <SEQ ID 46; ORF 007- 1> 

m007-l . pep 

1 MNTTRLPTAL VLGCFCAAAS A ADNSIMTKG QKVYESNCVA CHGKKGEGRG 

51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 

101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKS KKN* 

moo7-i / g007-i 9 1 .7% identity in 1 33 aa overlap 

10 20 30 40 50 60 

m007-l . pep MNTTRLPTALVLGCFCAAASAADMSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
I I I I I I I I I : : I \ : I I ! 1 t I I I I II M I I I I ! I ! I I 1 : I I I I I I I i I i I I I I I I : I I I 
g 007-l MNTTRLPTAFILCCLCAAASAADNSIMTKGQKVYESNCIACHGKKGEGRGTAFPPLFRSD 
10 20 30 40 50 60 

70 30 90 100 110 120 

m007-l .pep FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADI AAVATYIMNAFDNGGGSV 
: I I : I ( : I I I I t M I I I I I I I M I I I I I I I I II ! I I I I I ! I I I M I M II I I I I I I I i I I 
g007-l Y IMNKPHVLLHSMVKGINGT I KVNGKTYNGFMPATAISDADI AAVATYIMNAFDNGGGSV 

70 80 90 100 110 120 

130 

m007-l .pep TEKDVKQAKS KKNX 

I I II I I I I I : I I I 
g007-l TEKDVKQAKGKKN 
130 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 47>: 

a007-l.seq (partial) 

1 ATGAACACAA CCCGACTGCC GACCGCCCTC GTCTTGGGCT GCCTCTGCGC 

51 CGCCGCTTCT GCCGCCGACA ACAGCATCAT GACAAAAGGG CAAAAAGTGT 

101 ACGAATCCAA CTGCGTCGCC TGCCACGGCA AAAAGGGCGA AGGCCGCGGA 
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151 ACCATGTTTC CGCCGCT^.A CCGCTCCGAC TTCATCATGA AAAAACCGCA 

201 GGTGCTGCTG CACAGCATGG TCAAAGGCAT CAACGGTACA ATCAAAGTCA 

2 51 ACGGCAAAAC CTACAACGGA TTCATGCCCG CCACTGCCAT CAGCGATGCG 
301 GACATTGCCG CCGTCGCCAC TTATATCATG AACGCCTTTG ACAACGGCGG 

3 51 CGGAAGCGTT ACCGAAAAAG ACGTAAAACA G G C AAAAAA C AAAAAA . . 

This corresponds to the amino acid sequence <SEQ ID 48; ORF 007- La>: 

a007-l.pep (partial) 

1 MNTTRLPTAL VLGCLCAAAS A ADNSIMTKG QPCVYE3NCVA CHGKKGEGRG 
51 TMFPPLYRSD FIMKKPQVLL HSMVKGINGT IKVNGKTYNG FMPATAISDA 
101 DIAAVATYIM NAFDNGGGSV TEKDVKQAKN KK . . 

moo7-i/a007-i 98.5% identity in 132 aa overlap 

10 20 30 40 50 60 

m007-l .pep MNTTRLPTALVLGCFCAAASAADNS IMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 
I I I I I I I I I I f I I t : I f I I I i I I I I I I I I I I I I I I I 1 I I I I I I I i I I I I I I I I i I I I I ! [ 
a007-l MNTTRLPTALVLGCLCAAASAADNSIMTKGQKVYESNCVACHGKKGEGRGTMFPPLYRSD 

10 20 30 40 50 60 

70 SO 90 100 110 120 

m007-l.pep FIMKKPQVLLHSMVKG1NGTIKVNGKTYNGFM PATAISDADI AAVATYIMNAFDNGGGSV 
I I I I 1 I I I I I I I I i I I I I I i I I t I I I I I I I I I I I M ! I I I I II ! i I I I 1 I I I 1 M I I I t i 
a007-l FIMKKPQVLLHSMVKGINGTIKVNGKTYNGFMPATAISDADIAAVATYIMNAFDKGGGSV 

70 80 90 100 110 120 

130 

m007-l.pep TEKDVKQAKSKKNX 

I i I I I I I I I : I I 
a 007-1 TEKDVKQAKNKK 
130 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 49>: 

g008 . seq 

1 ATGAACAACA GACATTTTGC CGTCAtcgCC TTGGGCAGCA ACCTTGACAA 

51 CCCCGCACAA CAAATacgcg gcgcattaga cgcgctctcg tcccatcctg 

101 acatccggct tgaaCaggtt tcctcactgt aTatgaccgc acctgtcggt 

151 tacgAcaaTC agcccgATTT CATCaatgcc gTCTgcaccg TTTCCACCAC 

201 CtTGGACGGC ATTGcccTGC TTGCCgaACT CAAccgTATC GAAGCCGATT 

251 TCGGACGCGA aCGCAGTTTC CGCAATGCAC CGCGCACATT GGATTTGGAC 

301 ATT AT CG ACT TTGACGGCAT CTCCAGCGAC GACCCCCGCC TTACCCTGCC 

3 51 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATACGCCCT TTGGCAGAAA 
401 TCCTCCCTGA TTTTATTTTG GGAAAATACG GAAAGGTTGT CGAATTGTCA 

4 51 AAACGGCTGG GCAATCAAGG CATCCGTCTT TTACCGGACA GGTAA 

This corresponds to the amino acid sequence <SEQ ID 50; ORF 008. ng>: 

g008 .pep 

1 MNNRHFAVIA LGSNLDNPAQ QIRGALDALS SHPDIRLEQV SSLYMTAPVG 

51 YDNQPDFINA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLDLD 

101 IIDFDGISSD DPRLTLPHPR AHERSFVIRP LAEILPDFIL GKYGKWELS 

151 KRLGMQGIRL LPDR* 

The following partial DNA sequence was identified in TV. meningitidis <SEQ ID 51>: 

m008 . seq 

1 ATGAACAACA GACATTTTGC CGTCATCGCC CTGGGCAGTA ATCTTGAAAA 

51 CCCTGCTCAA CAGGTACGCG CCGCATTGGA CACGCTGTCG TCCCATCCTG 

101 ACATCCGTCT TAAACAGGCT TCCTCACTGT ATATGACCGC GCCCGTCGGT 

151 TACGACAATC AGCCCGATTT TGTCAATGCC GTCTGCACCG TTTCCACCAC 

201 TCTGGACGGC ATTGCCyTGC TTGCCGAACT CAACCGTATC GAGGCTGATT 

2 51 TCGGACGCGA ACGCAGCTTC CGCAACGCGC CGCGCACATT GkATTTGGAC 

301 ATTATCGACT TTGACGGCAT CTCCAGCGAC GACACsCGAC TcACCtTGCC 
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1S2 



351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATCCGCCCT 7TGGCAGAAA 
4 01 TCCTCCCTGA TTTTGTTTTA GGAAAACACG GAAAGGTTGC CGAATTGTCA 
4 51 AAACGGyTGG GCAATCAAGG TATCCGTCTT TTACCGGACA GGTAATT 

This corresponds to the amino acid sequence <SEQ ID 52; ORF 008>: 

m008 .pep 

I MNNRHFAVIA LGSNLENPAQ QVRAALDTLS SHPDIRLKQA SS LYMTAPVG 

51 YDNQPDFVNA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLXLD 

101 IIDFDGISSD DTRLTLPHPR AHERSFVIRP LAEI LPDFVL GKHGKVAELS 

151 KRLGNQGIRL LPDR* 

The following partial DNA sequence was identified in N. meningitidis<SEQ ID 53>: 

a008 . seq 

{ ATGAACAACA GACATTTTGC CGTCATCGCC CTGGGCAGTA ATCTTGAAAA 

51 CCCTGCCCAA CAGGTACGCG CCGCATTGGA CACGCTGTCG TCCCATCCTG 

101 ACATCCGTCT TAAACAGGCT TCCTCACTGT ATATGACCGC GCCCGTCGGT 

151 TACGACAATC AGCCCGATTT CGTCAATGCC GT r 7GCACCG TTTCCACCAC 

201 CTTGGACGGC ATTGCCCTGC TTGCCGAACT CAaCCGTATC GAAGCCGATT 

251 TCGGACGCGA ACGCAGCTTC CGCAACGCGC CGCGCACATT GGATTTGGAC 

301 ATTATCGACT TTGACGGCAT CTCCAGCGAC GACCCCCGAC TCACCCTGCC 

351 GCATCCGCGC GCGCACGAAC GCAGTTTCGT CATACGCCCT TTGGCAGAAA 

4 01 TCCTCCCTGA TTTTATTTTG GGAAAACACG GAAAGGTTGC CGAATTGTCA 

4 51 AAACGGCTGG GCAATCAAGG CATCCGTCTT TTACCGGATA AGTAA 

This corresponds to the amino acid sequence <SEQ ID 54; ORF 008. a>: 

a008 .pep 

1 MNNRHFAVIA LGSNLENPAQ QVRAALDTLS SHPDIRLKQA SS LYMTAPVG 

51 YDNQPDFVNA VCTVSTTLDG IALLAELNRI EADFGRERSF RNAPRTLDLD 

101 IIDFDGISSD DPRLTLPHPR AHERSFVIRP LAEILPDFIL GKHGKVAELS 

151 KRLGNQGIRL LPDK* 

m008/aooe 97.6% identity over a 164 aa overlap 

10 20 30 40 50 60 

m008.pep MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPD I RLKQASS LYMTAPVG YDNQPDFVNA 

1 1 1 1 m 1 1 1 i 1 1 it m n 1 1 1 n 1 1 m i m i i s n i n i w \ \ w i \ \ \ w 1 1 w w \ \ 

aOOS MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVG YDNQPDFVNA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m008 .pep VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRT LXLDI IDFDG1SSDDTRLTLPHPR 

I I I ! I I M I I I I I I I i I I I I I I i I I I I I 1 I I I I I I 1 I I I I I I I I i I I I I I I t M I I ! I 
a008 VCTVSTTLDG IALLAELNRIEADFGRERSFRNAPRTLDLDI I DFDGISSDDPRLTLPHPR 

70 80 90 100 110 120 

130 140 150 160 

m008 , pep AHERSFVIRPLAEILPDFVLGKHGKVAELSKRLGNQGIRLLPDRX 
I I ! I I M I I ! ! I I I I I I I : I i I t I i I I I I I I I I i I I I M I I I I : t 
a 008 AHERSFVIRPLAEILPDFILGKHGKVAELSKRLGNQGIRLLPDKX 

130 140 150 160 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 008 shows 92.7% identity over a 164 aa overlap with a predicted ORF (ORF008.ng) 
from N. gonorrhoeae: 

m008/g008 

10 20 30 40 50 60 

rnOOe .pep MNNRHFAVIALGSNLENPAQQVRAALDTLSSHPDIRLKQASSLYMTAPVGYDNQPDFVNA 
M I I M i I i 1 I I I ! I : i I I ! I : I : I j I : I [ | M | | i j : | : | |j I I ! I I I I I I ! I I I ! : I I 
g008 MNNRHFAVIALGSNLDNPAQQIRGALDALSSHPDIRLEQVSSLYMTAPVGYDNQPDFINA 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m008 .pep VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLXLDI IDFDGISSDDTRLTLPHPR 

I I I I N I I I I t I t f I I I I M I II I I I I f I I I I ' I i ! I | I ! I | | | | | [ | | ( | | | | | | | | 
g008 VCTVSTTLDGIALLAELNRIEADFGRERSFRNAPRTLDLDIIDFDGISSDDPRLTLPHPR 

70 SO 90 100 110 120 

130 140 150 160 

m008 .pep AHERSFVIRPLAEILPDFVLGKHGKVAELSKRLGNQGIRLLPDRX 
I I I M I I I I I M I I I i I I : I 1 I : I I I : i I I I i I I I I I I I II M I | 
g008 AHERSFVIRPLAEI LPDFILGKYGKWELSKRLGNQGIRLLPDRX 

130 140 150 160 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 55>: 

g009. seq 

1 ATGCCCCGCG CTGCCGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GCGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGAA ACCAGTCCGT AATGGCGGTA 

151 CAGCTGCCGC TCGTCGCCTT TTCGGATAAA GTGGTTGTcg cGTTCCAAGC 

2 01 TGTTGTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

2 51 AaaaGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 56; ORF 009.ng>: 

g009 .pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARNQSVMAV 
51 QLPLVAFSDK WVAFQAVVQ AEIQVFA DGG KTWQKP* 

The following partial DNA sequence was identified in K meningitidis <SEQ ID 57>: 

m009 . seq 

1 ATGCCCCGCG CTGCTGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC 

51 CGAACAAAAT ACCCATCGCC GTGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGCA AGCAGTCCGT AATGGCGGTA 

151 CAGCTGCCGC CGGTCGCCTT TTCGGATAAA GTGGTTGTCG CGTTCCAAGC 

201 TGTTGTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

251 AAAAGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 58; ORF 009>: 

m009 . pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARKQSVMAV 
51 QLPPVAFSDK WVAFQAVVQ AEIQVFA DGG KTWQKP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 009 shows 97.7% identity over a 86 aa overlap with a predicted ORF (ORF 009. ng) 
from N. gonorrhoeae: 

m009/g009 

10 20 30 40 50 60 

m00 9.pep MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQHTQARKQSVMAVQLPPVAFSDK 
M I I I t I I I I I I M I I I II I M I II I I I I I I I I I I M i I M I j : I | j | i | i M MINI 
g00 9 MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNgHTQARNQSVMAVQLPLVAFSDK 

10 20 30 40 50 60 

70 30 
m009 . pep WVAFQAWQAE IQVFADGGKTWQKPX 

I I I I I i I I I I I I I I I I I I I I I I I I I I I 
g 0 0 9 WVAFQAWQAE IQVFADGGKTWQKPX 

70 80 
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The following partial DNA sequence was identified in A ; . meningitidis <SEQ ID 59>: 

aOOS . seq 

1 ATGCCCCGCG CTGCTGTAGC CTTTGAGCGT CATCATCACA AAAGCAAAGC 

Si CGAACAAAAT ACCCATCGCC GTGCCGACGC AGAGATAGCC GAAGGCTTCG 

101 CGGTTGGAAA TCAGCACACG CAGGCGCGCA AGCAGTCCGT AATGGCGGTC 

151 CAGCTGCCGC TCGTCGCCTT TTCGGATAAA GTGGTTGTCG CGTTCCAAGC 

201 TGTTCTTCAG GCGGAAATAC AGGTTTTCGC TGATGGCGGC AAAACGTGGC 

2 51 AAAAGCCATA A 

This corresponds to the amino acid sequence <SEQ ID 60; ORF 009. a>: 

a009.pep 

1 MPRAAVAFER HHHKSKAEQN THRRADAEIA EGFAVGNQHT QARKQSVMAV 
51 QLPLVAFSDK VWAFQAVLQ AEIQVFA DGG KTWQKP* 

m009/a009 97.7% identity over a 86 aa overlap 

10 20 30 40 50 60 

m0Q9 .pep MPRAAVAFERHHHKSKAEQNTHRRADAEIAEG" ' "NQHTQARKQSVMA7QLPPVAFSDK 

I I I I i I I I I I I I II I I I I I I I I I I I I I I i I I 1 ! - I I I I I I I I I M I I I i I I I I [Mill 
a 009 MPRAAVAFERHHHKSKAEQNTHRRADAEIAEGFAVGNQKTQARKQSVMAVQLPLVAFSDK 

10 20 30 40 50 60 

70 80 
m009 . pep VVVAFQAVVQAEIQVFADGGKTWQKPX 
I I ! I M I I : I I I I I I I I I I M i I If t I 
a 009 W V A FQA V L Q AE IQVFADGGKTWQKPX 

70 80 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 61>; 

g010 . seq 

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGCGG 

51 TGCAGGTTTA CGTGCAGCCC TCCAATTATC CAAATCCGGT TTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGCT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAGGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGCTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGTCGC GCTGCGCCTG AAGCGGTGAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGCGTTGAA AGCGGCAAAA TTTATCAGCG 

351 TCCTTTCGGC GGACATACTG CCGAACATGG TAAACGTGCG GTAGAACGTG 

4 01 CATGTGCGGT TGCCGACCGT ACCGGTCATG CGATGTTGCA TACTTTGTAC 

4 51 CAACAAAACG TCCGTGCCAA TACACAATTC TTTGTGGAAT GGACGGCGCA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC GGGCGAAGTT TATATTTTCC ACGCCAAGGC CGTGATGTTT 

601 GCTACCGGTG GCGGCGGTCG TATTTATGCT TCTTCTACCA ATGCTTATAT 

651 GAATACCGGT GACGGTTTGG GCATTTGCGC CCGTGCGGGC ATTCCGTTGG 

7 01 AAGATATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC GGGTGCGGGC 

7 51 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAacgc 
801 cgacggcgaA cgcTTTATGG AAcgctatgc GCcgACCGta aAagaCTTGG 

8 51 CTTCTCGCga cgtGGTTTCA CgcgcGatgG CGatggaAAt ctatgaaggt 
901 cgcggctgTG GtaaAAAcaA agaCCacgtC TTACTGAAAA TCGACcAt At 
951 cggtGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCagtttgc cGGTATCGAT CCGATTAAAG ACCCGATTcc ggttgTGCCG 

1051 ACTACCCACT ATATGATGGG CGGCATTCcg aCCAATTATC ACGGTGAAGT 

1101 TGTTGTTCCG CAAGGCGACG AGTACGAAGT ACCTGTAAAA GGCCTGTATG 

1151 CCGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGTTTGGGT 

1201 ACGAACTCCC TGCTGGACTT GGTGGTGTTC cgcccaaccc cccggtga 

This corresponds to the amino acid sequence <SEQ ID 62; ORF 010.ng>: 

g010 .pep 
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1 MGFPVRKFDA 

51 GGISASLGNV 

101 HMGMPFDRVE 

151 QQNVRANTQF 

201 ATGGGGRIYA 

251 VLITEGVRGE 

301 RGCGKNKDHV 

3 51 TTHYMMGGIP 

4 01 TNSLLDLWF 



VI VGGGGAuL 
QEDRWDWHMY 
SGKIYQRPFG 
FVEWTAQDLI 
SSTNAYMNTG 
GGI LLNADGE 
LLKIDHIGAE 
TNYHGEWVP 
RPTPR* 



RAALQLSKSG 
DTVKGSDWLG 
GHTAEHGKRA 
RDENGDWGV 
DGLGICARAG 
RFMERYAPTV 
KIMEKLPGIR 
QGDEYEVPVK 



LNCAVLSKVF 
DQDAIEFMCR 
VERACAVADR 
TAMEMETGEV 
I PLEDMEFWQ 
KDLASRDVVS 
EI S IQFAGID 
GLYAAGECAC 



PTRSKTVAAQ 
AAPEAVIELE 
TGHAMLHTLY 
YIFKAKAVMF 
FHPTGVAGAG 
RAMAMEI YEG 
PIKDPIPWP 
ASVKGANRLG 



The following partial DNA sequence was i 



mOlO . seq 


(PARTIAL) 




1 


. . nTCCAATTAT 


CCAAATCCGG 


51 


CCCGACCCGT 


TCGCATACCG 


101 


TGGGTAATGT 


GCAGGAAGAC 


151 


AAAGGTTCCG 


ACTGGTTGGG 


201 


CGCCGCGCCT 


GAAGCCGTAA 


251 


ACCGTGTGGA 


AAGCGGTAAA 


301 


GCCGAACACG 


GTAAACGCGC 


351 


TACAGGTCAT 


GCGATGCTGC 


401 


ATACGCAATT 


CTTTGTGGAA 


451 


AACGGCGATG 


TCGTCGGCGT 


501 


TTATATTTTC 


CACGCTAAAG 


551 


GTATTTATGC 


GTCTTCTACC 


601 


GGTATTTGTG 


CGCGTGCAGG 


651 


ATTCCAGCCG 


ACCGGCGTGG 



aitified in N. meningitidis <SEQ ID 63>: 

TCTGAATTGT GCCGTTTTGT CTAAAGTGTT 
TAGCGGCGCA GGGCGGTATT TCCGCCTCTn 
CGTTGGGACT GGCACATGTA CGATACCGTG 
CGACCAAGAT GCGATTGAGT TTATGTGCCG 
TTGAGTTGGA ACACATGGGT ATGCCTTTTG 
ATTTATCAGC GTCCTTTCGG CGGCCATACT 
GGTAGAACGC GyCTGTGCGG TTGCCGACCG 
ATACTTTGTA CCAACAAAAC GTCCGTGCCA 
TGGACGGCAC AAGATTTGAT TCGTGATGAA 
AACCGCCATG GAAATGGAAA CCGGCGAAgT 
CTGTGATGTT TGCTACCGGC GGCGGCGGTC 
AATGCCTATA TGAATACCGG CGATGGTTTG 
TATCCCGTTG GAAGACATGG AATTCTGGCA 
CGGGTGCGGG CGTGTTGATT ACCGAA . . . . 



This corresponds to the amino acid sequence <SEQ ID 64; ORF 010>: 

mOlO.pep (PARTIAL) 

1 . . XQLSKSGLNC AVLSKVFPTR SHTVAAQGGI SASXGNVQED RWDWHMYDTV 

51 KGSDWLGDQD AIEFMCRAAP EAVIELEHMG MPFDRVESGK IYQRPFGGHT 

101 AEHGKRAVER XCAVADRTGH AMLHTLYQQN VRANTQFFVE WTAQDLIRDE 

151 NGDVVGVTAM EMETGEVYIF HAKAVMFATG GGGRIYASST NAYMNTGDGL 

201 GICARAGI PL EDMEFWQFQP TGVAGAGVLI TE . . . 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 65>: 

a010 . seq 

1 ATGGGCTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCANCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

351 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACGCGCG GTAGAACGCG 

4 01 CCTGTGCNGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC 

4 51 CAACAAAATG TCCGTGCCAA TACGCAATTC TTTGTGGAAT GGACGGCACA 

501 AG ATT TG ATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC CGGCGAAGTT TATATTTTCC ACGCTAAAGC TGTGATGTTT 

601 GCTACCGGCG GCGGCGGCCG TATTTATGCG TCTTCTACCA ATGCCTATAT 

651 GAATACCGGC GATGGTTTGG GTATTTGTGC GCGTGCAGGT ATCCCGTTGG 

7 01 AAGACATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC AGGTGCGGGC 

7 51 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAATGC 
801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

8 51 CTTCTCGCGA CGTTGTTTCC CGCGCGATGG CGATGGAAAT CTACGAAGGT 
901 CGCGGCTGCG GTAAAAACAA AGACCATGTC TTACTGAAAA TCGACCATAT 

9 51 CGGCGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 
1001 TTCAGTTCGC CGGTATCGAT CCGATTAAAG ACCCGATTCC CGTTGTGCCG 
1051 ACTACCCACT ATATGATGGG CGGTATTCCG ACCAACTACC ATGGCGAAGT 
1101 TGTCGTTCCT CAAGGCGACG AATACGAAGT GCCTGTAAAA GGTCTGTATG 
1151 CGGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGCTTGGGT 
1201 ACGAACTCCC TGCTGGACTT AGTGGTATTC GGTAAAGCTG CCGGCGACAG 
1251 CAT GAT T AAA TTCATCAAAG AGCAAAGCGA CTGGAAACCT TTGCCTGCTA 
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1301 ATGCCGGCGA ACTGACCCGC 

1351 GATGGTGAAA ACGTTGATGC 

14 01 ATTGCACGCC GGCGTGTTCC 

14 51 GAGAAGTCAT GGCGATTGCC 

1501 AAGAGCAAAG TGTGGAATAC 

1551 CCTAATTGAA GTGGCGAAAG 

1601 AATCACGCGG TGCGCACGCT 

1651 AACTGGATGA AACATACGCT 

1701 CAAACCGGTG CACACCAAGC 

1751 AGCGCGTTTA TTGA 

This corresponds to the amino acid sequence <SEQ ID 66; ORF 010.a>; 

aOlO.pep 

1 MGFPVRKFDA VIVGGGGAGL RAXLOLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GG I SASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI RDENGDVVGV TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KD^ASRDVVS RAMAMEIYEG 

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 

351 TTHYMMGGIP TNYHGEWVP QGDEYEVPVK GLYAAGECAC ASVHGANRLG 

401 TNSLLDLVVF GKAAGDSMIK FIKEQSDWKP LPANAGELTR QRIERLDNQT 

4 51 DGEMVDALRR ELQRSVQLHA GVFRTDEILS KGVREVMAIA ERVKRTEIKD 

501 KSKVWNTARI EALELDNLIE VAKATLVSAE ARKESRGAHA SDDHPERDDE 

551 NWMKHTLYHS DANTLSYKPV HTKPLSVEYI KPAKRVY* 

moio /aoio 98.7% identity over a 23 1 aa overlap 

10 20 30 

mOlO.pep XQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASXGNV 

I I I I f I I I I I I I I I I I I I I I I I ! I ! I I I I II I I 1 ! 
a010 MGFPVRKFDAVIVGGGGAGLRAXLQLSKSGLNCAVLSKVFPTRSHTVAAQGGI SASLGNV 

10 20 30 40 50 60 

40 50 60 70 80 90 

mOlO.pep QE DRWDWHMY DTVKGSDWLG DQDAIEFMCRAAPEAVIELEHMGMPFDRVE SGKIYQRPFG 

I 1 I I 1 I t t i I I I I I II I 1 I I I 1 I 1 i I I 1 1 1 I 1 I 1 1 t I I I I if i 1 11 I i I I I 1 I I I I M i I 
aOlO QE DRWDWHMY DTVKGSDWLGDQDAIEFMCRAAPEAVIELE HMGMPFDRVE SGKIYQRPFG 

70 80 90 100 110 120 

100 110 120 130 140 150 

mOlO . pep GHTAEHGKRAVERXCAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDL I RDENGDVVGV 

I ! I I I I I I I I I 1 I I I! I i I I I I I i I i I I i I I I I I I I I I I ! I I I If M I f f I I I i j I i 1 I 
aOlO GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRAMTQFFVEWTAQDLIRDENGDWGV 
130 140 150 160 170 180 

160 170 180 190 200 210 

mOlO.pep TAMEMETGE VY I FHAKAVMFATGGGGRI YAS STNAYMNTG DGLGI CARAG I PLE DME FWQ 

1 ! I I I I I I ! I I I t 1 ( I I I I I i t I t ! ! II i U M ! 11 ( ! i i i M I M I I ( I i ( I I I M I I I 
aOlO TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPLEDMEFWQ 
190 200 210 220 230 240 

220 230 
mOlO.pep FQPTGVAGAGVLITE 
t : I M I I I I I I I 1 I I 

aOlO FHPTGVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDWSRAMAMEIYEG 

250 260 270 280 290 300 



Computer analysis of this amino acid sequence gave the following results: 



186 



CAACGTATCG 
ATTGCGCCGC 
GTACTGATGA 
GAGCGTGTGA 
CGCGCGTATC 
CGACTTTGGT 
TCAGACGACC 
GTACCATTCA 
CTTTGAGCGT 



AGCGTTTGGA 
GAACTGCAAC 
GATTCTGAGC 
AACGTACCGA 
GAGGCTTTGG 
GTCTGCCGAA 
ATCCTGAGCG 
GATGCCAATA 
GGAATACATC 



CAATCAAACT 
GCTCCGTACA 
AAAGGCGTTC 
AATCAAAGAC 
AATTGGATAA 
GCACGTAAAG 
CGATGATGAA 
CCTTGTCCTA 
AAACCGGCCA 
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Homology with a predicted ORF from A'', gonorrhoeae 

ORF 010 shows 98.7% identity over a 231 aa overlap with a predicted ORF (ORF OlO.ng) 
from N. gonorrhoeae: 

mO 1 0 . pep/gO 1 0 . pep 

10 20 30 

mOiO .pep XQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASXGNV 

I I I I I I f I I I I 1 I I I ! I I I I I I I I I I I I I I 1! f ! I 
g 0 1 0 MG F P V R K F D A V I VGGGGAG LRAALQLS K S G LN C A V L S K V F PT R S H T V AA QGG ISASLGNV 

10 20 30 40 50 60 

40 50 60 70 80 90 

mOlO.pep QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 
I I ! M I j I I I I I I I I I I M I I I I I I II I I I I I I II ! I I I I (I M I i i I I I If It i i II ! f 
g010 QEDRWDWHMYDTVKGSDWLGDODAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 
70 80 90 100 110 120 

100 110 120 130 140 150 

mOlO.pep GHTAEHGKRAVERXCAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV 

I I I I I I I I I I I I I II I I I I I I I I I M I I I I I II I f I ! I I ! I I I I I M I I i i I II I I I 1 I 
gOlO GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDWGV 

130 140 150 160 170 180 

160 170 180 190 200 210 

mOlO .pep TAMEMETGEVY I FHAKAVMFATGGGGR I YAS STNAYMNTGDGLG I CARAG I PLE DME FWQ 

II I I I I I I I I I I II I I 1 I I I I I I I I I I I I I I I I I I M I N I I I I I I i I) I I 1 t I I I ! i I I 
gOlO T AMEMETGEV Y I FKAKAVM FATGGGGR I Y AS STN AYMNTG DG LG I CARAG I P LE DME FWQ 

190 200 210 220 230 240 

220 230 
mOlO.pep FQ PT G V AG AG V LITE 

I : I I !U i Ul 1 I i I 

gOlO FHP7GVAGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDWSRAMAMEI YEG 

250 260 270 280 290 300 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 67>: 

g010-l . seq. . 

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGCGG 

51 TGCAGGTTTA CGTGCAGCCC TCCAATTATC CAAATCCGGT TTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGCT CGCATACCGT AGCGGCGCAG 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAGGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGCTGGGC GACCAAGATG 

251 CGATTGAGTT TATGTGTCGC GCTGCGCCTG AAGCGGTGAT TGAGTTGGAA 

301 CACATGGGTA TGCCTTTTGA CCGCGTTGAA AGCGGCAAAA TTTATCAGCG 

351 TCCTTTCGGC GGACATACTG CCGAACATGG TAAACGTGCG GTAGAACGTG 

4 01 CATGTGCGGT TGCCGACCGT ACCGGTCATG CGATGTTGCA TACTTTGTAC 

4 51 CAACAAAACG TCCGTGCCAA TACACAATTC TTTGTGGAAT GGACGGCGCA 

501 AGATTTGATT CGTGATGAAA ACGGCGATGT CGTCGGCGTA ACCGCCATGG 

551 AAATGGAAAC GGGCGAAGTT TATATTTTCC ACGCCAAGGC CGTGATGTTT 

601 GCTACCGGTG GCGGCGGTCG TATTTATGCT TCTTCTACCA ATGCTTATAT 

651 GAATACCGGT GACGGTTTGG GCATTTGCGC CCGTGCGGGC ATTCCGTTGG 

701 AAGATATGGA ATTCTGGCAA TTCCACCCGA CCGGCGTGGC GGGTGCGGGC 

7 51 GTGTTGATTA CCGAAGGCGT ACGCGGCGAG GGCGGTATTC TGTTGAACGC 

801 CGACGGCGAA CGCTTTATGG AACGCTATGC GCCGACCGTA AAAGACTTGG 

851 CTTCTCGCGA CGTGGTTTCA CGCGCGATGG CG AT GGAAAT CTATGAAGGT 

901 CGCGGCTGTG GTAAAAACAA AGACCACGTC TTACTGAAAA TCGACCATAT 

951 CGGTGCAGAA AAAATTATGG AAAAACTGCC GGGCATCCGC GAGATTTCCA 

1001 TTCAGTTTGC CGGTATCGAT CCGATTAAAG ACCCGATTCC GGTTGTGCCG 

10 51 ACTACCCACT ATATGATGGG CGGCATTCCG ACCAATTATC ACGGTGAAGT 

1101 TGTTGTTCCG CAAGGCGACG AGTACGAAGT ACCTGTAAAA GGCCTGTATG 

1151 CCGCAGGTGA GTGCGCCTGT GCTTCCGTAC ACGGTGCGAA CCGTTTGGGT 
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1201 ACGAACTCCC TGCTGGACTT GGTGGTGTTC cgcccaaccc cccggtga 

This corresponds to the amino acid sequence <SEQ ID 68; ORF 010-l.ng>: 

g010-l . pep 

1 MGFPVRKFDA VI VGGGGAGL RAALQLSKSG U3CAVLSKVF PTRSH7VAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAPEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI RDENGDVVGV TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLNADGE RFMERYAPTV KDLASRDVVS RAMAMEIYEG 

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR E I S I Q FAG I D PIKDPIPWP 

351 TTHYMMGGIP TNYKGEWVP QGDEYEVPVK GLYAAGECAC ASVHGANRLG 

4 01 TNSLLDLVVF RPTPR* 

g010-l / P10444 

sp ) PI 044 4 | DHSA_ECOLI SUCCINATE DEHYDROGENASE FLAVOPROTE I N SUBUNIT 

gnl | PID| dl015210 (D90711) Succinate dehydrogenase, flavoprotein [Escherichia coli] gi | 1786942 
(AE000175) succinate dehydrogenase flavoprotein subunit [Escherichia coli) Length = 588 

Score = 1073 (495.6 bits), Expect = 6.7e-169, Sum P(2) = 6.7e-169 

Identities = 191/303 (63%) , Positives = 238/303 '78%) 



Query : 


1 


MGFPVRKFDAVI VXXXXXXXXXXXXXSKSGLNCAVLSKVFPTR ^ " W VAAQGG I SASLGNV 


60 






M PVR+FDAV++ S + SG CA+LSKVFPTRiTT, rAQGGI+ +LGN 




Sbjct : 


1 


MKLPVREFDAWIGAGGAGMRAALQISQSGQTCALLSKVFPTRSHTVSAQGGITVALGNT 


60 


Query : 


61 


QEDRWDVmMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 


120 






ED VJ +WHMYD7VKGSD + +GDQDAI E +KC +■ PEA++ELEHMG+PF R* + G+IYQRPFG 




Sbjct : 


61 


HEDNWEWHMYDTVKGSDYIGDQDAIEYMCKTGPEAILELEHMGLPFSRLDDGRIYQRPFG 


120 


Query : 


121 


GHT AE HGKRA VE RACAVAD RTGHAMLHTL YQQNVRANTQ FFVE WTA QD L I RDENGDVVGV 


180 






G + G PA ADRTG HA + LHTL YQQN + + +T F EW A DL+++++G WG 




Sbj ct : 


121 


GQS KN FGGEQAARTAAAADRTGHALLHTLYQQNLKNHTT I FS EWYALDLVKNQDGA WGC 


180 


Query : 


181 


TAMEMETGEVYI FHAKAVMFATGGGGRI YASSTNAYMNTGDGLG I CARAGI PLEDMEFWQ 


240 






TA-f +ETGEV F A+A + ATGG GRIY S +TNA+ +NTGDG +G + RAG+P++DME WQ 




Sbj Ct : 


181 


TALCIETGEVVYFKARATVLATGGAGRIYQSTTNAHINTGDGVGMAIRAGVPVQDMEMWQ 


240 


Query : 


241 


FH PTG VAG AGVL I TEGVRG EGG I LLNADG ER FMERYAPTVKD LAS RD WSRAMAME I YEG 


300 






FH PTG + AG AG V L + T EG RGEGG LLN GERFMERYAP KDLA RDW+R + + +EI EG 




Sbjct : 


241 


FHPTGIAGAGVLVTEGCRGEGGYLIjNKHGERFMERYAPNAKDLAGRDVVARSIMIEIREG 


300 


Query : 


301 


RGC 3 03 








RGC 




Sbj Ct : 


301 


RGC 3 03 




Score 


= 24 9 


(115.0 bits), Expect = 6.7e-169, Sum P(2) = 6.7e-169 




Identities ■ 


= 53/102 (51%), Positives = 62/102 (60%) 




Query : 


309 


HVLLKI DH IGAEKI MEKLPG I RE I S I QFAGXXXXXXXXXXXXTTHYMMGG I PTNYHGE W 


368 






H LK+DH+G E + +LPGI E+S FA T HYMMGGIPT G+ +■ 




Sbjct : 


310 


HAKLKLDHLGKEVLESRLPGILELSRTFAHVDPVKEPI PVI PTCHYMMGGI PTKVTGQAL 


369 


Query : 


369 


VPQGDEYEVPVKGLYAAGECACASVHGANRLGTNSLLDLWF 410 








+V V GL+A GE AC SVHGANRLG NSLLDLWF 




Sbjct : 


370 


TVNEKGEDVWPGLFAVGEIACVSVHGANRLGGNSLLDLWF 411 





The following partial DNA sequence was identified in N. meningitidis <SEQ ID 69>: 

mOlO-1 . seq. . 

1 ATGGGTTTTC CTGTTCGCAA GTTTGATGCC GTGATTGTCG GCGGTGGTGG 

51 TGCAGGTTTA CGCGCAGCCC TCCAATTATC CAAATCCGGT CTGAATTGTG 

101 CCGTTTTGTC TAAAGTGTTC CCGACCCGTT CGCATACCGT AGCGGCGCAg 

151 GGCGGTATTT CCGCCTCTCT GGGTAATGTG CAGGAAGACC GTTGGGACTG 

201 GCACATGTAC GATACCGTGA AAGGTTCCGA CTGGTTGGGC GACCAAGATG 

2 51 CGATTGAGTT TATGTGCCGC GCCGCGCCTG AAGCCGTAAT TGAGTTGGAA 
301 CACATGGGTA TGCCTTTTGA CCGTGTGGAA AGCGGTAAAA TTTATCAGCG 

3 51 TCCTTTCGGC GGCCATACTG CCGAACACGG TAAACG^ "CG GTAGAACGCG 

4 01 CCTGTGCGGT TGCCGACCGT ACAGGTCATG CGATGCTGCA TACTTTGTAC 
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4 51 CAACAAAACG TCCGTG^AA 

501 AGATTTGATT CGTGATGAAA 

551 AAATGGAAAC CGGCGAAGTT 

601 GCTACCGGCG GCGGCGGTCG 

651 GAATACCGGC GATGGTTTGG 

7 01 AAGACAT3GA ATTCTGGCAA 

151 GTGTTGATTA CCGAAGGCGT 

801 CGACGGCGAA CGCTTTATGG 

851 CTTCTCGCGA CGTTGTTTCC 

901 CGCGGCTGCG GTAAAAACAA 

951 CGGCGCAGAA AAAATTATGG 

1001 TTCAGTTCGC CGGTATCGAT 

1051 ACTACCCACT ATATGATGGG 

1101 TGTCGTTCCG CAAGGTGAAG 

1151 CGGCAGGTGA GTGCGCTTGT 

1201 ACCAACTCCC TGTTGGACTT 

1251 CATGATTAAA TTCATCAAAG 

1301 ATGCAGGTGA GTTGACCCGC 

1351 GATGGTGAAA ACGTTGATGC 

14 01 ACTGCACGCC GGCGTGTTCC 

14 51 GAGAAGTCAT GGCGATTGCC 

1501 AAGAGCAAAG TGTGGAATAC 

1551 CCTGATTGAA GTGGCGAAAG 

1601 AATCACGCGG TGCGCACGCT 

1651 AACTGGATGA AACATACGCT 

1701 CAAACCGGTG CACACCAAGC 

17 51 AGCGCGTTTA TTGATGA 



TACGCAATTC TTTGTGGAAT GGACGGCACA 
ACGGCGATGT CGTCGGCGTA ACCGCCATGG 
TATATTTTCC ACGCTAAAGC TGTGATGTTT 
TATTTATGCG TCTTCTACCA ATGCCTATAT 
GTATTTGTGC GCGTGCAGGT ATCCCGTTGG 
TTCCACCCGA CCGGCGTGGC GGGTGCGGGC 
ACGCGGCGAG GGCGGTATTC TGTTGAATGC 
AACGCTATGC GCCGACCGTA AAAGACTTGG 
CGCGCGATGG CGATGGAAAT CTACGAAGGT 
AGACCATGTC TTACTGAAAA TCGACCATAT 
AAAAACTGCC GGGCATCCGC GAGATTTCCA 
CCGATTAAAG ACCCGATTCC CGTTGTGCCG 
CGGCATTCCG ACCAATTACC ACGGCGAAGT 
ATTACGAAGT GCCTGTAAAA GGTCTGTATG 
GCTTCCGTAC ACGGTGCGAA CCGCTTGGGT 
GGTGGTATTC GGTAAAGCTG CCGGCGACAG 
AGCAAAGCGA CTGGAAACCT TTGCCTGCTA 
CAACGTATCG AGCGTTTGGA CAACCAAACC 
ATTGCGTCGC GAACTGCAAC GCTCTGTACA 
GTACTGATGA GATTCTGAGC AAAGGCGTTC 
GAGCGTGTGA AACGTACCGA AATCAAAGAC 
CGCGCGTATC GAGGCTTTGG AATTGGATAA 
CGACTTTGGT GTCTGCCGAA GCACGTAAAG 
TCAGACGACC ATCCTGAGCG CGATGATGAA 
GTACCATTCA GATATCAATA CCTTGTCCTA 
CTTTGAGCGT GGAATACATC AAACCGGCCA 



This corresponds to the amino acid sequence <SEQ ID 70; ORF 010-1>: 

mOlO-l.pep. . 

1 MGFPVRKFDA VIVGGGGAGL RAALQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGMV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AAFEAVIELE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGHAMLHTLY 

151 QQNVRANTQF FVEWTAQDLI RDENGDWGV TAMEMETGEV YIFHAKAVMF 

201 ATGGGGRIYA SSTNAYMNTG DGLGICARAG IPLEDMEFWQ FHPTGVAGAG 

251 VLITEGVRGE GGILLMADGE RFMERYAPTV KDLASRDVVS RAMAMEIYEG 

301 RGCGKNKDHV LLKIDHIGAE KIMEKLPGIR EISIQFAGID PIKDPIPWP 

351 TTHYMMGGIP TNYHGEWVP QGEDYEVPVK GLYAAGECAC ASVHGANRLG 

4 01 TNSLLDLWF GKAAGDSMIK FIKEQSDWKP LPANAGELTR QRIERLDNQT 

4 51 DGENVDALRR ELQRSVQLHA GVFRTDEILS KGVREVMAIA ERVKRTEIKD 

501 KSKVWNTARI EALELDNLIE VAKATLVSAE ARKESRGAHA SDDHPERDDE 

551 NWMKHTLYHS DINTLSYKPV HTKPLSVEYI KPAKRVY* 

m01O-l / g0l0-l 99.5% identity in 410 aa overlap 



10 20 30 40 50 60 

mO 1 0 - 1 . pep MGFPVRKFDAVI VGGGGAGLRAALQLSKS GLNCAVLSKVFPTRS HTVAAQGGI SASLGNV 

i I t I I I I I I I I I I I I I I I li I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I ! I 
gO 1 0 - 1 MGFFVRKFDAVIVGGGGAGLRAAI^LSKSGLNCAVLSKVFPTRSHTVAAQGGI SASLGNV 

10 20 30 40 50 60 



70 80 90 100 110 120 

m010-l . pep QEDRWDWHMYDTVTCGSDWLGDQDAIEFMCRAAPEAVTELE HMGMPFDRVE SGKIYQRPFG 

I I I I I I I I M I I I I 1 I I I i I I I I I t I I I I I I I I f I I I I I I I I I I I I I I I I I I I I I I I I I I 
g010-l QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIirQRPFG 

70 80 90 100 110 120 



130 140 150 160 170 180 

m010-l . pep GHTASHGKRAVERACAVADRTGHAMLHTLYQQITVRANTQF 

I I I I I I I I I I I I I I I I I I I I t M I I 1 I I I I I I I I | I I I | I | | | || | | | | | | | | | | | | | | | 
gO 1 0 - 1 GHTAEHGKRAVERACAVADRTGHAMLHTLYQQimU^QFFVEWTAQDLIRDENGDVVGV 

130 140 150 160 170 180 



190 200 210 220 230 240 

m010-l .pep TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLGICARAGIPIXDMEFWQ 
I I I I I I I I I I I I I t I I I I M I I I N I I I I I I I I I | I I | | I | | | I | | | | | I ! I | I | I I I I I 
g01O-l TAMEMETGEVYIFHAKAVMFATGGGGRIYASSTNAYMNTGDGLG I CARAG IPLEDMEFWQ 
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190 



190 



200 



210 



220 



230 



240 



m010-l .pep 
g010-l 

m010-l . pep 
g010-l 

m010-l . pep 
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250 260 270 280 290 300 

FHPT^AGAGVLITEGVRGEGGILLNADGERFMERYAPTVKDLASRDVVSRAMAME I YEG 

I I I t I I I I I I I I t I I I I I I I I I j I I I M I I 1 I I I I t I I I i I I I I I I t I I I I ! I I I | | | | | 
FHPTCTVAGACTVXITEGVRGEGGILLNADGERF^ 

250 260 270 2B0 290 300 
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RGCGKNKDHVLLKIDHIGAEKIMEKLPGIREISIQFAGIDPIKDPIPWPTTHYhlMGGIP 
I I I I I I I I I I I I I I I I I I I I I | I | | | | | | | | | | | | j | | j | | j | | j | | | | | , , j , | 

RGCGKNKDHVLLKIDHIGAEKIMEKLPGIRJEISIQFAGIT>PIKDPIPVVPTTHYMMGGIP 
310 320 330 340 350 360 
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TNYHGEVVVPQGEDYEVP\^GLYAAGECACASVHGANP^GTNSIJ^LVVFGKAAGDS>fIK 
I I I I I I I I I I I t :: M I t I t t I I I I M I I I I I t I I ! 1 I I I | I I | 1 | | | | | 
TNY H GE VWPQGDE YE VP VKGL Y AAGE CACAS VH GANRLGTNS LLD L WFRP TPRX 
370 380 390 400 410 

430 440 450 4.0 470 480 

FIKEQSDWKPLPANAGELTRQRIERLDNQTDGENVDALRRELQRSVQLHAGVFRTDEILS 



partial DNA sequence was 

eq. . 

ATGGGCTTTC CTGTTCGCAA 
TGCAGGTTTA CGCGCANCCC 
CCGTTTTGTC TAAAGTGTTC 
GGCGGTATTT CCGCCTCTCT 
GCACATGTAC GATACCGTGA 
CGATTGAGTT TATGTGCCGC 
CACATGGGTA TGCCTTTTGA 
TCCTTTCGGC GGCCATACTG 
CCTGTGCNGT TGCCGACCGT 
CAACAAAATG TCCGTGCCAA 
AGATTTGATT CGTGATGAAA 
AAATGGAAAC CGGCGAAGTT 
GCTACCGGCG GCGGCGGCCG 
GAATACCGGC GATGGTTTGG 
AAGACATGGA ATTCTGGCAA 
GTGTTGATTA CCGAAGGCGT 
CGACGGCGAA CGCTTTATGG 
CTTCTCGCGA CGTTGTTTCC 
CGCGGCTGCG GTAAAAACAA 
CGGCGCAGAA AAAATTATGG 
TTCAGTTCGC CGGTATCGAT 
ACTACCCACT ATATGATGGG 
TGTCGTTCCT CAAGGCGACG 
CGGCAGGTGA GTGCGCCTGT 
ACGAACTCCC TGCTGGACTT 
CATGATTAAA TTCATCAAAG 
ATGCCGGCGA ACTGACCCGC 
GATGGTGAAA ACGTTGATGC 
ATTGCACGCC GGCGTGTTCC 
GAGAAGTCAT GGCGATTGCC 
AAGAGCAAAG TGTGGAATAC 
CCTAATTGAA GTGGCGAAAG 
AATCACGCGG TGCGCACGCT 
AACTGGATGA AACATACGCT 
CAAACCGGTG CACACCAAGC 
AGCGCGTTTA TTGA 



identified in N. meningitidis <SEQ ID 71>: 



GTTTGATGCC 
TCCAATTATC 
CCGACCCGTT 
GGGTAATGTG 
AAGGTTCCGA 
GCCGCGCCTG 
CCGTGTGGAA 
CCGAACACGG 
ACAGGTCATG 
TACGCAATTC 
ACGGCGATGT 
TATATTTTCC 
TATTTATGCG 
GTATTTGTGC 
TTCCACCCGA 
ACGCGGCGAG 
AACGCTATGC 
CGCGCGATGG 
AGACCATGTC 
AAAAACTGCC 
CCGATTAAAG 
CGGTATTCCG 
AATACGAAGT 
GCTTCCGTAC 
AGTGGTATTC 
AGCAAAGCGA 
CAACGTATCG 
ATTGCGCCGC 
GTACTGATGA 
GAGCGTGTGA 
CGCGCGTATC 
CGACTTTGGT 
TCAGACGACC 
GTACCATTCA 
CTTTGAGCGT 



GTGATTGTCG 
CAAATCCGGT 
CGCATACCGT 
CAGGAAGACC 
CTGGTTGGGC 
AAGCCGTAAT 
AGCGGTAAAA 
TAAACGCGCG 
CGATGCTGCA 
TTTGTGGAAT 
CGTCGGCGTA 
ACGCTAAAGC 
TCTTCTACCA 
GCGTGCAGGT 
CCGGCGTGGC 
GGCGGTATTC 
GCCGACCGTA 
CGATGGAAAT 
TTACTGAAAA 
GGGCATCCGC 
ACCCGATTCC 
ACCAACTACC 
GCCTGTAAAA 
ACGGTGCGAA 
GGTAAAGCTG 
CTGGAAACCT 
AGCGTTTGGA 
GAACTGCAAC 
GATTCTGAGC 
AACGTACCGA 
GAGGCTTTGG 
GTCTGCCGAA 
ATCCTGAGCG 
GATGCCAATA 
GGAA.TACATC 



GCGGTGGTGG 
CTGAATTGTG 
AGCGGCGCAG 
GTTGGGACTG 
GACCAAGATG 
TGAGTTGGAA 
TTTATCAGCG 
GTAGAACGCG 
TACTTTGTAC 
GGACGGCACA 
ACCGCCATGG 
TGTGATGTTT 
ATGCCTATAT 
ATCCCGTTGG 
AGGTGCGGGC 
TGTTGAATGC 
AAAGACTTGG 
CTACGAAGGT 
TCGACCATAT 
GAGATTTCCA 
CGTTGTGCCG 
ATGGCGAAGT 
GGTCTGTATG 
CCGCTTGGGT 
CCGGCGACAG 
TTGCCTGCTA 
CAATCAAACT 
GCTCCGTACA 
AAAGGCGTTC 
AATCAAAGAC 
AATTGGATAA 
GCACGTAAAG 
CGATGATGAA 
CCTTGTCCTA 
AAACCGGCCA 



This corresponds to the amino acid sequence <SEQ ID 72; ORF 010-l.a>: 

aOlO-l .pep. . 

1 MGFPVRKFDA VIVGGGGAGL RAXLQLSKSG LNCAVLSKVF PTRSHTVAAQ 

51 GGISASLGNV QEDRWDWHMY DTVKGSDWLG DQDAIEFMCR AA^EAVT~LE 

101 HMGMPFDRVE SGKIYQRPFG GHTAEHGKRA VERACAVADR TGh^t.htly 
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151 
201 
251 
301 
351 
401 
451 
501 
551 



QQNVRANTQF 
ATGGGGRIYA 
VLITEGVRGE 
RGCGKNKDHV 
TTHYMMGGIP 
TNSLLDLVVF 
QGENVDALRR 
KSKVWNTARI 
NWMKHTLYHS 



FVEWTAQDLI 
SSTNAYMNTG 
GGILLNADGE 
LLKIDH1GAE 
TNYHGEWVP 
GKAAGDSMIK 
ELQRSVQLHA 
EALELDNLIE 
DANTLSYKPV 



RDENGDVVGV 
DGLGICARAG 
RFMERYAPTV 
KIMEKLPGIR 
QGDEYEVPVK 
r IKEQSDWK? 
GVFRTDEILS 
VAKATLVSAE 
HTKPLSVEYI 



TA>5EMETGEV 
IPLEDMEFWQ 
KDLASRDVVS 
EISIQFAGID 
GLYAAGECAC 
LPANAGELTR 
KGVREVMAIA 
ARKESRGAHA 
KPAKRVY* 



YIFHAKAVMF 
FHPTGVAGAG 
RAMAMEIYEG 
PIKDPIPWP 
ASVHGANRLG 
QRIERLDMQT 
ERVKRTEIKD 
SDDHPERDDE 



m010-l / a010-l 99.3% identity in 587 aa overlap 

10 20 30 40 50 60 

aOlO-l pep MGFPVRKFDAVIVGGGGAGLRAXLObSKSGU3CAVLSKVFPTRSHTVAAQGGISASLGN\^ 

iMffiiMjiiMiiiiiiii MimmMnmiimiiiiMiiiiiiif fi 

aOlO- 1 MGFPVRKFDAVIVGGGGAGLRAALQLSKSGLNCAVLSKVFPTRSHTVAAQGGISASLGNV 

10 20 30 40 50 60 
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aOlO - 1 pep QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 

[||||lll!liM!!tlt1iMill)ll]ttll)!lltlMIMIIIlllll!)MM!l 

m010 - 1 QEDRWDWHMYDTVKGSDWLGDQDAIEFMCRAAPEAVIELEHMGMPFDRVESGKIYQRPFG 

70 80 90 100 110 120 

130 140 150 160 170 180 

aOlO-l pep GHTAEHGKRAVERACAVADRTGHAMLHTLYQQNVRANTOFFVEWTAQDLIRDENGDWGV 

illllilillillilMMMiMMIlllllliiMlMllllliMIIUIIMMII 

mO 1 0 - 1 GHTAEHGK^AVERACAVADRTGHAMLHTLYQQNVRANTQFFVEWTAQDLIRDENGDVVGV 
130 140 150 160 170 180 

190 200 210 220 230 240 

a010-l pep T AMEMETG E VY I F HA KA WF ATGGGGR I Y AS STN A YKNTG DGLGICARAG IPLEDMEFWQ 

!|jjt[lll!Millilllll!l!Mllif!IMII!!!l!ill(iiilll!ililill! 

mOlO- 1 T AMEMETG EVYIFHAKAVMFATGGGGR I YASSTN A YMNTGDGLG I CARAG IPLEDMEFWQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

a010-l pep FHPTGVAGAGVLITEGVRGEGG I LLNADGERFMERYAPTVKD LAS RDWS RAMAMEIYEG 

1 1 i M 1 1 1 1 1 1 1 1 i 1 1 1 i 1 1 1 1 1 M I M i M 1 1 M I i M 1 1 1 1 1 M 1 1 1 5 1 1 1 1 1 M 1 1 i 

mOlO - 1 FHPTGVAGAGVLITEGVRGEGG I LLNADGERFMERYAPTVKD LASRDWSRAMAME I YEG 

250 260 270 280 290 300 

310 320 330 340 350 360 

a 0 1 0 - 1 . pep RGCGKNKDHVLLK I DHIGAE KI ME KLPG I REI S IQFAG IDPIKDPI P WPTTHYMMGG I P 

MlllltfjlllllllMMMtMlllliilllllMMIIiMMIIIMIIMIIII 

mO 1 0 - 1 RGCGKNKDHVLLKI DH IGAEKIMEKLPGI REI S I QFAGIDPI KDP I P WPTTHYMMGG I P 

310 320 330 340 350 360 

370 380 390 400 410 420 

a010-l pep TNYHGEVVVPQGDEYEVPVKGLYAAGECACASVHGANRLGTNSLLDLVVFGKAAGDSMIK 

IlijlMIMIhHIMilMMIIillllMIIMIIMMMMiiillililMM 

mOlO - 1 TNYHGEVWPQGEDYEVPVKGLYAAGECACASVHGANRLGTNSLLDLWFGKAAGDSMIK 
370 360 390 400 410 420 

430 440 450 460 470 480 

a010-l pep FIKEQSDWKPLPANAGELTRQRIERLDNQTDGENVDALRRELQR3VQLHAGVFRTDEILS 

MlllllllllllilUllMtflil!(illUMII!lltllllltlMIIMIII!M 

raO 1 0 - 1 F I KEQSDWKPLPANAGELTRQRI ERLDNQTDGENVDALRRELQRS VQLHAGVFRTDEI LS 

430 440 450 460 470 480 

490 500 510 520 530 540 

aOlO-l -pep KGWEV^IAERVKRTEIKDKSKVWNTARIEALELDNLIEVAKATLVSAEARKFISRGAHA 

IIIMIillMllllilMliliMIIIIilllliilllilNMMIIIilhllllli 

m010-l KGVREVMAIAERVKRTEIKDK3KVWNTARIEALELDNLIEVAKATLVSAEARKESRGAHA 
490 500 510 520 530 540 



550 560 570 580 

aOlO - I . pep SDDHPERDDENWMKKTLYKSDAIJTLSYKPVHTKPLSVEYIKPAKRVYX 

!M!l!ii MIIM IMMM I 1 1 1 1 1 ! I 1 1 1 1 1 ! 1 1 1 1 1 1 1 ! I ! I f 
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m010-l SDDHPERDDENWMKHTI J YHSDINTLSYKP T /HTKPLSVEYIKPAKR\ r YX 
550 560 570 580 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 73>: 

gOll . seq 

1 ATGAAGACAC ACCGCAAGAC CTGCTCTGCG GTGTGTTTTG CTTTTCAGAC 

51 GGCATCGAAA CCCGCCGTTT CCATCCGACA TCCCAGCGAG GACA7CATGA 

101 GCCTGAAAAC CCGCCTTACC GAAGATATGA AAACCGCGAT GCGCGCCAAA 

151 GATCAAGTTT CCCTCGGCAC CATCCGCCTC ATCAATGCCG CCGTCAAACA 

201 GTTTGAAGTA GACGAACGCA CCGAAGCCGA CGATGCCAAA ATCACCGCCA 

2 51 TCCTGACCAA AATGGTCAAA CAGCGCAAAG ACGGCGCGAA AATCTACACT 
301 GAAGCCGGCC GTCAGGATTT GGCAGACAAA GAAAACGCCG AAATCGACGT 

3 51 GCTGCACCGC TACCTGCCGC AAATGCTCTC CGCCGGCGAA ATCCGCACCG 

4 01 CCGTCGAAGC AGCCGTTGCC GAAACCGGCG CGGCAGGTAT GGCGGATATG 
4 51 GGCAAAGTGA TGGTCGTATT GAAAAcccGC CTCGCCGGCA AAGccgATAT 
501 GGGCGAAGTC AACAAAATCT TGAAAAccGt aCTGACCGCC tga 

This corresponds to the amino acid sequence <SEQ ID 74; ORF 01 1 .ng>: 

gOll . pep 

1 MKTHRKTCSA VCFAFQTASK PAVSIRHPSE D'^SLKTRLT EDMKTAMRAK 

51 DQVSLGTIRL INAAVKQFEV DERTEADDAK I'AZ^TXMVK QRKDGAKIYT 

101 EAGRQDLADK ENAEIDVLHR YLPQMLSAGE IRTAVEAAVA ETGAAGMADM 

151 GKVMWLKTR LAGKADMGEV NKILKTVLTA * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 75>: 

mOll.seq (partial) 

1 ATGAGGACAC ACCGCAAGAC CTGCTCTGCG GTGTGTTTTG CTTTTCAGAC 

51 GGCATCGAAA CCCGCCGTTT CCATCCGACA TCCCAGCGAG GACATCATGA 

101 GCCTGAAAAT CCGCCTTACC GAAGACATGA AAACCGCGAT GCGCGCCAAA 

151 GACCAAGTTT CCCTCGGCAC CATCCGCCTC ATCAACGCCG CCGTCAAACA 

201 GTTTGAAGTG GACGAACGCA CCGAAGCCGA CGATGCCAAA ATCACCGCCA 

2 51 TCCTGACCAA AATGGTCAAA CAGCGAAAAG ACAGCGCGAA AATCTACACT 

301 GAAGCCGGCC GTCAGGATTT GGCAGACAAA GAAAACGCCG AAATCGAGGT 

351 ACTGCACCGC TACCTTCCCC AAATGCTTTC CGCCGGCGAA ATCCGTACCG 

4 01 AGGTCGAAGC TGCCGTTGCC GAAACCGGCG CGGCAGGTAT GGCGGATATG 

4 51 GGTAAAGTCA TGGGGCTGCT GAAAACCCGC CTCGCAGGTA AAGCCGA. . . 

This corresponds to the amino acid sequence <SEQ ID 76; ORF 01 1>: 

mOll.pep (partial) 

1 MRTHRKTCSA VCFAFQTASK PAVSIRHPSE DIMSLKIRLT EDMKTAMRAK 

51 DQVSLGTIRL INAAVKQFEV DERTEADDAK ITAILTKMVK QRKDSAKIYT 

101 EAGRQDLADK ENAEIEVLHR YLPQMLSAGE IRTEVEAAVA ETGAAGMADM 

151 GKVMGLLKTR LAGKA 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 01 1 shows 95.8% identity over a 165 aa overlap with a predicted ORF (ORF 01 l.ng) 
from TV. gonorrhoeae: 

mOll/gOll 

10 20 30 40 50 60 

mOll.pep MRTHRKTCSAVCFAFQTASKPAVSIRHPSEDIMSLKIRLTEDMKTAMRAKDQVSLGTIRL 

I : I I ! i I I I I M ! I I I i I I I I | I | j | | ! | | | | | i I 1 I I I I ! I I ! 

<3° 11 MKTHRKTCSAVCFAFQTASKPAVSIRHPSEDIMSLKTRLTEDMKTAMRAKDQVSLGtirl 

10 20 30 40 50 60 

70 80 90 100 110 120 

mOll .pep INAAVKQFEVDERTEADDAKITAILTKMVKQRKDSAKIYTEAGRQDLADKENAEIEVLHR 
I ' I ' I I I I I 1 I I I I I I I I I I I | | | | | | | j | | | | | : | | I | | | | j | | | j | | | j | j | j . | j | j 

9° U INAAVKQFEVDERTEADDAKITAILTKMVKQRKDGAKI YTEAGRQDLADKENAE T DVLHR 
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70 80 90 100 110 120 

130 140 150 160 

mOll . pep YLPQMLSAGEIRTEVEAAVAETGAAGMADMGKVMGLLKTRLAGKA 
M 1 t I I I I I I I I I I I I I I I I I I I j I I I ! I I I ! | : | | | | | | | | j 
g 0 1 1 YLP'jMLSAGE I RT A V Z AAV A E T GAA 3MA DM G K V MVV L K T R LAG KA DMGE VN K I L KT V LT A 

130 14 0 150 160 170 180 

gOll X 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 77>: 

g012 . seq 

1 ATGCTCGCCC GTCGCTATTT TTTCAATATC CAACCCGGGG CGGTTTTCAC 

51 TGACAAACTG CTTGAACAAC TGATGCGTTT CCTCCAGTTC CTGCCGGAAT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC ATAAAAGTAA CCGTGCGCTT 

151 AAATTCGCCC GCCGTCATCA CATCCACATC AATATCATGT TTTTTCAACa 

201 gGcggTGGAT ATTCGgcact tccgCcacca cacccaccga accgatgacc 

251 gcaaacggaG CGGAAACAAT TTTATCCGCc acacacgcca tcatatagcc 

301 gcCGCTTGCC GCGACCTTAT CGAcggcge.c ggTCAGCGGA ATATTGCGTT 

351 CGCGCAAACG CCTAAGCTGC GAAGCCGCCA AACCGTAACC GTGAACCACG 

4 01 CCGCCCGGAC TTTCCAATCT GAGCAGAACC TCATCTTCAG GCTTGGCAAT 

4 51 CAAAAGCACC GCCGTAATCT CATGACGCAA GGATTCTACG GCGTGTGCAT 

501 ACAAATCGCC GTCAAAATCC AACACAAAAA GGCGGGATTT TTGCGTTTCG 

551 GCAGATTTCT CCCCGCCCTC CTTCAAACGC TTTTTCTCTG CTTTGGCTTC 

601 CGCCTTTTCC TTTTTCTTTT CTTTTTTTTC CTGATGTTTT GTCTCTTCCT 

651 CGCTTAA 

This corresponds to the amino acid sequence <SEQ ID 78; ORF 012.ng>: 

g012 .pep 

1 M LARRY F FN I QPGAVFTDKL LEQLMRFLQF LPEFLFALFR IFTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRHFRHHTHR TDDRKRSGNN FIRHTRHHIA 

101 AACRDLIDGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS EQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPAL LQTLFLCFGF 

201 RLFLFLFFFF LMFCLFLA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 79>: 



m012 . seq 












1 


ATGCTCGCCC 


GTTGCCACTT 


CCTCAATATC 


CAATTGAGGG 


CGGTTCTCGC 


51 


TGACAAACTG 


CTTGAACAAC 


TGATGCGTTT 


CCTCCAGTTC 


CTGTCGGAAT 


101 


TTCTGTTTGC 


CCTTTTCCGT 


ATTTTCACCC 


ATAAAAGTAA 


CCGTGCGCTT 


151 


AAATTCGCCC 


GCCGTCATCA 


CATCCACATC 


AATATCATGT 


TTTTTCAACA 


201 


GGCGGTGGAT 


ATTCGGTACT 


TCCGCCACCA 


CACCCACCGA 


ACCGACAATC 


251 


GCAAACGGAG 


CGGAAGCAAT 


TTTATCCGCC 


ACACACGCCA 


TCATATAACC 


301 


GCCGCTCGCn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


351 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


401 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


451 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


501 


nnnnnnnnnn 


nnnnnnnnnC 


AACACAAAAA 


GGCGTGATTT 


nTGCGTTTCG 


551 


GCAGATTTCT 


CCCCACCCTC 


CTTCAAACGT 


TTTTCcTCTG 


CTTTGGCTTC 


601 


CGCCTTTTCC 


TTTTTCTTTT 


CCTCTTTTTC 


CTGATGTTGT 


GCCTCTTCCC 


651 


CGCTTAA 











This corresponds to the amino acid sequence <SEQ ID 80; ORF 012>: 

m012 . pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR I FTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRHHTHR TDNRKRSGSN FIRHTRHHIT 

101 AARXXXXXXX xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

151 XXXXXXXXXX XXXXXXXXXX XXXQHKKA* F XRFGRFLPTL LQTFFLCFGF 

201 RLFLFLFLFF LMLCLFPA" 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID SI- 



aO 12 . seq 








1 


ATGCTCGCCC 


GTTGCCACTT 


CCTCAATATC 


51 


TGACAAACTG 


CTTGAACAAC 


TGATGCGTTT 


101 


TTCTGTTTGC 


CCTTTTCCG7 


ATTTTCACGC 


151 


AAATTCGCCC 


GCCGTCATCA 


CATCCACATC 


2 01 


GGCGGTGGAT 


ATTCGGTAC7 


TCCGCTACAA 


251 


GCAAACGGAG 


CGGAAACAAT 


TTTATCCGCC 


301 


ACCGCTCGCC 


GCCACCTTAT 


CGACGGCGAC 


351 


CGCGCAAACG 


CCTAAGCTGC 


GAAGCCGCCA 


401 


CCGCCCGGAC 


TTTCCAATCT 


AAGCAGAACC 


451 


CAAAAGCACC 


GCCGTAATCT 


CATGACGCAA 


501 


ACAAATCGCC 


GTCAAAATCC 


AACACAAAAA 


551 


GAAGATTTCT 


CCCCACCCTC 


CTTCAAACGC 


601 


CGCCTTTTCC 




CCTCTTTTTC 


651 


CGCTTAA 







CAATTGAGGG 
CCTCCAGTTC 
A T AAAAG T AA 
AATATCATGT 
CACCCACCGA 
ACACACGCCA 
GGTCAGCGGA 
AACCGTAACC 
TCATGTTCAG 
GGATTCTACG 
GGCGGGATTT 
TTTTTCTCTG 
GTGATGTTTT 



CGGTTCTCGC 
CTGTCGGAAT 
CCGTGCGCTT 
TTTTTCAACA 
ACCGACAA7C 
TCATATAACC 
ATATTGCGTT 
GTGAACCACG 
GCTTGGCAAT 
GCGTGTGCAT 
TTGCGTTTCG 
CTTTGGCTTC 
GCCTCTTCCC 



This corresponds to the amino acid sequence <SEQ ID 82; ORF 012.a>: 

a012 .pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR I FTHKSNRAL 

51 KFARRHHIHI NIMFFQQAVD IRYFRYNTHR TDKRKRSGNN FIRHTRHHIT 

101 TARRHLI DGD GQRNI AFAQT PKLRSRQTVT VNHAARTFQS KQNLIFRLGN 

151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF 

201 RLFLFLFLFF LMFCLFPA* 



moi2/aoi2 64.2% identity over a 218 aa overlap 



10 20 30 40 50 60 

m012 . pep MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRI FTHKSNRALKFARRHHIHI 

I I i I I I I I I If I I I I i i I I ! I I I I I I I I I I I i i I I I I I I I I I I ! I I i I 1 I M I I I I I I M 
a012 MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRI FTHKSNRALKFARRHHIHI 

10 20 30 40 50 60 



70 80 90 100 110 120 

m012.pep NIMFFQQAVD I RYFRHHTHRTDNRKRSGSNFIRHTRHH I TAARXXXXXXXXXXXXXXXXX 

I I I M t I I I I I I I I I : : ! I I ! I I I i i I I : I I I i I I i I I i I : I I : 
a012 NIMFFQQAVD I RYFRYNTHRTDNRKRSGNNFIRHTRHH I TTARRHL I DGDGQRN I AFAQT 

70 80 90 100 110 120 



130 140 150 160 170 180 

mO 1 2 . pep XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQHKKAX F 

: : : : I I I I I I 

a012 PKLRSRQTVTVNHAARTFQSKQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 

130 140 150 160 170 180 



190 200 210 219 

m012 . pep XRFGRFLPTLLQTFFLCFGFRLFLFLFLFFLMLCLFPAX 
I I M 1 \\ \ \ \ I t : I I 1 ! I I ! ( t I t i I I ! I I 1 : I i I I It 
a012 LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPA-X. 

190 200 210 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 012 shows 58.7% identity over a 218 aa overlap with a predicted ORF (ORF 012.ng) 
from N. gonorrhoeae: 

m012/g012 

10 20 30 40 50 60 

m012 . pep MLARCH FLN I QLRAVLADKLLEQLMRFLQFLSEFLFALFRI FTHKSNRALKFARRHHIHI 

I i I i : I : I I I I I : : I II i i I II I I i I I I I I I I I I I M I I I I I I I I I j I I I I ! I I j ! 
g012 MLARRYFFNIQPGAVF7DKLLEQLMRFLQFLPEFLFALFR I FTHKSNRALKFARRHHIHI 
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10 20 30 40 50 60 

70 SO 90 100 110 120 

m012 . pep NIMFFQQAVDIRY FRHHTHRTDNRKRSGSNFIRHTRHHITAARXXXXXXXXXXXXXXXXX 

I II I I I I I I I I ! : i I ! I I I S 1 I : 1 t i I i : I i I I I I it 1 ! : ! I : 
g 0 1 2 N I M F FQQ A V D I R H FR H K T HRT D DR KR S G N N F I R H T R H H I AAACR D L I DG DGQ RN I A FA QT 

70 80 90 100 110 120 

130 140 150 160 170 180 

mO 12 . pep XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQHKKAXF 

: : : : I I i I I I 

aO 12 PKLRSRQTVTVNHAARTFQSEQNLIFRLGNQKHPRNLMTQGFYGVCIQIAVKIQHKKAGF 

130 140 150 160 170 180 

190 200 210 219 

m012.pep XRFGRFLPTLLQTFFLCFGFRLFLFLFLFFLMLCLFPAX 
I! I I I I I : I I I f : I I M I I I M I I I I : I I ! I : 1 II II 
g012 LRFGRFLPALLQTLFLCFGFRLFLFLFFFFLMFCLFLAX 

190 200 210 



The following partial DNA sequence was 

m012-i . seq 

1 ATGCTCGCCC GTTGCCACTT 

51 TGACAAACTG CTTGAACAAC 

101 TTCTGTTTGC CCTTTTCCGT 

151 AAATTCGCCC GCCGTCATCA 

201 GGCGGTGGAT ATTCGGTACT 

2 51 GCAAACGGAG CGGAAGCAAT 

301 GCCGCTCGCC GCCACCTTAT 

351 CGCGCAAACG CyTAAGCTGC 

4 01 CCGCCCGGAC TTTCCAATCT 

4 51 CAAAAGCACC GCCGTAATCT 

501 ACAAATCGCC GTCAAAATCC 

551 GCAGATTTCT CCCCACCCTC 

601 CGCCTTTTCC TTTTTCTTTT 

651 CGCTTAA 



identified in N. meningitidis <SEQ ID 83>: 



CCTCAATATC 
TGATGCGTTT 
ATTTTCACCC 
CATCCACATC 
TCCGCCACCA 
TTTATCCGCC 
CGACGGCGAC 
GAAGCCGCCA 
GAGCAGAACC 
CATGACGCAA 
AACACAAAAA 
CTTCAAACGC 
CCTCTTTTTC 



CAATTGAGGG 
CCTCCAGTTC 
ATAAAAGTAA 
AATATCATGT 
CACCCACCGA 
ACACACGCCA 
GGTCAGCGGA 
AACCGTAACC 
TCATCTTCAG 
GGATTCTACG 
GGCGGGATTT 
TTTTTCTCTG 
CTGATGTTTT 



CGGTTCTCGC 
CTGTCGGAAT 
CCGTGCGCTT 
TTTTTCAACA 
ACCGACAATC 
TCATATAACC 
ATATTGCGTT 
GTGAACCACG 
GCTTGGCAAT 
GCGTGTGCAT 
TTGCGTTTCG 
CTTTGGCTTC 
GCCTCTTCCC 



This corresponds to the amino acid sequence <SEQ ID 84; ORF 012- 1>: 

m012-l .pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF 
51 KFARRHHIHI NIMFFQQAVD IRYFRHHTHR 
101 AARRHLIDGD GQRNIAFAQT XKLRSRQTVT 
151 QKHRRNLMTQ GFYGVCIQI A VKIQHKKAGF 
201 RLFLFLFLFF LMFCLFPA* 



LSEFLFALFR IFTHKSNRAL 

TDNRKRSGSN FIRHTRHHIT 

VNHAARTFQS EQNLI FRLGN 

LRFGRFLPTL LQTLFLCFGF 



m012-l/g012 



91.7% identity in 218 aa overlap 



10 20 30 40 50 60 

m012-l .pep MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRI FTHKSNRALKFARRHHIHI 
I I i I : I : I I I i I : : I I I 1 I I I I I I i I I I I I I I I I i I I I I ! I ! I I 1 I I I I I I I I I I I 
g012 MLARRYFFNIQPGAVFTDKLLEQLMRFLQFLPEFLFALFRI FTHKSNRALKFARRHHIHI 

10 20 30 40 50 60 

70 80 90 100 110 120 

m012-l . pep NIMFFQQAVD I RYFRHHTHRTDNRKRSGSNFIRHTRHHITAARRHL IDG DGQRNIAFAQT 
1 I II I I I i I I II : I ! M t I I ! 1 : II I M : I I I I II I I I I : I 1 I i I I I I I I 1 I I I M I I 
g012 NIMFFQQAVDIRHFRHHTHRTDDRKRSGNNFIRHTRHHIAAACRDLIDGDGQRNIAFAQT 

70 80 90 100 110 120 



130 140 150 160 170 180 

m012-l .pep XKLRSRQTVTVNHAA.RTFQSEQNLIFRLGNQKHRRNLMTQGFYGVCIQIAVKIQHKKAGF 
I I I 1 I I f M I ( i I ! I I I i | I i I I I I i I I 1 I I I i I I I I i I I I I | I I I I ! | | | M || I | | [ 
g 0 1 2 PKLRSRQTVTVNHAARTFQSE QN L I FR LG N QKH RRN LMT QG F Y G V C I QI A V K I Q H KKAG F 
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130 



140 



150 



160 



170 



ISO 



190 200 210 219 

m012-l .pep LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 
I I I I I I I I : I I I ! I I I ! It I I I I I i I I : I I I I I I [ i I j 
g012 LRFGRFLPALLQTLFLCFG FRLFLFLFFFFLMFCLFLAX 

190 200 210 



The following partial DNA sequence was 

a012-l . seq 

1 ATGCTCGCCC GTTGCCACTT CCTCAATATC 

5 3 TGACAAACTG CTTGAACAAC TGATGCGTTT 

101 TTCTGTTTGC CCTTTTCCGT ATTTTCACCC 

151 AAATTCGCCC GCCGTCATCA CATCCACATC 

201 GGCGGTGGAT ATTCGGTACT TCCGCTACAA 

251 GCAAACGGAG CGGAAACAAT TTTATCCGCC 

301 ACCGCTCGCC GCCACCTTAT CGACGGCGAC 

351 CGCGCAAACG CCTAAGCTGC GAAGCCGCCA 

4 01 CCGCCCGGAC TTTCCAATCT AAGCAGAACC 

4 51 CAAAAGCACC GCCGTAATCT CATGACGCAA 

501 ACAAATCGCC GTCAAAATCC AACACAAAAA 

551 GAAGATTTCT CCCCACCCTC CTTCAAACGC 

601 CGCCTTTTCC TTTTTCTTTT CCTCTTTTTC 

551 CGCTTAA 



identified in N. meningitidis <SEQ ID 85>: 



CAATTGAGGG 
CCTCCAGTTC 
ATAAAAGTAA 
AATATCATGT 
CACCCACCGA 
ACACACGCCA 
GGTCAGCGGA 
AACCGTAACC 
TCATCTTCAG 
GGATTCTACG 
G^CGGGATTT 
. T _~?CTCTG 
CTGATGTTTT 



CGGTTCTCGC 
CTGTCGGAAT 
CCGTGCGCTT 
TTTTTCAACA 
ACCGACAATC 
TCATATAACC 
ATATTGCGTT 
GTGAACCACG 
GCTTGGCAAT 
GCGTGTGCAT 
TTGCGTTTCG 
CTTTGGCTTC 
GCCTCTTCCC 



This corresponds to the amino acid sequence <SEQ ID 86; ORF 012-1 .a>; 

a012-l . pep 

1 MLARCHFLNI QLRAVLADKL LEQLMRFLQF LSEFLFALFR IFTHKSNRAL 
51 KFARRHHIHI NIMFFQQAVD IRYFRYNTHR TDNRKRSGNN FIRHTRHHIT 
101 TARRHLIDGD GQRNIAFAQT PKLRSRQTVT VNHAARTFQS KQNLI FRLGN 
151 QKHRRNLMTQ GFYGVCIQIA VKIQHKKAGF LRFGRFLPTL LQTLFLCFGF 
201 RLFLFLFLFF LMFCLFPA* 



a012-l/m012-l 



a012-l . pep 



m012-l 



97.2% identity in 218 aa overlap 

10 20 30 40 50 60 

MLARCHFLNIQLRAVLADKLLEQLMRFLQFLSEFLFALFRIFTHKSNRALKFARRHHIHI 
i I I I I I ! M M I I 1 I I I I I I I i I I I I I I I I I I I I i If I I i I II I M I I I ! I I I I I I I I I ! 
MLARCHFLN IQLRAVLADKLLEQLMRFLQFLSEFLFALFRI FTHKSNRALKFARRHKI H I 
10 20 30 40 50 60 



70 80 90 100 110 120 

a012-l . pep NIMFFQQAVDIRYFRYNTHRTDNRKRSGNNFIRHTRHHITTARRHLI DGDGQRNIAFAQT 
M I I I I M I I I 1 I M : : I M I I I I M I ! : I I i I I I I I I i I : I 1 [ I M I M i I M I I I I i 1 
m012-l NIMFFQQAVDIRYFRHHTHRTDNRKRSGSNFIRHTRHHITAARRHLI DGDGQRNIAFAQT 

70 80 90 100 110 120 



130 140 150 160 170 180 

a012-l .pep PKLRSRQTVT VNHAARTFQSKQNLIFRLGNQKHRRNLMTQGFYGVC I QIAVKIQHKKAGF 
M I ! I I I i I I 1 M I M M I : I ! I I I I I M I I I I I I I i I I I I | i M M I I I I I t II i I I I 
m012-l XKLRSRQTVT VNRAART FQSEQNL I FRLGNQKHRRNLMTQG FYGVC I QI AVKI QHKKAG F 

130 140 150 160 170 180 

190 200 210 219 

a012-l . pep LRFGRFLPTLLQTLFLCFGFRLFLFLFLFFLMFCLFPAX 
i I II I I I I I I I I M I I I I I I I 1 I I li I I M 1 I I I i If i I 

m012-l LRFGRFLPTLLQTLFLCFG FRLFLFLFLFFLMFCLFPAX 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 87>: 

g013 . seq 

1 aTgcctttga ccatgctgtg cagcaGGAcg tGCGGTTtgt tcataataca 

51 gtCcgaccGG AAAagccgAG GAAaCGCAGT GCCGC^^CCT TCCCCTTTCT 

101 TGCCGTGGCA GGCGATGCag tTgGATTCGT ACACTTTTTG CCCTTTtGtc 
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151 atgatGCTgt tgtcggCGGC AGAAGCgGCG GcgCAGAGGC AGCACAAGAT 

201 GAAGGCGGTC GGCAGTCGGG TTGTGTt cat tGgcqTTTCC cctaatgtcc 

251 tgaaacctrg t cttttgatt Ttgcctttac ggcgrgaaaa gtttttTtgg 

301 cccaaatccg gaatttag 

This corresponds to the amino acid sequence <SEQ ID 88; ORF 013.ng: 

g013 .pep 

1 MPLTMLCSRT CGLFI IQSDR KSGGNAVPRP SPFLPWQAMQ LDSYTFCPFV 

51 MMLLSAAEAA AQRQHKMKAV GSR VVFIGVS PNVLKPCFLI LPLRGEKFFW 

101 PKSGI* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 89>: 

m013 . seq 

1 ATGCCTTTGA CCATGCTGTG CAGCAGCACC TGCGGTTTTT TCATGATGAA 

51 GTCGGAGCGG TAGAGCGGCG GAAACATGGT TCCGCGGCCT TCGCCCTTTT 

101 TGCCGTGGCA GGCGACGCAG TTGGATTCGT ACACTTTTTG CCCTTTTGTC 

151 ATGATGCTGT TGTCGGCGGC AGAAGCGGCG GCGCAGAAGC AGCCCAAGAC 

201 GAGGGCGGTC GGCAGTCGGG TTGTGTTCAT TGGTGTTTCC TTCATGTTTG 

2 51 AAACCTTGTT GTTGATTTTG CGTAGCGGGT GAAA.GATTTT TTTGCCGAAT 

301 CAGTAG 

This corresponds to the amino acid sequence <SEQ ID 90; ORF 01 3>: 

m013.pep 

1 MPLTMLCSST CGFFMMKSER XSGGNMVPRP SPFLPWQATQ LDSYTFCPFV 

51 MMLLSAAEAA AQKQPKTRAV GSR VVFIGVS FMFETLLLI L RSGXKI FLPN 

101 Q* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 91 >: 

a01 3 . seq 

1 ATGCCTTTGA CCATGCTGTG CAGCAGCACC TGCGGTTTTT TCATGATGAA. 

51 GTCGGAGCGG TAGAGCGGCG GAAACATGGT TCCGCGGCCT TCGCCCTTTT 

101 TGCCGTGGCA GGCGACGCAG TTGGATTCGT ACACTTTTTG CCCTTTTGTC 

151 ATGATGCTGT TGTCGGCGGC AGAAGCGGCG GCGCAGAGGC AGCCCAAGAC 

201 GAGGGCGGTC GGCAGTCGGG TTGTGTTCAT TGGTGTTTCC TTAATGTTTG 

2 51 AAACCTTGTT GTTGATTTTG CGTAGCGGGT GAAAGATTTT CTTGCCGAAT 

301 CGGTAG 

This corresponds to the amino acid sequence <SEQ ID 92; ORF 013.a>: 

a013 . pep 

1 MPLTMLCSST CGFFMMKSER +SGGNMVPRP SPFLPWQATQ LDSYTFCPFV 

51 MMLLSAAEAA AQRQPKTRAV GSR VVFIGVS LMFETLLLIL RSG*KIFLPN 

101 R* 



m0i3/a0i3 97.0% identity over a 101 aa overlap 

10 20 30 40 50 60 

m013.pep MPLTMLCSSTCGFFMMKSERXSGGNMVPRPS PFLPWQATQLDSYT FCPFVMMLLSAAEAA 

It I 1 I I I I M I I I I I I I I il I I I I I I I i I I t I I i I I i M I ! I i t I 1 I I I I I I I I I I M i I 
a013 MPLTMLCSSTCGFFMMKSERXSGGNMVPRPS PFLPWQATQLDSYT FCPFVMMLLSAAEAA 

10 20 30 40 50 60 



70 80 90 100 

m013. pep AQKQPKTRAVGSRVVFIGVSFMFETLLLILRSGXKI FLPNQX 

I I : I I I i I ! I I! I I 1 I f I I I : I I I i I I I I I M ! M I I I I ! : I 
a013 AQRQPKTRAVGSRVVFIGVSLMFETLLLILRSGXKI FLPNRX 

70 80 90 100 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 013 shows 73.3% identity over a 101 aa overlap with a predicted ORF (ORF 013.ng) 
from N. gonorrhoeae: 
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m013/g0l3 

10 20 30 40 50 60 

mOl 3 . pep M P LT.M LC S S T C G F FMM K S E RX SG G NM V P R P S P F L P WQ AT Q LDSYTFC? FVMM L L S AA E AA 

i I 11 M I I I S i : I : : : t : \ WW I I W \ I M i t I 1 M i i I I M I I I I I I I I I I I I 
gOI3 MPLTMLCSRTCGLFIIQSDRKSGGNAVPRP5PFLPWQAMQLDSYTFCPFVMMLLSAAEA.A 

10 20 30 40 50 60 



70 80 90 100 

m 0 1 3 . pe p AQKQPKT FLA V G S R VV F I G V S F- H FE T L L L I L R - S GX K I FL PN QX 

I 1 : I I : I I I f I I I I 1 I I I : : : MM I Ml M 
g 0 1 3 AQR QHKMK A VG SRVVFIGVSPNVLKPCFLI LPLRGEKFFWPKSGIX 

70 80 90 100 



The following partial DNA sequence was identified in A ; . gonorrhoeae <SEQ ID 93>: 

g015 . seq 

1 ATGCAGTATC TGATTGTCAA ATACAGCCAT CAAATCTTCG TTACCATCAC 

51 CATTTTGGTA TTCAACATCC GTTTTTTCCT ACTTTGGAAA AATCCAGAAA 

101 AGCCCTTGGT CGGCTTTTGG AAAGCACTGC CCCACCTCAA CGACACGATG 

151 CTGCTGTTTA CGGGATTGTG GCTGATGAAG ATTACCCATT TCTCCCCGTT 

201 CAACGCGCCT TGGCTCGGCA CAAAAATCCT GCTCCTGTTC GCCTACATCG 

251 CACTGGGCAT GGTAATGATG CGCGCCCGTC CGCGTTCGAC CAAGTTCTAC 

3 01 ACCGTTTACC TGCTCGCTAT GTGTTGCATC GCCTGCATCG TTTACCTTGC 

351 CAAAACCAAA GTCCTGCCAT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 94; ORF 015.ng>: 

g015 .pep 

1 MQYLIVKYSH Q IFVTITILV FNIRFFLL WK NPEKPLVGFW KALPHLNDTM 

51 LLFTGLWLMK ITKFSPFNAP WLGTK ILLLF AYIALGMVMM RA RPRSTKFY 

101 TVYLLAMCCI ACIVYLAKTK VLPF* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 95>: 

m015.seq (partial) 

1 . . AAAATCAGAA AAGCCTTGGC GGGCTTTTGG AAGGCACTGC CCCACCTTAA 

51 CGACACCATG CTGCTGTTTA CGGGATTGTG GCTGATGAAA ATTACCCATT 

101 TCTCCCCGTT CAACGCGCCT TGGCTCGGTA CAAAAATCCT GCTTCTGCTC 

151 GCCTATATCG CATTGGGTAT GATGATGATG CGCGCCCGTC CGCGTTCGAC 

2 01 CAAGTTCTAC ACCGTTTACC TGCTCGCCAT GTGTTGCGTC GCCTGCATCG 

2 51 TTTACCTTGC CAAAACCAAA GTCCTGCCTT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 96; ORF 015: 

mOlS-pep (partial) 

1 . . KIRKALAGFW KALPHLNDTM LLFTGLWLMK ITHFSPFNAP WLGTKILLLL 
SI AYIALGMMMM RARPRSTKFY TVYLLAMCCV ACIVYLAKTK VLPF* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 97>: 

a015 . seq 

1 ATGCAGTATC TGATTGTCAA ATACAGCCAT CAAATCTTCG TTACCATCAC 

51 CATTTTGGTA TTCAACATCC GTGTTTTCNT ACTTTGGAAA AATCCAGAAA 

101 AGCCCTTGGC GGGCTTTTGG AAGGCACTGC CCCACCTTAA CGACACCATG 

151 CTGCTGTTTA CGGGATTGTG GCTGATGAAA ATTACCCATT TCTCCCCGTT 

201 CAACGCGCCT TGGCTCGGTA CAAAAATCCT GCTTCTGCTC GCCTATATCG 

2 51 CATTGGGTAT GATGATGATG CGCGCCCGTC CGCGTTCGAC CAAGTTCTAC 

301 ACCGTTTACC TGCTCGCCAT GTGTTGCCTC ACCTGCATCG TTTACCTTGC 

351 CAAAACCAAA GTCCTGCCTT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 98; ORF 015.a>: 

a015 . pep 
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1 MQYLIVKYSH Q IFVTITILV FNJRVFXL WK NPEKFLAGFW KALPHLNDTM 
51 LLFTGLWLMK ITHFSPFNAP WLGTK ILLLL AYIALGMMMM RA RPRSTKFY 
101 TVYLLAMCCL TCI VYLAKTK VLPF* 



mois/aoi5 96.7% identity over a 91 aa overlap 



10 20 30 

m015.pep KIRr^LAGFWKALPHLNDTMLLFTGLWLNjKITH 

! I I I I I I I M I I I I I I I I i I t I I I I I ! I I 
a 0 1 5 L I VKY S HQ I FVT I T I L V FN I R V FX LWKN P E K ? LAG FW KAL ?K L N DT M L L FTG L W LM K I T H 

10 20 30 40 50 60 



40 50 60 70 SO 90 

m015. pep FSPFNAPWLGTKILLLLAYIALGMMMMRARPRSTKFYTVYLLAMCCVACIVYLAKTKVLP 
I I I I II I I 1 ( I I I I I I II I I I i t I I I I i i I I I I ! I I U I i ! I I I I ( : : I I I I i I I I i I I i 
a015 FSPFNAPWLGTKILLLLAYIALGMMMMRARPRSTKFYTVYLLAMCCLTCIVYLAKTKVLP 
70 80 90 100 110 120 



mOlS.pep FX 
I I 

a015 FX 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 015 shows 94.5% identity over a 91 aa overlap with a predicted ORF (ORF 015.ng) 
from N. gonorrhoeae: 

m0l5/g015 

10 20 30 

KI RKALAGFWKALPHLNDTMLLFTGLWLMKI TH 

I hllllllllllllllllllilllllM 
LIVKYSHQIFVTITILVFNIRFFLLWKNPEKPLVGFWKALPHLNDTMLLFTGLWLMKITH 
10 20 30 40 50 60 



m015 . pep 
g015 



40 50 60 70 80 90 

m015 . pep FSPFNAPWLGTKILLLI^YIALGMMMMRARPRSTKFYTVYLI^CCVACIVYLAKTKVLP 

ll||illiitli!il|:illlllhitl|]lii)IIMIIIi)Mt:||lliilliii!i 
gO 15 FSPFNAPWLGTKILLLFAYIALGMVMMRARPRSTKFYTVYLLAMCCIACIVYLAKTKVLP 
70 80 90 100 110 120 



mOlS.pep FX 

II 

g015 FX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 99>: 

g0l8 . seq 

1 atGCAGCAGG GGCagttggt tggacgcgtc gcccgcaata AAGATATGCG 

51 GAATgctggt CTGCATggtC AGCGGATCGG CAACGGGtac gccgcgcgcg 

101 tctttgTCGA TATTGATGTT TTCCAAACCG ATATtgTCAA CGTTCGGACG 

151 GCgACCTACG GCTGCCAACA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT tctACATTGC CGTCTGCGTC GAGTTTGACC 

251 TCGGTTTTAG CATCCAGATG CAGTTTCAAT t ctTCTCCGA ACACGGCTTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 100; ORF 018.ng>: 

gOl8 .pep 

1 MQQGQLVGRV ARNKDMRNAG LHGQRIGNGY .AARVFVDIDV FQTDIVNVRT 

51 ATYGCQHIFG HKY AFFAI LL PMDFY I AVCV EFDLGFSIQM QFQFFSEHGF 
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101 RLV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 101 >: 

ra018 . seq 

1 ATGCAGCAGA GGCAGTTGGT TGGACGCATC GCCTGCGATG AAGATATGCG 

51 GAATACTGGT CTGCATGGTC AGCGGGTCGG CAACAGGTAC GCCGCGCGCA 

101 TCTTTTTCGA TATTGATATT TTCCAAACCG ATATTGTCAA CGTTCGGACG 

151 GCGGCCCACG GCTGCCAGCA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT TCTACATTGC CGTCTGCATC GAGTTTGACC 

2 51 TCGGTTTTAG CATCCAGATG CAGTTTCAAT TCTTCGCCGA ACACGGCGTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 102; ORF 01 8>: 

m018 .pep 

1 MQQRQLVGRI ACDEDMRNTG LHGQRVGNRY AARI FFDIDT FQTDI VNVRT 
51 AAHGCQHIFG NKY AFFAILL PMPFYIAVCI EFDLGFSIQM QFQFFAEHGV 
101 RLV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 103>: 

a018 . seq 

1 ATGCAGCAGG GGCAGTTGGT TGGACGCGTC GCCCGCAATA AAGATATGCG 

51 GAATACTGGT CTGCATAGTC AGCGGATCGG CAACGGGTAC GCCGCGCGCA 

101 TCTTTTTCGA TATTGATGTT TTCCAAACCG ATATTGTCAA CGTTCGGACG 

151 GCGGCCTACG GCTGCCAGCA TATATTCGGC AACAAATACG CCTTTTTCGC 

201 CATCCTGCTC CCAATGGACT TCTACATTGC CGTCTGCGTC GAGTTTGGCC 

251 TCGGTTTTAG CATCCAAATG CAGTTTCAAT TCTTCACCGA ACACGGCTTT 

301 CGCCTCGTCT GA 

This corresponds to the amino acid sequence <SEQ ID 104; ORF 018.a>: 

a018 .pep 

1 MQQGQLVGRV ARNKDMRNTG LHSQRIGNGY AARIFFDIDV FQTDIVNVRT 
51 AAYGCQHIFG NKY AFFAILL PMDFYIAVCV EFGLGFSIQM QFQFFTEHGF 
101 RLV* 

moi8/aoi8 86.4% identity over a 103 aa overlap 

10 20 30 40 50 60 

m018 .pep MQQRQLVGRIACDEDMRNTGLHGQRVGNRYAARI FFDIDI FQTDI VNVRTAAHGCQHIFG 

IN I I t I I : I : : I I ! I I I I I : 1 ! : I I I ! 1 I I M I I I : I i I I I ! I I I II I : I I I M i I 
a0!8 MQQGQLVGRVARNKDMRNTGLH SQRIGNGY AARI FFD I DVFQTD I VNVRT AAYGCQHIFG 

10 20 30 40 50 60 

70 80 90 100 

m0l8 . pep NKYAFFAI LLPMDFYIAVC IE FDLGFS IQMQFQFFAEHGVRLVX 

i I I i M I M t I I I 1 I M II : I \ \ I \\ \ 1 I I II I I : 11 I 1111 
a0!8 NKYAFFAI LLPMDFYIAVCVEFGLGFS I QMQFQFFTEHGFRLVX 

70 80 90 100 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 018 shows 84.5% identity over a 103 aa overlap with a predicted ORF (ORF 018.ng) 
from N. gonorrhoeae: 

m018/g018 

10 20 30 40 50 60 

m018.pep MQQRQLVGRI ACDEDMRNTGLHGQRVGNRYAARI FFD I DI FQTDI VNVRT AAHGCQHIFG 

til 11111:1 : : i 1 1 1 : 1 I 1 I I 1 : 1 i 1 I 1 i : 1 1 ! 1 : 11 1 1 1 1 1 I t 1 1 : : 11 1! t 1 i 
g018 MQQGQLVGRVARNKDMRNAGLHGQRIGNGYAARV7VDIDVFQTDI VNVRTATYGCQKI FG 

10 20 30 40 50 60 
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70 80 90 100 

m018. pep NKYAFFAI LLPMDFYIAVCIEFDLGFSIQMQFQFFAEKGVRLVX 

I I I I I I I I I I ! I I 1 I ! I II : I I I I I I I I I I M I I I : I I I I | I I 

g018 NKYAFFAI LLPMDFYI AVCVEFDLGFSIQMQFQFFSEHGFRLVX 

70 80 90 100 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 105>: 

g019.seq (partial) 

1 . - ctgctggcgg ccctggtgct tgccgcgtgt tcttcgACAA ACAcacTGCC 
51 AGCCGGCAAG ACCCCGGCAG ACAATATAGA AActgcCgAC CTTTCGGCAA 
101 GCGTTCCCAC ccgcCCTGCC GAACCGGAAG GAAAAACGCT GGCAGATTAC 
151 GGCGGCTACC CGTCCGCACT GGATGCAGTG AAACAGAACA ACGATGCGGC 
201 AGCCGCCGCC TATTTGGAAA Acgcaggaga cagCGcgatg gcGGAAAatg 
251 tccgcaagga gtgGCTGa 

This corresponds to the amino acid sequence <SEQ ID 106; ORF 019.ng>: 

g019.pep (partial) 

1 . . LLAALVLAAC SSTNTLPAGK TPADNIETAD LSASVPTRPA EPEGKTLADY 
51 GGYPSALDAV KQNNDAAAAA YLENAGDSAM AENVRKEWL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 107>: 

m019.seq (partial) 

1 ATGTACCTAC CCTCTATGAA GCATTCCCTG CCGCTGCTGG CGGCCCTGGT 

51 GCTTGCCGCG TGTTCTTCGA CAAACACACT GCCAGCCGGC AAGACCCCGG 

101 CAGACAATAT AGAAACTGCC GACCTTTCGG CAAGCGTTCC CACCCGCCCT 

151 GCCGAACCCG AAAGAAAAAC GCTGGCAGAT TACGGCGGCT ACCCGTCCGC 

201 ACTGGATGCA GTGAAACAGA AAAACGATGC CGCCGTCGCC GCCTATTTGG 

251 AAAACGCCGG CGACAGCGCG ATGGCGGAAA ATGTCCGCAA CGAGTGGCTG 

301 AAGTCTTTGG GCGCACGCAG ACAGTGGACG CTGTTTGCAC AGGAATACGC 

351 CAAACTCGAA CCGGCAGGGC GCGCCCAAGA AGTCGAATGC TACGCCGATT 

401 CGAGCCGCAA CGACTATACG CGTGCCGCTG AACTGGTCAA AAATACGGGC 

451 AAACTGCCTT CGGGCTGCAC CAAACTGTTG GAACAGGCAG CCGCATCCGG 

501 CTTGTTGGAC GGCAACGACG CCTGGAGGCG CGTGCGCGGA CTGCTGGCCG 

551 GCCGCCAAAC CACAGACGCA CGCAACCTTG CCGCCGCATT GGGCAGCCCG 

601 TTTGACGGCG GTACACAAGG TTCGCGCGAA TATGCCCTGT TGAACGTCAT 

6 51 CGGCAAAGAA GCACGCAAAT CGCCGAATGC CGCCGCCCTG CTGTCCGAAA 

701 TGGAAAGCGG TTTAAGCCTC GAACAACGCA GTTTCGCGTG GGGCGTATTG 

751 GGGCATTATC AGTCGCAAAA CCTCAATGTG CCTGCCGCCT TGGACTATTA 

801 CGGCAAGGTT GCCGACCGCC GCCAACTGAC CGACGACCAA ATCGAGTGGT 

851 ACGCCCGCGC CGCCTTGCGC GCCCGACGTT GGGACGAGCT GGCCTCCGTT 

901 ATCTCGCATA TGCCCGAAAA ACTGCAAAAA AGCCCGACCT GGCTCTACTG 

951 GCTGGCACGC AGCCGCGCCG CAACGGGCAA CACGCAAGAG GCG GAAAAAC 

1001 TTTACAAACA GGCGGCAGCG ACGGGCAGGA ATTTTTATGC GGTGCTGGCA 

10 51 GGGGAAGAAT TGGGTCGGAA AATCGATACG CGCAACAATG TGCCCGATGC 

'1101 CGGCAAAAAC AGCGTCCGCC GCATGGCGGA AGACGGTGCA GTCAAACGCG 

1151 CACTGGTACT GTTCCAAAAC AGCCAATCTG CCGGTGATGC AAAAATGCGC 

12 01 CGTCAGGCTC AGGCGGAATG GCGTTTTGCC ACACGCGGCT TTGACGAAGA 
1251 CAAGCTGCTG ACCGCCGCGC AAACCGCGTT CGACCACGGT TTTTACGATA 
1301 TGGCGGTCAA CAGCGCGGAA CGCACCGACC GCAAACTCAA CTACACCTTG 

13 51 CGCTATATTT CGCCGTTTAA AGACACGGTA ATCCGCCACG CGCAAAATGT 

14 01 TAATGTCGAT CCGGCTTGGG TTTATGGGCT GATTCGTCAG GAAAGCCGCT 
14 51 TCGTTATAGG CGCGCAATCC CGCGTAGGCG CGCAGGGGCT GATGCAGGTT 
1501 ATGCCTGCCA CCGCGCGCGA AATCGCCGGC AAAATCGGTA TGGATGCCGC 
1551 ACAACTTTAC ACCGCCGACG GG . . . 

This corresponds to the amino acid sequence <SEQ ID 108; ORF 01 9>: 

m019.pep (partial) 

1 MYLPSMKHSL PLLAALVLAA CSSTNT LPAG KTPADNIETA DLSASVPTRP 
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51 AEPERKTLAD YGGYPSALDA VKQKNDAAVA AYL-ENAGDSA MAENVRNEWL 

101 KSLGARRQWT LFAQEYAKLE PAGRAQEVEC YADSSRNDYT RAAELVKNTG 

151 KLPSGCTKLL EQAAASGLLD GNDAWRRVRG LLAGRQTTDA RNLAAALGSP 

201 FDGGTQGSRE YALLNVIGKE ARKSPNAAAL LSEME5GL5L EQRSFAWGVL 

2 51 GHYOSQNLNV PAALDYYGKV ADRRQLTDDO IEWYARAALR ARRWDELASV 

301 ISHMPEKLQK SPTWLYWLAR SRAATGNTQE AEKLYKQAAA TGRNFYAVLA 

351 GEELGRKIDT RNNVPDAGKN SVRRMAEDGA VKRALVLFQN SQSAGDAKMR 

401 RQAQAEWRFA TRGFDEDKLL TAAQTAFDHG FYDMAVNSAS RTDRKLNYTL 

4 51 RYISPFKDTV I RKAQNVNVD PAWVYGLIRQ ESRFVIGAQS RVGAQGLMQV 

501 MPATAREIAG KIGMDAAQLY TADG . . . 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 109>: 

a019 . seq 

1 ATGTACCCAC CCTCTCTGAA GCATTCCCTG CCGCTGCTGG TGGNCCTGGT 

51 GCTTGCCGCG TGTTCTTNGA CAAACACACT GTCAGCCGAC AAGACCCCGG 

101 CAGACAATAT AGAAACTGCC GACCTTTCGG CAAGCGTTCC CACCNGCCCT 

151 GCCGAACCCG AANGAAAAAC GTNGGCAGA" TACGGCGGCT ACCCGTCCGC 

201 ACTGGATGCA GTGAAACAGA AAAACGATGC CGCCGTCGCC GCCTATTTGG 

251 AAAACGCCGG CGACAGCGCG ATGGCGGAAA ATGTCCGCAA CGAGTGGCTG 

301 AAGTCTTTGG GCGCGCGCAG ACAGTGGACG CTGTNTGCAC ANGAATATGC 

3 51 NAAACTCGAA CCGGCANGGC GCGCCCAAGA AGTCGAATGC TACGCCGATT 

4 01 CGAGCCGCAA CGACTATACG CGTGCCGCCG AACTGGTCAA AAATACGGGC 
4 51 AAACTGCCTT CGGGCTGCAC CAAACTGTTG GAACAGGCAG CCGCATCCGG 
501 CTTGTTGGAC GGCAACGACG CCTGGAGGCG CGTGCGCGGA CTGCTGGCCG 
551 GCCGCCAAAC CACAGACGCA CGCAACCTTG CCGCCGCATT GGGCAGCCCG 
601 TTTGACGGCG GTACACAAGG TTCGCGCGAA TATGCCCTGT TGAACGTCAT 
651 CGGCAAAGAA GCACGCAAAT CGCCGAATGC CGCCGCCCTG CTGTCCGAAA 
701 TGGAAAGCGG TTTAAGCCTC GAACAACGCA GTTTCGCGTG GGGCGTATTG 
7 51 GGGCATTATC AGTCGCAAAA CCTCAATGTG CCTGCCGCCT TGGACTATTA 
801 NGGCAAGGTT GCCGACCGCC GCCAACTGAC CGACGACCAA ATCGAGTGGT 
851 ACGCCCGCGC CGCNNTNNGC NNNCGNNGTT NGNANGANNT GGCNNCCGNN 
901 ANCNCGNNNN TGCMNGANAA ACNNNNNNAN AGNCNNANNT NGNTNMANTG 
951 NNTGGCACGC AGCCGCGCCG CNACGGGCAA CACGCAANAN GCGGANAAAC 

1001 TNTACAAACA GGCGGCAGCA NCGGGCANGA ATTTTTATGC NGTGCTGNCN 

1051 GGGGAAGAGT TGGGGCGCAN AATCGATACG CGCAACAATG TGCCCGATGC 

1101 CGGCAAAANC AGCGTCCTCC GTATGGCGGA AGACGGCGCG ATTAAGCGCG 

1151 CGCTGGTGCT GTTCCGAAAC AGCCGAACCG CCGGCGATGC GAAAATGCGC 

1201 CGTCNGGCTC AGGCGGAATG GCGTTTCGCC ACACGCGGCT TCGATGAAGA 

1251 CAAGCTGCTG ACCGCCGCGC AAACCGCGTT CGACCACGGT TTTTACGATA 

1301 TGGCGGTCAA CAGCGCGGAA CGCACCGACC GCAAACTCAA CTACACCTTG 

1351 CGCTACATTT CGNNNNNTNA NGACACGGTA ATCCGCCACG CGCAAAATGT 

14 01 TAATGTCGAT CCGGCGTGGG TTTACGGGCT GATTCGTCAG GAAAGCCGCT 

14 51 TCGTTATGGG CGCGCAATCC CGCGTAGGCG CGCAGGGGCT GATGCAGGTT 

1501 ATGCCTGCCA CCGCGCGCGA AATCGCCGGC AAAATCGGTA TGGATGCCGC 

1551 ACAACTTTAC ACCGCCGACG GCAATATCCG TATGGGGACG TGGTATATGG 

1601 CGGACACCAA ACGCCGCCTG CAAAACAACG AAGTCCTCGC CACCGCAGGC 

1651 TATAACGCCG GTCCCGGCAG GGCGCGCCGA TGGCAGGCGG ACACGCCCCT 

1701 CGAAGGCGCG GTATATGCCG AAACCATCCC GTTTTCCGAA ACGCGCGACT 

17 51 ATGTCAAAAA AGTGATGGCC AATGCCGCCT ACTACGCCTC CCTCTTCGGC 

1801 GCGCCGCACA TCCCGCTCAA ACAGCGTATG GGCATTGTCC CCGCCCGCTG 

1851 A 

This corresponds to the amino acid sequence <SEQ ID 110; ORF 019.a>: 

a019 .pep 

1 MYPPSLKHSL PLLVXLVLAA CSXTNT LSAD KTPADWIETA DLSASVPTXP 

51 AEPEXKTXAD YGGYPSALDA VKQKNDAAVA AYLENAGDSA MAENVRNEWL 

101 KSLGARRQWT LXAXEYAKLE PAXRAQEVEC YADSSRNDYT RAAELVKNTG 

151 KLPSGCTKLL EQAAASGLLD GNDAWRRVRG LLAGRQTTDA RNLAAALGSP 

201 FDGGTQGSRE YALLNVIGKE ARKSPNAAAL LSEMESGLSL EQRSFAWGVL 

251 GHYQSQNLNV PAALDYXGKV ADRRQLTDDQ IEWYARAAXX XRXXXXXAXX 

301 XXXXXXKXXX XXXXXXXXAR SRAATGNTQX AXKLYKQAAA XGXNFYAVLX 

351 GEELGRXIDT RNNVPDAGKX SVLRMAEDGA IKRALVLFRN SRTAG DAKMR 

4 01 RXAQAEWRFA TRGFDEDKLL TAAQTAFDHG FYDMAVNSAE RTDRKLNYTL 
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4 51 RYISXXXDTV IRHAQNVNVD PAWVYGLIRC ESRFVMGAQS RVGAQGLMQV 

501 MPATAREIAG KIGMDAAQLY TADGNIRMGT WYMADTKREL QNNEVLATAG 

551 YNAGPGRARR WQADTPLEGA VYAETIPFSE TRDYVKKVMA NAAYYASLFG 

601 APHIPLKQRM GIVPAR" 

m0i9/a0i9 88.9% identity over a 524 aa overlap 



10 20 30 40 50 60 

m019 . pep MylpsmkHSLPLLAALVLAACSSTNTLPAGKTPADNIETADLSASVPTRPAEPERKTLAD 
II It : I (I I I I I : I I i I I I I Ml! I I M i I i I I I I ! i ! I I I I I I M I I II II 
a019 MYPPSLKHSLPLLVXLVIjAACSXTNTLSADKTPADNIETADLSASVPTXPAEPEXKTXAD 
10 20 30 40 50 60 



70 80 90 100 110 120 

mO 1 9 . pep YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLFAQEYAKLE 
I I I I ! II I I I i I I I i I I I I I II I I I I I I M I I t i ! I i I I I I I I I I I 1 I f I I I I I I I I I 
a019 YGGYPSALDAVKQKNDAAVAAYLENAGDSAMAENVRNEWLKSLGARRQWTLXAXEYAKLE 

70 30 90 100 110 120 



130 140 150 160 170 130 

m019 . pep PAGRAQEVECYADSSRNDYTRAAELVKNTGKLPSGCTKLLEQAAASGLLDGNDAWRRVRG 
II I I I I I II I I II i I I I 1 I I I I I I I I I I I I II I I I I I I I M I I I I I I II I II I I I I I I I 
a019 PAXRAQEVECYADSSRNDYTRAAELVKNTGKLPSGCTKLLEQAAASGLLDGNDAWRRVRG 

130 140 150 160 170 180 



190 200 210 220 230 240 

m019 .pep LLAGRQTTDARNLAAALGSPFDGGTQGSREYALLNVIGKEARKSPNAAALLSEMESGLSL 
i i \ I M I I I 1 M I i I 1 M i M S I i I II \ \\ \ II II I I I M 1 I 1 M I I I i M ', M I 1 ! I M 
a019 LLAGRQTTDARNLAAALGSPFDGGTQGSREYALLNVIGKEARKSPNAAALLSEMESGLSL 

190 200 210 220 230 240 



250 260 270 280 290 300 

m019 . pep EQRSFAWGVLGHYQSQNLNVPAALDYYGKVADRRQLTDDQIEWYARAALRARRWDELASV 

I I I M I I II I I ! II I I I I I I I I I I I I I I i I i I I ! I I I I I I I I I II 11 I I 
aOlS EQRSFAWGVLGHYQSQNLNVPAALDYXGKVADRRQLTDDQIEWYARAAXXXRXXXXXAXX 

250 260 270 280 290 300 



310 320 330 340 350 360 

mOl 9 . pep ISHMPEKLQKSPTWLYWLARSRAATGNTQEAEKLYKQAAATGRNFYAVLAGEELGRKIDT 
I : I I I I I I I I I I I I MINIM:! MINI M i II I Ml 

a019 XXXXXXKXXXXXXXXXXXARSRAATGNTQXAXKLYKQAAAXGXNFYAVLXGEELGRXIDT 
310 320 330 340 350 360 



370 380 390 400 410 420 

mOl 9 . pep RNNVPDAGKNSVRRMAEDGAVKRALVLFQMSQSAGDAKMRRQAQAEWRFATRGFDEDKLL 
I I I II i I II II f M II II M i I I I I I M I : M I I II M I I M II M I I II M I I I M 
a019 RNNVPDAGKXSVLRMAEDGAIKRALVLFRNSRTAG DAKMRRXAQAEWRFATRGFDEDKLL 

370 380 390 400 410 420 



430 440 450 460 470 480 

m019.pep TAAQTAFDHGFYDMAVNSAERTDRKLNYTLRYI SPFKDTVIRHAQhiVNVDPAWVYGLIRQ 

M I M M I II II M M M I I II I I M I I I I I M I M I! II I M I I II I M I M M li 
a019 TAAQTAFDHGFYDMAVNSAERTDRKLNYTLRYISXXXDTVIRHAQNVNVDPAWVYGLIRQ 

430 440 450 4 60 470 480 



490 500 510 520 

m019.pep ESRFVIGAQSRVGAQGLMQVMPATARE1AGKIGMDAAQLYTADG 
M M M M M M M M II M M I I M M II M I M M 11 M I M 
a019 ESRFVMGAQSRVGAQGLMQVMPATAREIAGKIGMDAAQLYTADGNIRMGTWYMADTKRRL 
490 500 510 520 530 540 



a019 QNNEVLATAGYNAGPGRARRWQADTPLEGAVYAETIPFSETRDYVKKVMANAAYYASLFG 
550 560 570 580 590 600 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORE from N. gonorrhoeae 

ORF 019 shows 95.5% identity over a 89 aa overlap with a predicted ORF (ORF 019.ng) 
from /V. gonorrhoeae: 

g019/m019 

10 20 30 40 49 

g019 . pep LLAALVLAACSSTNTLPAGKTPADNIETADLSASVPTRPAEPEGKTLAD 

MiiiiimmmMMiimiiMiiMiiimim imm 

m0 1 9 MYLPSMKHSLPLLAALVLAACSSTNTLPAGKTPADNI ETADLSASVPTRPAEPERKTLAD 

10 20 30 40 50 60 

50 60 70 80 89 

g 0 1 9 . pep YGGY PSALDAVKQNNDAAAAAYLENAGDS AMAENVRKEWL 

IIIMMIIMIhillhllMIIIIMilllllhlll 

m019 YGGYPSALDAVKQKNDAAVAAYI "NAGDSAMAENVRNEWLKSLGARRQWTLFAQEYAKLE 

70 80 90 100 110 120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 1 1>: 

g023 . seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GCGATTGGGT 

51 AATGCAGCGT GCGACTGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTGCCCTG CCTAAAGAAT ATCCGGCATG GCAGGCATTT 

151 TTTAGTCAAG CTTGGGTAAA AGTATTTACC CAAGTGAGCT TTATCGCCGT 

2 01 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATCA 
251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT TGtctGGCTG 
301 GTGGGCTGCC TCGTGTATTC AGTTAAAGTG ATTTGGGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 112; ORF 023. ng>: 

g023 -pep 

1 MVERKLTGAH YGLRDWVMQR ATA V IMLIYT VALLWLFAL P KEYPAWQAF 

51 FSQAWVKVFT QVSFIAVFLH AWVGIRDLWM DYIKPFGVR L r LQVATIVWL 

101 VGCLVYSVKV IWG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 13>: 

m0 2 3 . seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GCGATTGGGT 

51 GATGCAACGT GCGACTGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTTCCCTG CCTAAAGAAT ATTCGGCATG GCAGGCATTT 

151 TTTAGTCAAA CTTGGGTAAA AGTATTTACC CAAGTGAGCT TCATCGCCGT 

201 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATCA 

251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT CGTTTGGCTG 

3 01 GTCGGCTGTC TCGTGTATTC AGTTAAAGTG ATTTGGGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 114; ORF 023>: 

m023 -pep 

1 MVERKLTGAH YGLRDWVMQR ATA VI MLIYT VALLWLFSL P KEYSAWQAF 

51 FSQTWVKVFT QVSFIAVFLH AWVGIRDLWM DYIKPFGVR L FLQVATIVWL 

101 VGCLVY SVKV IWG* 

The following partial DNA sequence was identified in /V. meningitidis <SEQ ID 1 15>: 

a023. seq 

1 ATGGTAGAAC GTAAATTGAC CGGTGCCCAT TACGGTTTGC GGGATTGGGC 

51 GATGCAACGT GCGACCGCGG TTATTATGTT GATTTATACC GTTGCACTTT 

101 TAGTGGTTCT ATTTGCTCTG CCTAAAGAAT ATTCGGCATG GCAGGCATTT 

151 TTTAGTCAAA CTTGGGTAAA AGTATTTACC CAAGTGAGCT TCATCGCCGT 
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201 ATTCTTGCAC GCTTGGGTGG GTATCCGCGA TTTGTGGATG GACTATATNA 
251 AACCCTTCGG CGTGCGTTTG TTTTTGCAGG TTGCCACCAT CGTCTGGCTG 
30.1 GTCGGCTGCT TGGTGTATTC AATTAAAGTA ATTTGGGGGT AA 

This corresponds to the amino acid sequence <SEQ ID 116; ORF 023. a>: 

a 02 3 .pep 

1 t-'VERKLTGAH YGLRDWAMQR ATA V IMLIYT VALLVVLFAL P KEYSAWQAF 

51 FSQTWVKV FT QVSFIAVFLH AWVGI RDLWM DYXKPFGVR L FLQVAT I VWL 

101 VGCLVY SIKV IWG* 

m023/a023 96.5% identity ever a 1 1 3 aa overlap 

10 20 30 40 50 60 

m023 .pep MVERKLTGAHYGLRDWVMQRATAVIMLIYTVALLVVLFSLPKEYSAWQAFFSQTWVKVFT 
I I I I I t I I I I I I I I ! t : I I I I! I I I I I I ! I I 1 M II I I :! I I ! I [ I ! II I ! I I I I I I I M 
a02 3 MVERKLTGAHYGLRDWAMQRATAVIMLTYTVALLVVLFALPKEYSAWQAFFSQTWVKVFT 
10 20 30 40 50 60 

70 80 90 100 110 

m02 3 .pep QVSFIAVFLhAWVGIRDLWMDYIKPFGVRLFLQVATIVWLVGCLVYSVKVIWGX 
I I I I I I I I I I M I I I I I I 1 I I I I I I I I I ! I I I I I I I M I I I I If I I : I I I 11 I 
a023 Q V S F I AV FLHAW V GIRDLWMDYXKPFGVRL FLQVAT I V W L V G C L V Y S I K V I WGX 

70 80 90 100 110 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from 7V. gonorrhoeae 

ORF 023 shows 97.3% identity over a 1 13 aa overlap with a predicted ORF (ORF 023. ng) 
from N. gonorrhoeae: 

g023/m023 

10 20 30 40 50 60 

g023 .pep MVERKLTGAHYGLRDWVMQRATAVIMLIYTVALLVVLFALPKEYPAWQAFFSQAWVKVFT 
IIIIIIIIIIIIIIIIIIIIIIIIIMiMIIIMIIhlllll IIIMIIhllllli 
TT102 3 MVERKLTGAHYGLRDWVMQRATAVIMLIYTVALLWLFSLPKEYSAWQAFFSQTWVKVFT 

10 20 30 40 50 60 

70 80 90 100 110 

g 0 2 3 . pep QVS F I AVFLHAWVG I RDLWMDY I KPFGVRL FLQVAT I VWLVGCLVYS VKV I WGX 

IMIIIIIMIIIIMIIIIIIIIIIIIIIIIMIII1IIIIIIIIII1IIIII 

m023 QVS F I AVFLHAWVG I RDLWMDY I KPFGVRLFLQVAT I VWLVGCLV YSVKV I WGX 

70 80 90 100 110 



The following partial DNA sequence 



g025 . seq 






1 


AT G TTG AAA C 


AAAcgACACT 


51 


GTTGGGCGGT 


TGcgCCACCC 


101 


CAGGTATGCA 


GACCGTATCG 


151 


ACGCCGTACA 


ATGCCGCTCC 


201 


CGTGCAAact 


gcgccggttT 


251 


CACCTGCCGT 


TTCGGgtaca 


301 


aacgCGGCGa 


cgCataCTAT 


351 


caaAcgCtac 


CATATCTCTC 


401 


CCGACAATAC 


GTTGAGCATC 


451 


TATGCCGCAC 


CGAAAACCGC 


501 


TGCCGCGCAA 


ACCCCTGTGA 


551 


CGCCGCAACC 


TGCCGCGCCC 


601 


GCGCCCGCCC 


CGCAATCTCC 


651 


CGGCGGCATT 


GTTTGGCAGC 



identified in N. gonorrhoeae <SEQ ID 1 1 7>: 

TTTGGCAGCT TGTACCGCCG TTGCCGCTCT 
AACAGCCTGC TccTGTCATT GCAGG CAATT 
TCTGCGCCGG TTTACAATCC TTATGGCGCA 
TGCCGCCAac gatgcGCCgT ATGTGCCGCC 
ATTCGCCTCC TGCTTATGTT CCGCcgtCTG 
tatgtTCCTT CTTACGCACC CgtcgACATC 
TGTGCGTGGC GACACgGcgt acaACATTTc 
AAGACGATTT CCGTGCGTGG AACGGCATGA 
T^TCAGATTG TTAAAGTCAA ACCGGCaggA 
AGCCGTAGAA AGCAGGCCCG CCGTACCGGC 
AACCCGCCGC gcaACCGCCC GTTCAGTCCG 
GCTGCGGAAA ATAAAGCGGT TCCCGCCCCC 
TGCCGCTTCG CCTTCCGGCA CGCGTTCGGT 
GTCCGACCCA AGGTAAAGTG GTTGCCGATT 
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701 


TCGGCGGCGG 


CAACAAGGGT 


GTCGATATTG 


CCGGCAATGC 


CGGACAACCC 


751 


GTTTTGGCGG 


CGGCTGACGG 


CAAAGTGGTT 


TATGCCGGTT 


CAGGTTTGAG 


801 


GGGATACGGA 


AACTTGGTCA 


TCATCCAGCA 


CAATTCCTCT 


TTCCTGACCG 


851 


CGTACGGGCA 


CAACCAAAAA 


TTGCTGGTCG 


GCGAAGGTCA 


GCAGGTCAAA 


901 


CGCJGTCAGC 


AGGTTGCTTT 


GA~GGGTAAT 


ACCGATGCTT 


CCAGAACGCA 


951 


GCTTCATTTC 


GAGGTGCGTC 


AAAACGGCAA 


ACCGGTTAAC 


CCGAACAGCT 


1001 


ATATCGCGTT 


CTGA 









This corresponds to the amino acid sequence <SEQ ID 118; ORF 025. ng>: 

g025 .pep 

1 MLKQTTLLAA CTAVAALLGG CATQQPAPVI AGNSGMQTVS SAPVYNPYGA 

51 TPYNAAPAAN DAPYVPPVQT APVYSPPAYV PPSAPAVSGT YVPSYAPVDI 

101 NAATHTIVRG DTVYNISKRY HISQDDFRAW NGMTDNTLSI GQIVKVKPAG 

151 YAAPKTAAVE SRPAVPAAAQ TPVKPAAQPP VQSAPQPAAP AAENKAVPAP 

201 APAPQSPAAS PSGTRSVGGI VWQRPTQGKV VADFGGGNKG VDIAGNAGQP 

251 VLAAADGKW YAGSGLRGYG NLVIIQHNSS FLTAYGHNQK LLVGEGQQVK 

301 RGQQVALMGN TDASRTQLHF EVRQNGKPVN PNSYIAF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 19>: 

m025.seq (partial) 

1 . . GTGCCGCCGG TGCAAAGCGC GCCGGTTTAT ACGCCTCCTG CTTATGTTCC 

51 GCCGTCTGCA CCTGCCGTTT CGGGTACATA CGTTCCTTCT TACGCACCCG 

101 TCGACATCAA CGCGGCGACG CATACTATTG TGCGCGGCGA CACGGTGTAC 

151 AACATTTCCA AACGCTACCA TATCTCTCAA GACGATTTCC GTGCGTGGAA 

201 CGGCATGACC GACAATACGT TGAGCATCGG TCAGATTGTT AAAGTCAAAC 

251 CGGCAGGATA TGCCGCACCG AAAGCCGCAG CCGTAAAAAG CAGGCCCGCC 

301 GTACCGGCTG CCGCGCAACC GCCCGTACAG TCCGCACCCG TCGACATTAA 

3 51 CGCGGCGACG CATACTATTG TGCGCGGCGA CACGGTGTAC AACATTTCCA 

4 01 AACGCTACCA TATCTCTCAA GACGATTTCC GTGCGTGGAA CGGCATGACC 

4 51 GACAATATGT TGAGCATCGG TCAGATTGTT AAAGTCAAAC CGGCAGGATA 

5 01 TGCCGCACCG AAAACCGCAG CCGTAGAAAG CAGGCCCGCC GTACCGGCTG 
551 CCGTGCAAAC CCCTGTGAAA CCCGCCGCGC AACCGCCTGT GCAGTCCGCG 
601 CCGCAACCTG CCGCGCCCGC TGCGGAAAAT AAAGCGGTTC CCGCGCCCGC 
651 CCCGCAATCT CCTGCCGCTT CGCCTTCCGG CACGCGTTCG GTCGGCGGCA 
701 TTGTTTGGCA GCGTCCGACG CAAGGTAAAG TGGTTGCCGA TTTCGGCGGC 
751 AACAACAAGG GTGTCGATAT TGCCGGTAAT GCGGGACAGC CCGTTTTGGC 
801 GGCGGCTGAC GGCAAAGTGG TTTATGCCGG TTCAGGTTTG AGGGGATACG 
851 GAAACTTGGT CAT CAT CC AG CATAATTCTT CTTTCCTGAC CGCATACGGG 
901 CACAACCAAA AATTGCTGGT CGGCGAGGGG CAGCAGGTCA AACGCGGTCA 
951 GCAGGTTGCT TTGATGGGCA AT AC CG AT GC TTCCAGAACG CAGCTTCATT 

1001 TCGAGGTGCG TCAAAACGGC AAACCGGTTA ACCCGAACAG CTATATCGCG 

1051 TTCTGA 

This corresponds to the amino acid sequence <SEQ ID 110; ORF 025>: 

m025.pep (partial) 

1 . .VPPVQSAPVY TPPAYVPPSA PAVSGTYVPS YAPVDINAAT HTIVRGDTVY 

51 NISKRYHISQ DDFRAWNGMT DNTLSIGQIV KVKPAGYAAP KAAAVKSRPA 

101 VPAAAQPPVQ SAPVDINAAT HTIVRGDTVY NISKRYHISQ DDFRAWNGMT 

151 DNMLSIGQIV KVKPAGYAAP KTAAVESRPA VPAAVQTPVK PAAQPPVQSA 

201 PQPAAPAAEN KAVPAPAPQS PAASPSGTRS VGGIVWQRPT QGKWADFGG 

2 51 NNKGVDIAGN AGQPVLAAAD GKWYAGSGL RGYGNLVIIQ KNSSFLTAYG 

3 01 HNQKLLVGEG QQVKRGQQVA LMGNTDASRT QLKFEVRQNG KPVNPNSYIA 
351 F* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 1>: 

a025 . seq 

1 ATGTTGACAC CAACAACACT TTAGGTAGCT TGTACCGCCC TTGCCGCTCA 

51 GTTGGGCGGA TGCCCCACCC AACACCCTTC TCCTGTCATT GCAGGCAATT 

101 CAGGTATGCA GACCGTACCG TCTGCGCCGG TTTACAATCC TTATGGCGCA 
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151 


APP.PCGTAPA 


201 


GGTGCAAAGC 


251 


p APPTPPPHT 


3 01 


AAPGPGGCG A 


351 


CAAATGCTAC 




CCGACAATAC 


4 51 


TATGCCGCAC 


50 1 


TGPPGPGPAA 


551 


PHPATAPTAT 


601 


P AT ATPTPTP 


651 


GTTGAGCATC 


701 


CGAAAGCCGC 


751 


ACCCCTGTGA 


801 


TGCCGCGCCC 


851 


CTCCTGCCGC 


901 


CAGCGTCCGA 


951 


GGGTGTCGAT 


1001 


ACGGCAAAGT 


1051 


GTCATCATCC 


1101 


AAAATTGCTG 


1151 


CTTTGATGGG 


1201 


CGGCAAAACG 



ATGCCGCTCC 
GCGGCGGTTT 
TTCGGGTACA 
CCCATACTAT 
CATATCTCTC 
GTTGAGCATC 
CGAAAGCCGC 
CCGCTCGTAC 
TGTGCGCGGC 
AAGACGATTT 
GGTCAGATTG 
AGCCGTAAAA 
AACCCGCCGC 
GCTGCGGAAA 
TTCGCCTTCC 
CGCAAGGTAA 
ATTGCAGGAA 
GGTTTATGCA 
AGCATAATTC 
GTCGGCGAAG 
CAATACCGAG 
GCAAACCGGT 



TGCCGCCAAC 
ATANGCCTCC 
TACGTTCCTT 
TGTGCGCGGC 
AAGACGATTT 
GGTCAGATTG 
AGCCGTAAAA 
AGTCCGCACC 
GACACGGTGT 
CCGTGCGTGG 
TTAAAGTCA,, 
AGCAGGCCCG 
GCAACCGCCT 
ATAAAGCGGT 
GGCACGCGTT 
AGTGGTTGCC 
ATGCGGGACA 
GGTTCCGGTT 
TTCCTTCCTG 
GCCAGCAGGT 
GCTTCTAGAA 
TAATCCGAAC 



GATGCGCCGT 
TGCTTATGTT 
CTTACGCANC 
GACACGGTGT 
CCGTGCGTGG 
TTAAAGTCAA 
AGCAGGCCCG 
CGTCGACATC 
ACAACATTTC 
AACGGCATGA 
ACCGGCAGGA 
CCGTACCGGC 
GTGCAGTCCG 
TCCCGCGCCC 
CGGTCGGCGG 
GATTTCGGCG 
GCCCGTTTTG 
TGAGGGGATA 
ACCGCATACG 
CAAACGCGGG 
CGCAGCTTCA 
AGCTATATCG 



ATGTGCCGCC 
CCGCCGTCTG 
CGTCGACATC 
ACAAGATTTC 
AACGGCATGA 
ACCGGCAGGA 
CCGTACCGGC 
AACGCGGCGA 
CAAACGCTAC 
CCGACAATAC 
TATGCCGCAC 
TGCCGTGCAA 
CGCCGCAACC 
GCCCCGCAAT 
CATTGTTTGG 
GCAACAACAA 
GCGGCGGCTG 
CGGCAATTTG 
GGCACAACCA 
CAGCAGGTCG 
TTTCGAGGTG 
CGTTCTGA 



This corresponds to the amino acid sequence <SEQ ID 112; ORF 025. a>: 

a025 . pep 

1 MLTPTTL*VA CTALAAQLG G CPTQHPSPVI AGMSGMQTVP SAPVYNPYGA 

51 TPYNAAPAAN DAPYVPPVQS APVYXPPAYV PPSAPAVSGT YVPSYAXVDI 

101 NAATHTIVRG DTVYKISKCY HISQDDFRAW NGMTDNTLSI GQIVKVKPAG 

151 YAAPKAAAVK SRPAVPAAAQ PLVQSAPVDI NAATHTIVRG DTVYNISKRY 

201 HISQDDFRAW NGMTDNTLSI GQIVKVKPAG YAAPKAAAVK SRPAVPAAVQ 

251 T D VK?AAQPP VQSAPQPAAP AAENKAVPAP APQSPAASPS GTRSVGGIVW 

301 QRPTQGKWA DFGGNNKGVD IAGNAGQPVL AAADGrCVVYA GSGLRGYGNL 

351 VIIQHNSSFL TAYGHNQKLL VGEGQQVKRG QQVALMGNTE ASRTQLHFEV 

4 01 RQNGKPVNPN SYIAF* 



m025/a025 



97.4% identity over a 351 aa overlap 



10 20 30 

m025.pep VPPVQSAPVYTPPAYVPPSAPAVSGTYVPS 

I ! I i I I i I I i : I I t i ! I I I ! I I M I II I M 
a02 5 GMQTVPSAPVYNPYGATPYNAAPAANDAPYVPPVQSAPVYXPPAYVPPSAPAVSGTYVPS 
40 50 60 70 80 90 

40 50 60 70 80 90 

m025.pep YAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMTDNTLSIGQIVKVKPAGYAAP 
i I I I I I I I I I i I I I ! 1 I i I : I I i I ! I I I i I I ! I I I I I I I I I I I I I I I I I M 1 M I I I I 
a025 YAXVDINAATHTIVRGDTVYKISKCYHISQDDFRAWNGMTDNTLS IGQI VKVKPAGYAAP 

100 110 120 130 140 150 

100 110 120 130 140 150 

m025.pep KAAAVKSRPAVPAAAQPPVQSAPVDINAATKTIVRGDTVYNISKRYKISQDDFRAWNGMT 
I I I I I I ! I I I I It I M I II I 1 I I I I I I I I I [ i I I I I 1 I I I 1 I I I I | I I | | | | I | ! | | | | 
a02 5 KAAAVKSRPAV PAAAQPLVQSAPVDINAATHTIVRGDTVYNISKRYHISQDDFRAWNGMT 

160 170 180 190 200 210 

160 170 180 190 200 210 

m025 . pep DNMLSIGQIVKVKPAGYAAPKTAAVESRPAV PAAVQTPVKPAAQPPVQSAPQPAAPAAEN 

I I I I I I I I I I I I I M I 1 i I I : I I I : i I I ! I I I I ! 1 I I I I I I I ! I II t I [ I I 1 t 1 I I I I I 
a025 DNTLSIGQIVKVKPAGYAAPKAAAVKSRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 
220 230 240 250 260 270 



220 230 240 

mO 2 5 . pep KAVPA PAPQS PAAS PSGTRS VGG I VWQRPTQGKV* 

1 I II I I I I I I I I I I I I I I I I t I I I ! I I I I I I I I i 



250 260 270 

'ADFGGNNKGVDIAGNAGQPVLAAAD 
I I 1 M I I I i I I i I I I I II I I I I I | j 
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a02 5 KAVPAPAPQSPAASPSGTRSVGGIVWQRPTQGKWADFGGNNKGVDIAGNAGQPVLAAAD 
280 290 300 310 320 330 

280 290 300 310 320 330 

m02 5.pep GKVVYAGSGLRGYGNLVI IQHNSSFLTAYGHNQKLLVGEGQQVKRGQQVALMGNTDASRT 

M I I I I I I I I I i I I j I I 1 I I I 1 I | | I j ( I ! I | ! | | || | | | | ! | | | | | | j i | | | j | ; | j J | 

a02 5 GKVVYAGSGLRGYGNLVI IQHNSSFLTAYGHNQKLLVGEGOQVKRGQOVALMGNTEASRT 

340 350 360 370 360 390 



340 350 
m02 5 . pep QLHFEVRQNGKPVNPNSYIAFX 
I 1 I I I I I I ! I I I I I I ! I I I i I ! 
a 02 5 QLHFEVRQNGKPVNPNSYIAFX 
400 410 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 025 shows 75.6% identity over a 353 aa overlap with a predicted ORF (ORF 025. ng) 
from N. gonorrhoeae: 

m025/g025 



10 20 30 

mO 2 5 . pep VPPVQSAPVYTPPAYVPPSAPAVSGTYVPS 

! i ! I p I I I h ! I I ! I ! ! I I i I 1 I I i I I i I 

g02 5 GMQTVSSAPVYNPYGATPYNAAPAANDAPYVPPVOTAPVYSPPAYVPPSAPAVSGTYVPS 
40 50 60 70 80 90 



40 50 60 70 80 90 

m02 5 - pep YAPVD I NAATHT I VRGDTVYN I S KRYH I SQDD FRAWNGMTDNTLS I GQ I VKVKPAG YAAP 

I M I 1 1 I I ! 1 1 1 1 I !! M I I 1 1 II 1 1 1 1 I II I 1 1 I I I 1 1 M 1 1 I i 1 1 1 1 I 1 1 I I I I 1 1 j | 

902 5 YAPVD I NAATHT I VRGDTV YNI S KRYHISQDDFRAWNGMTDNTLS I GQ I VKVKPAG YAAP 

100 110 120 130 140 150 

100 110 120 130 140 150 

m025 .pep KAAAVKSRPAVPAAAQPPVQSAPVD I NAATHT I VRGDTV YNI S KRYH I SQDDFRAWNGMT 

1 

g025 K 



160 170 180 190 200 210 

m025 .pep DNMLSIGOIVKVKPAGYAAPKTAAVESRPAVPAAVQTPVKPAAQPPVQSAPQPAAPAAEN 

I I I I I I I I I I I I I : I I I I I I 1 I I I I I I I I I I | I | | | | | | 

9025 TAAVESRPAVPAAAQTPVKPAAQ? PVQSAPQPAAPAAEN 

160 17C 180 190 



220 230 240 250 260 

m025 .pep KAVP AP A P - - QS P AAS P S GTRS VGG I VWQR PTQGKWAD FGGNNKGVD I AGN AGQ P VLAA 

MINIM MMMIIIMIMMIIMIMIIMIIMIMIMMIIiMIIMM 

g02 5 KAVPAPAPAPQSPAASPSGTRSVGGIVWQRPTQGKWADFGGGNKGVDIAGNAGQPVLAA 
200 210 220 230 240 250 



270 280 290 300 310 320 

m02 5 . pep ADGKWYAGSGLRGYGNLV I IQHNS S FLTAYGHNQKLLVGEGQQVKRGQQVALMGNTDAS 

M 1 1 1 1 1 1 ! M M ! 1 1 i 1 1 I M 1 1 1 M I I i j 1 1 1 j 1 1 1 I i i I ) I ! II 1 1 1 1 II I 1 1 1 1 M 

g 0 2 5 ADGKWYAGSGLRGYGNLV 1 1 QHNS S FLT A YGHNQ KL L VGEGQQ VKRGQQVALMGNTD AS 

260 270 280 290 300 310 



m025 . pep 



330 340 350 

RTQLHFEVRQNGKPVNPNSYIAFX 
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lllillllllllllllllllilM 

g02 5 RTQLHFEVRQNGKPVNPNS Y I AFX 

320 330 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 13>; 

g031 . seq 

1 ATGGTGTCCC TCCGCTTCAG ATTCGGCAAC CACTTTAAAC GCCGACATTC 

51 TGACAATTTC CTTTTCCGCC AGCCAAATAT CATGCGTATC TTTCGGTTCG 

101 GGCTTGTTGG GCATGGCAAC CTTCAACAGC 7GCGCCATCA CAGGAATCGT 

151 CGTTCCCTGA A7CAGCAGCG ACAGCACCAC CACGGCAAAC GCCACATCAA 

201 ACAGCAGGTG CGAATTGGGA ACGCCCATCA CCAGCGGCAT CATCGCCAGC 

251 GAAATCGGTA CGGCTCCTCG CAAGCCCAAC CAACTGATAT ACGCCTTTTC 

3 01 ACGCAGGCTG TAATTGAATT TCCACAAACC GCCGAACACT GCCAGCGGAC 
351 GCGCGACCAG CATCAGGAAC GCCGCAATCG CCAAGGCTTC CGCCGCCCTG 

4 01 TCCAACACGC CGGCGGGAGA AACCAGCAGA CCGAGCATGA CGAACAAAGT 
4 51 TGCCTGCGCC AGCCAAGCCA AACCGTCCAT CACACGCAAA ACGTGTTCCG 
501 TcgcACGGTT GCGCTGGTTA CCGACAATGA TGCCGGCAAG GTAAACCGCC 
551 AAAAAGCCGC TGCCGCCTAT GGTATTGGTA AACGCAAACA CAAGCAGCCC 
601 GCCCGACACA A7TATCAGCG CGTACAGACC TTCCGtacac acctccaatt 
651 cccaatcaac gtcatagctg tctcccgtgt taaaatgttc ttcacttcag 
701 aatccccccc ttcttcccag cccgaaacct tcatgtgtta naccccgggg 
751 tgccccaacg gatttagtaa cctcccaatg acictgcttc tcgccccctt 
801 cgcccgcttt ciccttccgg gaaaacttgt tgtccccgtc ttacatcaa 

This corresponds to the amino acid sequence <SEQ ID 114; ORF 031.ng>: 

g031 .pep 

1 MVSLRFRFGN HFKRRHSDNF LFRQPNIMRI FRFGLVGKGN LQQPRHHRNR 

51 RSLNQQRQHH HGKRHIKQQV RIGNAHHQRH HRQRNRYGSS QAQPTDIRLF 

101 TQAVIEFPQT AEHCQRTRDQ HQERRNRQGF RRPVQHAGGR NQQTEHDEQS 

151 CLRQPSQTVH HTQNVFRRTV ALVTDN DAGK VNRQKAAAAY GIGKRKHKQP 

201 ARHNHQRVQT FRTHLQFPIN VIAVSRVKMF FTSESPPSSQ PETFMCXTLG 

2 51 CPNGFSNLPM TLLVAPFARF LLPGKLV VPV LH * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 15>: 

m031 . seq (partial) 

1 . . . CGCCTGAAGC ACGGTGTCGG ACTGCATTTC TATTCGGCTA TACGCCTTTT 

51 CACGCAGGCT GTAATTGAAT TTCCACAAAC CGCCGAACAC TGCCGACGGA 

101 CGCGCGACCA GCATCAGGAA CGCCGCAATC GCCAAgGCTT CCGCCGCCCT 

151 GTCCAACACG TTGGCAGGAG AAACCAGCAG CAAAGGCATT CCCAAACGTG 

201 CGGACAAAGT GGTCGAAACC ACGCTCAGAA ACAACAGTGC GCCACCCGGC 

251 AG 

This corresponds to the amino acid sequence <SEQ ID 116; ORF 03 1>: 

m031.pep (partial) 

1 . . . RLKHGVGLKF YSAIRLFTQA VIEFPQTAEK CRRTRDQHQE RRNRQG FRRP 
51 VQHVGRRNQQ QRHSQTCGQS GRN HAQKQQC ATRQ 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 7>: 

a031 . seq 

1 ATACGCCTTT TCACGCAGGC TGTAATTGAA TTTCCACAAA CCGCCGAACA 

51 CTGCCGGCGG ACGCGCGACC AGCATCAGGA ACGCCGCAAT CGCCAAGGCT 

101 TCCGCCGCCC CGTCCAACAC GTTGGCAGGA GAAACCAGCA GCAAAGGCAT 

151 TCCCAAACGT GCGGACAAAG TGGTCGAAAC CACGCTCAGA AACAACAGTG 

201 CGCCACCCGG CAG 

This corresponds to the amino acid sequence <SEQ ID 118; ORF 03 1 .a>: 

a031.pep (partial) 

1 IRLFTQAVIE FPQTAEHCRR TRDQHQERRN RQG FRRPVQH VGRRNQQQRH 
51 SQTCGQSGRN HAQKQQCATR Q 
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m03i/a03i 1 00.0% identity over a 7 1 aa overlap 

10 20 30 40 50 60 

m031 .pep RLKHGVGLHFYSATRLFTQAVIEFPOTAEHCRRTRDQH0ERRNRQGFRRPVQHVGRRNOQ 

i ! M i I I I M I II I M i I I If I I I I I I I I I I f I I I I I I I t I II t I I I 
a 031 IRLFTQAVIEFPQTAEHCRRTRDQHQERRNRQGFRRPVQHVGRRNQQ 

10 20 30 40 

70 80 
m031 . pep QRHSQTCGQSGRNHAQKQQCATRQ 
I I I ! I I I II ! f I I I I I I II I I I I I 
a031 QRHSQTCGQSGRNHAQKQQCATRQ 
50 60 70 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N, gonorrhoeae 

ORF 031 shows 60.0% identity over a 85 aa overlap with a predicted ORF (ORF 031 .ng) 
from N. gonorrhoeae: 

m031/g031 

10 20 30 

m031 . pep RLKKGVGLH FYSAI RLFTQAVI E FPQTAEH 

I : ■ i : : I i ; i I I I I i I f I I M I I 

g031 NQQRQHHHGKRHIKQQVRIGNAHHQRHHRQRNRYGSSQAQPTDIRLFTQAVIEFPQTAEH 
60 70 80 90 100 110 

40 50 60 70 30 

m031 .pep CRRTRDQHQERRNRQG FRRPVQHVGRRNQQQRHS-QTCGQSGRNHAQKQQCATRQ 

I : I I I I I I I I I I! II I I I I I I I I : i I I I I : I : I : i : : : : : : I : I : 
g0 31 CQRTRDQHQERRNRQGFRRPVQHAGGRNQQTEHDEQSCLRQP3QTVHHTQNVFRRTVALV 
120 130 140 150 160 170 

g031 TDNDAGKVNRQKAAAAYGIGKRKHKQPARHNHQRVQTFRTHLQFPINVIAVSRVKMFFTS 

180 190 200 210 220 230 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 19>: 

g032 . seq 

1 ATGCGGCGAA ACGTGCCTGC CGTCGCCGTA TTGCGCCGCC CACGATTCGA 

51 GGCGTTTTTG GATTTGGCGT TGGCTCAGGC GCGTGCCGTT CCTGCCGGTA 

101 AACAGGGCTT TGCCGTCCGA TGCCGTCTGA CGCAGCGGCA GATAGTTTTT 

151 CAAGGCTTCC ACGCTTTTGC CGGTCAGCGG AACCTGACGC TGCTTGCGCC 

201 CTTTGCCGGT AACGTGTACC CACGCTTCGT CCAAATATAC ATCATCTGCA 

251 TTCAAGCCGT GTATCTCGCT CACGCGCAAA CCGCTGCCGT ACATCAGCTC 

301 GAACAGCGCG TGGTCGCGCA CCGCCAGCGG GTCGCCGCCG TCCACGGGCA 

351 AATCCAACAT CCGGTTCAGC CATTCCTGCG GCAGGGCTTT GGGTACGCGC 

401 TCGGGCTGCT TCGGCGGTTT GATGTCGGCG GTCGGGTCGG CGCGCATCAG 

4 51 CCCGCGTTTG ACCAGCCAGG CGCAATACTG CCGCCACGCC GACAGCTTGC 

501 GCGCCAGCGT CCGACCGTCC AAACCGCGCT GCGACAGCCG CCGCAACGCC 

551 GccgTAAAAT CGCGCCGCGA CAAGTCCTGC GGCACGCcgc ctgcaTCTTC 

601 AGACGGCATT TGTGCCAACA GTGCAAACAG TTCTTCCAAA TCGCGCCGGT 

651 ATGCCGCAAC CGTGTGCTCC GACTTGCCCT CGCGCACGAT GTTTTCCAAA 

701 TAAGCGTCAA AATacgccgC AAACccgTCC AAAACCATAA CCGTCCCACA 

7 51 CAAATATCAA AAAACCAGTG A 

This corresponds to the amino acid sequence <SEQ ID 110; ORF 032.ng>: 

g032 .pep 

1 MRRNVPAVAV LRRPRFEAFL DLALAQARAV PAGKQGFAVR CRLTQRQIVF 

51 QGFHAFAGQR NLTLLAPFAG NVYPRFVQIY IICIQAVYLA HAQTAAVHQL 
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101 EQRVVAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGAHQ 
151 PAFDQPGAIL PPRRQLARQR PTVQTALRQP PQRRRKIAPR QVLRHAACIF 
201 RRHLCQQCKQ FFQIAPVCRN RVLRLALAHD VFQI 3VKIRR KPVQNHNRPT 
251 QISKNQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 1 1>: 

kTGCGCCGCC CATTGCGCCA 
GCGTGCCGTT CCTGCCGGTA 
3CAGCGGCA GATAGTTTTT 
CACCTGCCGC TgTT.GCGCC 
CCAAATAGAC ATCATCTGCA 
CCGCTGCCGT ACATCAGTTC 



m032 . seq 


(partial) 






I 


ATGCGGCGAA 


ACGTGCmTGC 


mGTCGCCGTT 


51 


AACGTTTTTG 


GATTTGGCGT 


TGGCTCAGGC 


101 


AACAGGGCTT 


TGCCGTCCGA 


TGCCGTCTGA 


151 


CAGGGCTTCC 


ACGCTTTTGC 


CGACCAGCGG 


201 


CTTTGCCGAT 


AAcGTGTACC 


CACGCyTCGT 


251 


TTCAAGCCGT 


GTATCTCGCT 


CACGCGCAAA 


301 


GAACAGGGCG 


TGGTCGCGCA 


CCGCCAGCGG 


351 


AATCCAGCAT 


CCGGT7CACC 


CATTCCTGCG 


401 


TCGGGCTGCT 


TCGGCGGTTT 


GATGTCGGCG 


4 51 


GCCGCGCTTT 


ACCAGCCAAA 


CGCAATACTG 


501 


GAGCCAGCGT 


CCGTTCCCCC 


AAACCGCG. . 



This corresponds to the amino acid sequence <SEQ ID 112; ORF 032>: 

m032.pep (partial) 

1 MRRNVXAVAV XRRPLRQTFL DLALAQARAV PAGKQG FAVR CRLTQRQIVF 

SI QGFHAFADQR HLPLXAPFAD NVYPR XVQID IICIQAVYLA HA QTAAVKQF 

101 EQGVVAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGVHQ 

151 AALYQPNAIL PPRRKLASQR PFPQTA... 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 13>: 

a032 . seq 

1 ATGCGGCGAA ACGTGCCTGC CGTCGCCGTT TTGCGCCGCC CATTGCGCCA 

51 AACGTTTTTG GATTTGGCGT TGGCTCAGGC GCGTGCCGTT CCTGCCGGTA 

101 AACAGGGCTT TGCCGTCCGA TGCCGTCTGA CGCAGCGGCA GATAGTTTTT 

151 CAGGGCTTCC ACGCTTTTGC CGGTCAGCGG AACCTGCCGC TGCTTGCGTC 

201 CTTTGCCGGT AACGTGTACC CACGCCTCGT CCAAATATAC ATCATCTGCA 

251 TTCAAGCCGT GTATCTCGCT CACGCGCAAA CCGCTGCCGT ACATCAGTTC 

301 GAACAGCGCG TGATCGCGCA CCGCCAGCGG GTCGCCGCCG TCCACGGGCA 

3 51 AATCCAGCAT CCGGTTCAGC CATTCCTGCG GCAGGGCTTT GGGTACGCGC 

4 01 TCGGGCTGCT TCGGCGGTTT GATGTCGGCG GTCGGGTCGG TATGCAGCAG 
4 51 ACCGCGTTTG ACCAGCCAGG CGCAATACTG CCGCCAAGAC GACAGCTTGC 
501 GCGCCAGCGT CCGCGCATTC AAACCGCGCT GCGACAGCCG CCGCAACGCC 
551 GCCGTAAAAT CGCGCTGCGA CAAGCCCTGC GGCACGCCGC CTGCATCTTC 
601 AGACGGCATT TGTGCCAACA GCGCAAACAG TTCTTCCAAA TCGCGCCGGT 
651 ATGCCGCCAC CGTGTGCTCC GACTTGCCCT CGCGCACGAT GTTTTCCAAA 
701 TAAGCGTCAA AATGCGCCGC AAACCCGTCC AAAACCATAA CCGCCCCACA 
7 51 CAAATATCAA AAAAACAGTG A 

This corresponds to the amino acid sequence <SEQ ID 114; ORF 032.a>: 

a032 . pep 

1 MRRNVPAVAV LRRPLRQTFL DLALAQARAV PAGKQG FAVR CRLTQRQIVF 

51 QGFHAFAGQR NLPLLASFAG NVYPR LVQIY IICIQAVYLA HA QTAAVHQF 

101 EQRVIAHRQR VAAVHGQIQH PVQPFLRQGF GYALGLLRRF DVGGRVGMQQ 

151 TAFDQPGAIL PPRRQLARQR PRIQTALRQP PQRRRKIALR QALRHAACIF 

201 RRHLCQQRKQ FFQIAPVCRH RVLRLALAHD VFQISVKMRR KPVQNHNRPT 

251 QISKKQ* 

m032/a032 88. 1 % identity over a 1 76 aa overlap 

10 20 30 40 50 60 

m032 . pep MRRNVXAVAVXRRPLRQTFLDLALAQARAVPAGKQGFAVRCRLTQRQI VFQGFHAFADQR 

[III! I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I [ I f I i I I I I I I I i I I I I I II 
a032 MRRNVPAVAVLRRPLRQTFLDLALAQARAVPAGKQGFAVRCRLTQRQIVFQG FHAFAGQR 

10 20 30 40 50 60 
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70 80 90 100 110 120 

m0 32 . pep H L P LX A P FA DN V Y ? RX VQ I D 1 1 C I Q A V Y L AKAQT AAV H QFE QGVV AH RQR V AA VH G Q I QH 

: I I ( I I i Mill III I I I 1 i II , I , I I I 11 I I ! | | | | I : t I I I I 1 I I j I I I I 1 I 
a 0 3 2 NLPLLASFAGNVYPRLVQI YI ICIQAVY LAHAQT AAV HQFEQRVIAH RQ R V AA VH G Q I QH 

70 SO 90 100 110 120 

130 140 150 160 170 

m0 3? . pep P VQ P F LR QG FG Y AL G L LP. R F D V G GR VG V H C AAL Y Q ?N A I L ? ? R R K LA S Q R P F PQT A 

M I I M 1 I I M 1 I I I I 1 I 1 M I I ! I I I : : I : I : i I : ! I I 1 I M : I I ill !i 
a032 P VQ P FLRQG FG YALGLLRR FDVGGRVGMQQT AFDQPGA I LP PRRQLARQRPR I QTALRQP 

130 140 150 160 170 180 

a032 PQRRRKI ALRQALRHAACI FRRHLCQQRKQFFQIAPVCRHRVLRLALAH DVFQISVKMRR 

190 200 210 220 230 240 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 032 shows 86.4% identity over a 176 aa overlap with a predicted ORF (ORF 032. ng) 
from N. gonorrhoeae: 

m032/g032 

10 20 30 40 50 60 

m032 . pep MRRNVXAVAVXRRPLRQTFLDLALAQARAVPAGKQG FAVRCRLTQRQIVFQGFHAFADQR 

I i M I I I I 1 111 : : I M I I 1 I! I I I I ! I I I I I I I i I I M I I I I i 1 M I i I ! I I II 
g0 3 2 MRRNVPAVAVLRRPRFEAFLDLALAQARAVPAGKQG FAVRCRLTQRQIV FQG FHAFAGQR 

10 20 30 40 50 60 

70 80 90 100 110 120 

mO 3 2 . pep H LPLXAP FADNVY PRXVQI D I IC I QAV YLAH AQT AAVHQFEQG WAKRQRVAAVHGQI QH 

: 1 I 1 M I MM! Mi I I ! i I I I M I I I I I I I I I I : ! I I II i I I I M i M I I I I 1 
g032 NLTLLAPFAGNVYPRFVQIYI ICIQAVYLAHAQTAAVHQLEQRVVAHRQRVAAVHGQIQH 

70 80 90 100 110 120 

130 140 150 160 170 

m032 . pep PVQPFLRQGFGYALGLLRRFDVGGRVGVHQAALYQPNAILPPRRKLASQRPFPQTA 
I M I I ( I I I I I 1 I I 1 I I I I I II I I I I I : I I I : I : I I I ; . i : ; j| I Ml 
g032 PVQPFLRQGFGYALGLLRRFDVGGRVGAHQPAFDQPGAILPPRRQLARQRPTVQTALRQP 

130 140 150 160 170 180 

g032 PQRRRKI APRQVLRHAAC I FRRHLCQQCKQFFQIAPVCRNRVLRLALAHDVFQI SVKIRR 

190 200 210 220 230 240 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 1 15>: 

g033 . seq 

1 ATGGCGGCGG CGGACAAACT CTTGGGCGGC GACCGCCGCA GCGTCGCCAT 

51 CATCGGAGAC GGCGCGATGA CGGCGGGGCA GGCGTTTGAA GCCTTGAATT 

101 GCGCGGGCGA TATGGATGTG GATTTGCTGG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGCGCGTTG CCCAAATATC TTGCCAGCAA 

201 CGTCGTGCGC GATATGCACG GACTGTTGAG TACCGTCAAA GCGCAAAcgg 

2 51 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGagtTTGC C C AAAAAGT C 

3 01 GAACAcaaaA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 
351 GCTGTCGCTG TTTGAAAATT TCGGCTTCCG CTACACCGGC CCCGTGGACG 
401 G AC AC AACG T CGAGAATCTG GTGGACGTAT TGAAAGACTT GCGCAGCCGC 

4 51 AAAGGCCCTC AGTTGCTGCA CGTCATCACC AAAAAGGG C A ACGGCTACAA 

5 01 ACTCGCCGAA AACGACCCcg tcaAATACCA CGCCGTCGCc aACCTGCcta 
551 AAGAAGGCGG GGCGCAAATg ccGTCTGAAA AAGAACCCAA GCCCGCCgCc 
601 aaaccgACCT ATACCCAAGT ATTCGGCAAA TGGCTGTGCG ACCGGGCGGC 
651 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 
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701 GACTGGTGGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC 

751 ATCGCCGAGC AGCACGCCGT tacCTTTGCC GGCGGTTTGG CGTGCGAAGG 

801 CATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 

851 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTGCCCGT TTTGTTTGCC 

901 GTCGACCGTG CGGGCATCGT CGGCGCGGAC GGTCCGACCC ATGCCGGCTT 

951 GTACGATTTG AGCTTCTTGC GCTGTGTGCC GAACATGATT GTTGCCGCGC 

1O01 CGAGCGATGA AAA CG AATGC CGCCTGCTGC TTTCGACCTG CTATCAGGCG 

10 51 GATGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGCGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCC TTcatTGCCT 7CGGCAGTAT GGTCGCCACC 

12 01 GCATTGGCGG TTGCCGAAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTt 

12 51 cgccaaacCG ATAG\CGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCAcg 

13 01 accGCATCGT TACCCTTGAA GAAAACGCCG AACAGGGCGG CGCAGGCGGC 

13 51 GCGG TCTTGG AAGTGTTGGC GAAACACGGC ATCTGCAAAC CCGTTTTGCT 

14 01 TTTGGGCGTT GCCGATACCG TAACCGAACA CGGCGAT CCG AAAAAACTTT 
14 51 TGGACGATTT GGGTTTGAGT GCCGAAGCGG TGGAACGCCG GGTGCGCGAG 
1501 TGGCTGCCGG ACCGTGATGC GGCAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 116; ORF 033. ng>; 

g0 3 3 .pep 

1 MAAADKLLGG DRRSVAIIGD GAMTAGQAFE ALNCAGDMDV DLLWLNDNE 

51 MSISPNVGAL PKYLASNWR DMHGLLSTVK AQTGKVLDKI PGAMEFAQKV 

101 EHKIKTLAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLKDLRSR 

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKEGGAQM PSEKEPKPAA 

2 01 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG 
251 IAEQKAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHDI ALQNLPVLFA 

3 01 VDRAGIVGAD GPTHAGLYDL SFLRCVPNMI VAAPSDENEC RLLLSTCYQA 

3 51 DAPAAVRYPR GTGTGAPVSD GMETVEIGKG I IRREGEK TA FIAFG5MVAT 

4 01 ALAVAEKLNA TVADMRFVKP IDEELIVRLA RSHDRIVTLE ENAEQGGAGG 

4 51 AVLEVLAKHG ICKPVLLLGV ADTVTEHGDP KKLLDDLGLS AEAVERRVRE 

5 01 WLPDRDAAN* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 17>: 

m033 . seq 

1 ATGGCGGCGG CAGACAAACT CTTGGGCAGC GACCGCCGCA GCGTCGCCAT 

51 CATCGGCGAC GGCGCGATGA CGGCGGGGCA GGCGTTTGAA GCCTTGAATT 

101 GCGCaG.CGA TATGGATGTr GATTTGCTrG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGCGCGCTG CCG AAATACC TTGCCAGCAA 

201 CGTCGTGCGC GATATGCACG GCCTGTTGAG TACCGTCAAA GCGCAAACGG 

2 51 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGAGTTTGC CCAAAAAGTC 

3 01 GAACACAAAA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 

3 51 GCTGTCTTTG TTTGAAAACT TCGGCTTCCG CTACACCGGC CCCGTGGACG 

4 01 GACACAACGT CGAAAATCTG GTGGACGTAT TGAAAGACTT GCGCAGCCGC 

4 51 AAAGGCCCTC AGTTGCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA 
501 ACTCGCCGAA AACGACCCCG TC AAATACC A CGCCGTCGCC AACCTGCCTA 

5 51 AAGAAAGCGC GGCGCAAATG CCGTCTGAAA AAGAACCCAA GCCCGCCGCC 

6 01 AAACCGACCT ATACCCAAGT GTTCGGCAAA TGGCTGTGCG ACCGGGCGGC 

6 51 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 

7 01 GCTTGGTTGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC 

7 51 ATCGCCGAGC AGCACGCCGT TACCTTTGCC GGCGGTTTGG CTTGCGAAGG 

8 01 GATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 

8 51 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTACCCGT TTTGTTTGCC 

9 01 GTCGACCGCG CGGGCATCGT CGGCGCGGAC GGCCCGACCC ATGCCGGTCT 
9 51 GTACGATTTG AGCTTTTTGC GCTGCGTGCC GAACATGATT GTCGCCGCGC 

10 01 CGAGCGATGA AAACGAATGC CGCCTGTTGC TTTCGACCTG CTATCAGGCA 

10 51 GACGCGCCCG CCGCCGTCCG CIV. SCCGCGC GGCACGGGTA CGGGCGCGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCA TTCATTGCCT TCGGCAGTAT GGTCGCCCCC 

12 01 GCATTGGCGG TTGCCGAAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTT 

12 51 CGTCAAACCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCACG 
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13 01 ACCGCATCGT TACCCTTGAA GAAAACGCCG AACAGGGCGG CGCAGGCGGC 
1351 GCGGTGCTGG AAGTATTGGC GAAACACGGC ATCTGCAAAC CCGTTTTGCT 

14 01 TTTGGGCGTT GCCGATACCG TAACCGGACA CGGCGATCCG AAAAAACTTT 
14 51 TAGACGATTT GGGCTTGAGT GCCGAAGCGG TGGAACGGCG TGTGCGCGCG 
1501 TGGCTGTCGG ATCGGGATGC GGCAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 1 IS; ORF 033>: 

m0 3 3 .pep 

1 MAAADKLLGS DRRSVAI IGD GAMTAGQAFE ALNCAXDMDV DLLWLNDNE 

51 MSISPNVGAL PKYLASNWR DMHGLLSTVK AQTGKVLDKI P GAME FAQ KV 

101 EHKI KTLAEE AEHAKQSLSL FENFGFRYTG PVHGHNVENL VDVLKDLRSR 

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKESAAQM PSEKEPKPAA 

201 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG 

251 IAEQHAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHDI ALQNLPVLFA 

3 01 VDRAGIVGAD GPTHAGLYDL SFLRCVPNMI VAAPSDENEC RLLLSTCYQA 

351 DAPAAVRYPR GTGTGAPVSD GMETVEIGKG I IRREGEK TA FIAFGSMVAP 

401 ALAVA EKLNA TVADMRFVKP IDEELIVRLA RSHDRIVTLE ENAEQGGAGG 

451 AVLEVLAKHG ICKPVLLLGV ADTVTGHGDP KKLLDDLGLS ASAVERRVRA 

501 WLSDRDAAN* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 19>: 

a033 . seq 

1 ATGGCGGCGG CGGACAAACA GTTGGGCAGC GACCGCCGCA GCGTCGCCAT 

51 CATCGGCGAC GGCGCGATGA CGGCGGGTCA GGCGTTTGAA GCCTTGAACT 

101 GCGCGGGCGA TATGGATGTG GATTTGCTGG TCGTCCTCAA CGACAACGAA 

151 ATGTCGATTT CCCCCAACGT CGGTGCGTTG CCCAAATACC TTGCCAGCAA 

201 CGTCGTGCGC GATATGCACG GACTGTTGAG TACCGTCAAA GCGCAAACGG 

2 51 GCAAGGTATT AGACAAAATA CCCGGCGCGA TGGAGTTTGC CCAAAAAGTC 
301 GAACATAAAA TCAAAACCCT TGCCGAAGAA GCCGAACACG CCAAACAGTC 

3 51 ACTGTCTTTG TTTGAAAACT TCGGCTTCCG CTATACCGGC CCCGTGGACG 

4 01 GACACAACGT CGAAAATCTG GTCGATGTAT TGGAAGACCT GCGCGGACGC 
4 51 AAAGGCCCGC AGCTTCTGCA CGTCATCACC AAAAAGGGCA ACGGCTACAA 
501 ACTCGCCGAA AACGATCCCG TCAAATACCA CGCCGTCGCC AACCTGCCTA 
551 AAGAAAGCGC GGCGCAAATG CCGTCTGAAA AAGAACCCAA GCCCGCCGCC 
601 AAACCGACCT ATACCCAAGT GTTCGGCAAA TGGCTGTGCG ACCGGGCGGC 
651 GGCAGATTCC CGACTGGTTG CGATTACCCC CGCCATGCGC GAGGGCAGCG 
701 GCTTGGTTGA GTTTGAACAA CGATTCCCCG ACCGCTATTT CGATGTCGGC 
7 51 ATCGCCGAGC AGCACGCCGT TACCTTTGCC GGCGGTTTGG CTTGCGAAGG 
301 GATGAAGCCC GTCGTGGCGA TTTATTCCAC CTTTTTACAA CGCGCCTACG 
851 ACCAACTGGT GCACGACATC GCCCTGCAAA ACCTGCCCGT TTTGTTTGCC 
901 GTCGACCGCG CGGGCATCGT CGGCGCGGAC GGCCCGACCC ATGCCGGTTT 
951 GTACGATTTA AGCTTTTTGC GCTGCATTCC GAATATGATT GTCGCCGCGC 

1001 CGAGCGATGA AAATGAATGC CGCCTGCTGC TTTCGACCTG CTATCAGGCA 

1051 GACGCGCCCG CCGCCGTCCG CTATCCGCGC GGCACGGGTA CGGGCGTGCC 

1101 GGTTTCAGAC GGCATGGAAA CCGTGGAAAT CGGCAAGGGC ATTATCCGCC 

1151 GCGAAGGTGA GAAAACCGCA TTCATTGCCT TCGGCAGTAT GGTCGCCCCT 

1201 GCATTGGCGG TCGCCGGAAA ACTGAACGCC ACCGTCGCCG ATATGCGCTT 

12 51 CGTCAAACCG ATAGACGAAG AGTTGATTGT CCGCCTTGCC CGAAGCCACG 

1301 ACCGCATCGT TACCCTTGAA GAAAACGCCG AACAGGGCGG CGCAGGCAGC 

1351 GCGGTGCTGG AAGTGTTGGC GAAACACGGC ATCTGCAAAC CCGTCTTGCT 

14 01 TTTGGGCGTT GCCGATACCG TAACCGGACA CGGCGATCCG AAAAAACTTT 

14 51 TAGACGATTT GGGCTTGAGT GCCGAAGCGG TGGAACGGCG TGTGCGCGCG 

1501 TGGCTGTCGG ATCGGGATGC GGCAAATTAA 

This corresponds to the amino acid sequence <SEQ ID 120; ORF 033. a>: 

a03 3 . pep 

1 MAAADKQLGS DRRSVAI IGD GAMTAGQAFE ALNCAG DMDV DLLWLNDNE 

51 MSISPNVGAL PKYLASNVVR DMHGLLSTVK AQTGKVLDKI PGAMEFAQKV 

101 EHKIKTLAEE AEHAKQSLSL FENFGFRYTG PVDGHNVENL VDVLEDLRGR 

151 KGPQLLHVIT KKGNGYKLAE NDPVKYHAVA NLPKESAAQM PSEKEPKPAA 

201 KPTYTQVFGK WLCDRAAADS RLVAITPAMR EGSGLVEFEQ RFPDRYFDVG 

251 IAEQHAVTFA GGLACEGMKP WAIYSTFLQ RAYDQLVHDI ALQNLPVLFA 
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301 VDRAGIVGAD GPTHAGLYDL SFLRCIPNMI VAAP3DENEC RLLLSTCYQA 

3 51 DAPAAVRYPR GTGTGVPVSD GMETVEIGKG I IRREGEK TA FIAFGSMVAP 

4 01 ALAVA GKLNA TVADMRFVK? IDEELIVRLA RSHDRI VTLE ENAEQGGAGS 
4 51 AVLEVLAKHG ICKPVLLLGV ADTVTGHGDP KKLLDDLGLS AEAVERRVRA 
501 WLSDRDAAN* 

3/a033 98.4% identity over a 509 aa overlap 

10 20 30 40 50 60 

m033 .pep MAAADKLLGSDRRSVAI IGDGAMTAGQAFEALNCAXDMDVDLLVVLNDNEMS ISPNVGAL 

I M I I I M I I I I 1 I I I I I I 1 I I I i I I I I .11 I I II I I I I I t I I I II I ! I i M I I 1 
a03 3 MAAADKQLGSDRRSVAI I GDGAMTAGQAFEALNCAGDMDVDLLVVLNDNEMS ISPNVGAL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m033 .pen PKYLASNVVRDMHGLLSTVKAQTGKVLDKIPGAMEFAQKVEHKIKTLAEEAEHAKQSLSL 
I I I I II I I I I I I I I M I I I I I I I I ! I I I I i i I I I I I I I I 1 I 1 I I I I I I I I M I I I I I I I I 
a0 3 3 PKYLASNVVRDMHGLLSTVKAQTGKVLDKIPG/a^EFAQKVEHKIKTLAEEAEHAKQSLSL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m033 . pep FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 
I I I I I 11 I I M I M I I I I I t I I I I : I I i : I I I I I I I M I I I I II I I I I I I I I I I I I I I I I 
a 0 3 3 FEN FG FRYTGP VDGHN VEN LVDVLE DLRGRKG PQLLHV I TKKGNGYKLAEN D PVKYHAVA 

130 140 150 160 170 180 

190 200 2iJ 220 230 240 

m033 .pep NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 
! II I I I I 11 I I I I I I I M I I I M I I I I I I I t I I I I ! I II I I I M I I I I I I ! I I I I I I I I I 
a03 3 NLPKESAAQMPSEKEPKPAAKPTYTQV FGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 

190 200 210 220 230 240 

250 260 270 280 290 300 

m033 . pep RFPDRYFDVGIAEQHAVT FAGGLACEGMKPVVAI YSTFLQRAYDQLVHDIALQNLPVLFA 

I I I i ! I I I I I I I I I I I I I I I I I i II I I I I M ! 1 I \ I I I I I i I t 1 I I I I I I I I I I I I I I I I 
a033 RFPDRYFDVGIAEQHAVT FAGGLACEGMKPVVAI YSTFLQRAYDQLVHDIALQNLPVLFA 

250 260 270 280 290 300 

310 320 330 340 350 360 

m03 3 .peo VDRAGIVGADGPTHAGLYDLSFLRCVPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 
I I I II I M 1 I I I I I I i I I I I i I I I I : I I 1 M I I I I I ! M I II t M I I I ! I I I I I I II I I I 
a033 VDRAGIVGADGPTHAGLYDLSFLRCIPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 

310 320 330 340 350 360 

370 380 390 400 410 420 

m033 .pep GTGTGAPVSDGMETVEIGKGI IRREGEKTAFIAFGSMVAPALAVAEKLNATVADMRPVKP 

I I I It : I I I t I I I I I M I I I I I M I I I I I I I I I I I I I I I i I I I II I I I I II I I I I I I! I 
a 0 3 3 GTGTGV P V S DGMETVE I GKG 1 1 RREGEKT AFI AFG SMVAPALAV AGKLN AT VADMRFVK P 

370 380 390 400 410 420 

430 440 450 460 470 430 

m033 . pep IDEELIVRLARSHDRIVTLEENAEQGGAGGAVLEVLAKHGICKPVLLLGVADTVTGHGDP 
! I I I I I j I I I I I I I I I I I I I I I I I I I I I I : I M M I I I i I I I I I I I I I I I II I M I I j I ! 
a033 IDEELIVRLARSHDRIVTLEENAEQGGAGSAVLEVLAKHGICKPVLLLGVADTVTGHGDP 

430 440 450 4 60 470 480 

490 500 510 

m03 3 .pep KKLLDDLGLSAEAVERRVRAWLSDRDAANX 
I I I i I I I I I M I I I I I I I I I I i I I I I I I I I 
a 03 3 KKLLDDLGLSAEAVERRVRAWLSDRDAANX 

490 500 510 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 033 shows 98.4% identity over a 509 aa overlap with a predicted ORF (ORF 033.ng) 
from A', gonorrhoeae: 

m033/g033 

m033 . oep MAAADKLLGSDRRSVAI IGDGAMTAGQAFEALNCAXDMDVDLLVVLNDNEMSTS^.njvGAL 60 

Milllllhlllllllllllilllllllllllii IliMiilllMMIIIIIIIIII 

g03 3 MAAADKLLGGDRRSVAI I GDGAMTAGQAFEALNCAGDMDVDLLWLNDNEMS ISPNVGAL 60 

mO 3 3 . pep PKYLASNWRDMHGLLSTVKAQTGKVLDKI PGAMEFAQKVEHKI KTLAE E AEHAKQS LS L 12 0 

IliilllillllillUllllllllliliilillllllllililllilllllMIIIMI 

g03 3 PKYLASNWRDMHGLLSTVKAQTGKVLDKI PGAMEFAQKVEHKI KTLAE EAEHAKQSLSL 120 

m033 .pep FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKLAENDPVKYHAVA 180 

IliMIIMIIIIIIMMIIMIIIIIIIillllllllllllllMillllllllllll 

g03 3 FENFGFRYTGPVDGHNVENLVDVLKDLRSRKGPQLLHVITKKGNGYKLAENDPVKYKAVA 180 

m0 3 3 .pep NLPKESAAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 24 0 

ilMI-IIUMUIIMIIillMIIMIIIIIIMIIIIIilllMlilllllilll 

g03 3 NLPKEGGAQMPSEKEPKPAAKPTYTQVFGKWLCDRAAADSRLVAITPAMREGSGLVEFEQ 24 0 

m03 3 .pep RFPDRYFDVGIAEQHAVTFAGGLACEGMKPWAI YSTFLQRAYDQLVHDIALQNLPVLFA 3 00 

IMIIIIIIIIIIIIIIIIIMMIIIMIMMIIMMIMIMMIMIIIIIIMI 

g03 3 R FPDRYFDVG I AEQHAVTFAGGLACEGMKPWAI YSTFLQRAYDQLVHDIALQNLPVLFA 3 0 0 

m03 3 .pep VDRAGIVGADGPTHAGLYDLSFLRCVPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 3 60 

lllllllllllllllllllllllllllllllillllilllllllll!lll!lllll!lll 

g0 3 3 VDRAGIVGADGPTHAGLYDLSFLRCVPNMIVAAPSDENECRLLLSTCYQADAPAAVRYPR 3 60 

m03 3 .pep GTGTGAPVS DGMETVEI GKG I IRREGEKTAFIAFGSMVAPALAVAEKLNATVADMRFVKP 42 0 

IIIMIIIIIIillllllllMIMIIIIIIIIIIIMI IIIIIIIMIIIIIIIIIII 

g033 GTGTGAPVSDGMETVEIGKG I IRREGEKTAF I AFGSMVATALAVAEKLNATVADMRFVKP 42 0 

m03 3 .pep IDEELIVRLARSHDRIVTLEENAEQGGAGGAVLEVLAKHGICKPVLLLGVADTVTGHGDP 4 80 

IIIIIIMMIIIllllllMllliMMIIMIIIIMIIMIMIIIIMIII III! 

g0 3 3 IDEELI VRLARSHDRIVTLEENAEQGGAGGAVLEVLAKHGICKPVLLLGVADTVTEHGDP 4 80 

m033 .pep KKLLDDLGLS AEAVERRVRAWLS DRDAANX 510 

: ! ■ : I , I ■ I II MIIMI 

g033 KKLLDDLGLSAEAVERRVREWLPDRDAANX 510 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 121>: 

g034 . seq 

1 ATGAGCCGTT TATGGTTTTT TGCCGTAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACTCGTATCC ATGCGCCAAC 

101 TGCTTGACCA CGCCGCCGAA AACAGCTACG GCCTGCCCGC GTTCAACGTC 

151 AACAACCTCG AACAAATGCG CGCCATTATG GAAGCCGCCG ACCAAGTCAA 

2 01 CGCGCCCGTC ATCGTACAGG CGAGCGCAGG TGCGCGCAAA TACGcggGCG 
251 CGCCGTTTTT GCGCCACCTG ATTCTGGCGG CAGTCGAAGA ATTTCCGCAC 

3 01 ATCCCCGTCG TGATGCACCA AGACCACGGC GCATCGCCCG ACGTgtgCCA 

3 51 ACGCTCCATC CAACTGGGCT TCTCCTCCGT GATGATGGAC GGCTCTTTGC 

4 01 TCGAAGACGG CAAAACCCCT TCTTCTTACG AATACAACGT CAACGCCACC 
4 51 CGTACCGTCG TCAACTTCTC CCACGCCTGC GGCGTGTCCG TCGAAGGCGA 
501 AATCGGCGTA TTGGGCAACC TCGAAACCGG CGAAGCAGGC GAAGAAGACG 
551 GAGTGGGCGC GGCAGGCAAA CTCTCACACG ACCAAATGCT CACCAGCGTT 
601 GAAGATGCCG TGCGTTTCGT TAAAGATACC GGCGTTGACG CATTGGCGAT 
651 TGCCGTCGGC ACCAGCCACG GCGCATACAA ATTCACCCGT CCGCCCACAG 
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701 GCGACGTATT GCGTATCGAC CGCATCAAGG AAATCCACCA AGCCCTGCCC 

751 AATACACACA TCGTGATGCA CGgctCCAGC TCCGTTCCGC AAGAatgGCT 

8 01 GAAAGTCATC AACGAATACG GCGGCAATAT CGGCGAAACC TACGGCGTGC 

851 CGGTTGAAGA AATCGTCGAA GGCATCAAAC ACGGCGTGCG CAAAGTCAAC 

901 ATCGATACCG ACCTGCGCCT CGCTTCCACC GGCGCGGTAC GCCGCTACCT 

951 TGCCGAAAAC CCGTCCGACT TTGATCCGCG CAAATACTTG GGCAAAACCA 

10 01 TTGAAGCGAT GAAGCAAATC TGCCTCGACC GTTATCTTGC GTTCGGTTGC 

1051 GAAGGTCAGG CAGGCAAAAT CAAACCTGTT TCGTTGGAAA AAATGGCAAG 

1101 CCGTTATGCC AAGGGCGAAT TGAACCAAAT CGTCAAATAA 



This corresponds to the amino acid sequence <SEQ ID 122; ORE 034.ng>: 



g034 . pep 
1 


MSRLWFFAVK 


NI I IRLIYLL 


PKETQMALVS 


MRCLLDKAAE 


NSYGLPAFNV 


51 


NNLEQMRAIM 


EAADQVNAPV 


IVQASAGARK 


YAGAPFLRHL 


ILAAVEEFPH 


101 


I PWMHQDHG 


ASPDVCQRSI 


QLGFSSVMMD 


GSLLEDGKTP 


SSYEYNVNAT 


151 


RTWNFSHAC 


GVSVEGEIGV 


LGNLETGEAG 


EEDGVGAAGK 


LSHDOMLTSV 


201 


EDAVRFVKDT 


GVDALAIAVG 


TSHGAYKFTR 


PPTGDVLRID 


RIKEIHQALP 


251 


NTHIVMHGSS 


SVPQEWLKVI 


NEYGGNIGET 


YGVPVEEIVE 


GIKHGVRKVN 


301 


IDTDLRLAST 


GAVRRYLAEN 


PSDFDPRKYL 


GKTI EAMKQI 


CLDRYLAFGC 


351 


EGQAGKIKPV 


SLEKMASRYA 


KGELNQIVK* 







The following partial DNA sequence was identified in N. meningitidis <SEQ ID 123>: 

m034.seq (partial) 

1 ATGAGCTGTT TATGGTTTTT TGCTGTAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACTCGTATCC ATGCGCCAAC 

101 TGCTTGATCA TGCTGCCGAA wACAGCTACG GCyTGCCGGC GTTCAACGTC 

151 AACAACCTGG wACAGATGCG CGCCATCATG GAGGCTGCAG ACCAAGTCGA 

201 CGCCCCCGTC AT CG T AC AG G CGAGTGCCGG TGCGCGCAAA TATGCGGGTG 

251 CGCCGTTTTT ACGCCACCTG ATTTTGGCGG CTGTCGAAGT ATTTCCACAC 

301 ATCCCCGTCG TCATGCACCA AGACCACGGC GCATCACCCG ACGTGTGCCA 

351 ACGCTCCATC CAACTGGGCT TCTCCTCTGT AATGATGGAC GGCTCGCTGA 

4 01 TGGAAGACGG CAAAACCCCT TCTTCTTACG AATACAACGT CAACGCCACA 

451 CGTACCGTGG TTAACTTCTC CCACGCTTGC GGCGTATCCG TTGAAGGCGA 

501 AATCGGCGTA TTGGGCAACC TCGAAACCGG CGATGCAGGC GAAGAAGACG 

551 GTGTAGGCGC AGTGGGCAAA CTTTCCCACG ACCAAATGCT GACCAGCGTC 

601 GAAGATGCCG TATGTTTCGT TAAAGATACC GGCGTTGACG CATTGGCTAT 

651 TGCCGTCGGC ACCAGCCACG GCGCATACAA ATTCACCCGT CCGCCCACAG 

701 GCGATGTATT ACGTATCGAC CGCATCAAAG AAATCCACCA AGCCCTGCCC 

751 AATACACACA TCGTGATGCA C. . . 



This corresponds to the amino acid sequence <SEQ ID 124; ORF 034>: 

m034.pep (partial) 

1 MSCLWFFAVK Nil IRLIYLL PKETQMALVS MRQLLDHAA E XSYGLPAFNV 

51 NNLXQMRAIM EAADQVDAPV IVQASAGARK YAGAPFLRHL ILAAVEVFPH 

101 I PWMHQDHG ASPDVCQRSI QLGFSSVMMD GSLMEDGKTP SSYEYNVNAT 

151 RTWNFSHAC GVSVEGEIGV LGNLETGDAG EEDGVGAVGK LSHDQMLTSV 

201 EDAVCFVKDT GVDALAIAVG TSHGAYKFTR PPTGDVLRID RIKEIHQALP 

251 NTHIVMH. . . 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 125>: 

a034 . seq 

1 ATGAGCCGTT TATGGTTTTT TGCCGCAAAA AACATTATAA TCCGCCTTAT 

51 TTACCTATTG CCCAAGGAGA CACAAATGGC ACTCGTATCC ATGCGCCAAC 

101 TGCTTGATCA TGCTGCCGAA AACAGCTACG GCCTGCCCGC GTTCAACGTC 

151 AACAACCTCG AACAAATGCG CG . JATTATG GAAGCCGCCG ACCAAGTCAA 

201 CGCGCCCGTC ATCGTACAGG CGAGCGCAGG TGCGCGCAAA TACGCGGGCG 

251 CGCCGTTTTT GCGCCACCTG ATTTTGGCGG CTGTCGAAGA ATTTCCGCAC 

301 ATCCCCGTCG TGATGCACCA AGACCACGGC GCATCGCCCG ACGTGTGCCA 

3 51 ACGCTCCATC CAACTGGGCT TTTCCTCCGT GATGATGGAC GGCTCGCTGA 

•101 TGGAAGACGG CAAAACCCCT TCTTCTTATG AATACAACGT CAAC'^ACC 
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4S1 


CGTACCGTGG 


TTAATTTCTC 


501 


AATCGGCGTA 


TTGGGCAACC 


551 


GTGTAGGCGC 


AGTGGGCAAA 


601 


GAAGATGCCG 


TGCGTTTCGT 


651 


TGCCGTCGGC 


ACCAGCCACG 


7C1 


GCGACGTGTT 


GCGTATCGAC 


751 


AATACACACA 


TCGTGATGCA 


801 


GAAAGTCATC 


AACGAATACG 


851 


CGGTTGAAGA 


AATCGTCGAA 


901 


ATCGATACCG 


ACTTGCGCCT 


951 


TGCCGAAAAC 


CCGTCCGACT 


1001 


TTGAAGCGAT 


GAAGCAAATC 


1051 


GAAGGTCAGG 


CAGGCAAAAT 


1101 


CCGTTATGCC 


AAGGGCGAAT 



CCACGCCTGC GGCGTATCCG TTGAAGGCGA 
TCGAAACTGG CGAAGCCGGC GAAGAAGACG 
GTTTCCCACG ACCAAATGCT CACCAGCGTC 
TAAAGATACC GGCGTTGACG CATTGGCGAT 
GCGCGTACAA ATTCACCCGT CCGCCCACAG 
CGCAXAAAG AAATCCACCA AGCCCTGCCC 
CGGCTCCAGC TCCGTTCCGC AAGAATGGCT 
GCGGCAATAT CGGCGAAACC TACGGCGTGC 
GGCATCAAAC ACGGCGTGCG TAAAGTCAAC 
TGCTTCCACC GGCGCGGTAC GCCGCTACCT 
TCGATCCGCG CAAATATTTG AGCAAAACCA 
TGCCTCGACC GCTACCTCGC GTTCGGTTGC 
CAAACCGGTT 7CC7TGGAAA AAATGGCAAA 
TGAACCAAAT CGTCAAATAA 



This corresponds to the amino acid sequence <SEQ ID 126; ORF 034. a>: 

a034 . pep 

MSRLWFFAAK NIIIRLIYLL PKETQMALVS MRQLLDHAAE 



1 
51 
101 
151 
201 
251 
301 
351 



NNLEQMRAIM 
IPWMHQDHG 
RTVVNFSHAC 
EDAVRFVKDT 
MTHI VMHGSS 
IDTDLRLAST 
EGQAGKIKPV 



EAADQVNAPV 
ASPDVCQRSI 
GVSVEGEIG7 
GVDALAIAVG 
SVPQEWLKVI 
GAVRRYLAEN 
SLEKMANRYA 



I VQASAGARK 
OLGFSSVMMD 
LGNLETGEAG 
TSHGAYKFTR 
NEYGGNIGET 
PSDFDPRKYL 
KGELNQIVK+ 



YAGAPFLRHL 
GSLMEDGKTP 
EEDGVGAVGK 
PPTGDVLRID 
YGVPVEEIVE 
SKTIEAMKQI 



NSYGLPAFNV 
I LAAVEEFPH 
SSYEYNVNAT 
LSHDQMLTSV 
RIKEIHQALP 
GIKHGVRKVN 
CLDRYLAFGC 



m034/a034 96.9% identity over a 257 aa overlap 

10 20 30 40 50 60 

m03 4 .pep MSCLWFFAVKNI I IRLI YLLPKETQMALVSMRQLLDHAAEXSYGLPAFNVNNLXQMRAIM 

If I I ! M : 1 I i I i i I ! I M I I I ! I I I I I I I I i I I I I i I I I I ! I I I i j j ! i j Mill 
a03 4 MSRLWFFAAKNII IRLIYLLPKETQMALVSMRQLLDRAAENSYGLPAFNVNNLEQMPAIM 

10 20 30 40 50 60 



70 80 90 100 110 120 

m034 .pep EAADQVDAPVIVQASAGARKYAGAPFLRHLILAAVEVFPHIPVVMKQDHGASPDVCQRSI 
I I M I I : I I I I i II M I II I i i I ! I I I I M I I I I ! i I I I I I i M I I M i I I ! I M i 1 I I 
a 03 4 EAADQVNAPVIVQASAGARKYAGAP FLRHL I LAAVEEFPH IP WMHQDHGASPDVCQRS I 

70 80 90 100 110 120 

130 140 150 160 170 180 

m034 .pep QLGFSSVMMDGSLMEDGKTPSSYEYNVNATRTVVNFSKACGV5VEGEIGVLGNLETGDAG 
I I I I I I I I I I I I i I I I ! I M I I I I I I I I I I I I I I M I ! I I I M I I I I I I I M I I I I I : I I 
a034 QLGFSSVMMDGSLMEDGKTPSSYEYNVMATRTWNFSHACGVSVEGEIGVLGNLETGEAG 

130 140 150 160 170 180 

190 200 210 220 230 240 

m03 4 .pep EEDGVGAVGKLSHDQMLTSVEDAVCFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 
I I I 1 I I I I i I I I I II I I I M II I I I I I I II i I I ! I I I t I I 1 I i I | I | | | | M M | | | ] ! 
a03 4 EEDGVGAVGKLSHDQMLTSVEDAVRFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 

190 200 210 220 230 240 

250 

m034 .pep RIKEIHQALPNTHIVMH 
I I I I I I I I I I I I I I I I ! 

a034 RIKEIHQALPNTHIVMHGSSSVPQEWLKVINEYGGNIGETYGVPVEEIVEGIKHGVRKVN 
250 260 270 280 290 300 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 034 shows 96.5% identity over a 257 aa overlap with a predicted ORF (ORF 034.ne) 
from A', gonorrhoeae: 
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m034/g034 

m03 4 .pep MSCLWFFAVKL3I I IRLI YLLPKETQMALVSMRQLLDHAAEXSYGLPAFNVNNLXQMRAIM 6 0 

II IIIMMlllllllMIIIIIIIMIIIIIilMIII IIIINII11II HUM 

g034 MSRLWFFAVKNI I IRLIYLLPK^TQMALVSMRQLLDKAAENSYGLPAFNVN^JLEQMRAIM 60 

m034 .pep EAADOVDAPVIVOASAGARKYAGAPFLRHLILAAVEVFPHIPWMHODKGASPDVCORSI 120 

1 1 1 1 1 h I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i i I I i I I i 1 1 1 1 II i i ! 1 1 i 1 1 M I i I i I 

gO 3 4 EAADQVNAPV I VQASAGARKYAGAPFLRHLI LAAVEE FPH I PWMHQDHGAS PDVCQRS I 120 

m034 .pep OLGFSSVMMDGSLMEDGKTPSSYEY^^WAT?;'^VVNrSHACGVSVEGEIGVLGNLETGDAG 180 

MiiiiilllMhllllllillimiilMMMiliKIMIIillllillllhll 

□034 QLGFSSVMMDGSLLEDGKTPSSYEYNVNATRTWNFSHACGVSVEGEIGVLGNLETGEAG 180 

m034 .pep EEDGVGAVGKLSH^QMLTSVEDAVCFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 24 0 

Miiiii:iii!iiiMiiiMii iiiiMiitmiiiiiiiiiimiiniiiMi 

g034 EEDGVGAAGKLSHDQMLTSVEDAVRFVKDTGVDALAIAVGTSHGAYKFTRPPTGDVLRID 24 0 

m034.pep RIKEIHQALPNTHIVMH 25 7 
lllllllllllllllll 

g03 4 RIKEIHQALPNTHIVMHGSSSVPQEWLKVINEYGGNIGETYGVPVEEIVEGIKHGVRKVN 3 00 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 127>: 

g036 . seq 

1 ATGCTGAAGC CGTGTTTGGT ATACAGTGCC TGTGCGGCGG cgttacCTGC 

51 GCGGACTTCG AGCAGCAGGC GTTGCGTGCC TTCGGGCAGA TGTGCGTACC 

101 AATATTCGAG CAGGGCGGAC GCAACGCCCC GTCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCTGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

2 51 GCGAAACAAG CGCGGACTCA AATTGGCGTT GCGTCCACGC GGACGGGTTG 

301 CAGACGGTAT CGAGCGCGGC CAGTGCGGCG CAGTCGGACG GTGAGGCTGG 

351 GCGGATGTTC ATGTTCGTGC CTTCCGTTCC GCCTGTTCTT TGGCAG7CAG 

4 01 GGCGATTTTG TTGCGGACGT AGAGCAGTTC GGCGTGTGCC GCGCCAGTTG 

4 51 CGGGATAGCC GCCGCCGAGG GCGAGCGCGA GAAAATCGGC GGCGGTCGGC 

501 ATATCGGGTT TGCCTGAGAA GGGCGGACGG TTTTCCAGTG CGAACGCACT 

551 GCCGATGCCG TCTGAAAAGA CGTACCCCTC GGGGAGGGCA ATGTCTGCCG 

601 CCCTACCGAC TTGATAATCG CTCAAACGGC GGCGGTTCAG CGTGTCGAAC 

651 CACGCATAAA ACACTTCGCC CATACGCGCG TCCGCAGCGG CGAGTATGCA 

7 01 GCTTTGCGGC GGCGGCAGCG AGGCGGCGGC ATCGAGCGTG GGGATGCCGA 

7 51 TTAAAGGCGT GTCGAACGGC GTTGCCAAAC CTTGCGCCAC GCCGATGCCG 

801 ATACGCAGTC CGGTAA 

This corresponds to the amino acid sequence <SEQ ID 128; ORF 036.ng>: 

g036 .pep 

1 MLKPCLVYSA CAAAL PARTS SSRRCVPSGR CAYQYSSRAD ATPRRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA ILPSFSARKT CSDGETSADS NWRCVKADGL 

101 QTVSSAASAA QSDGEAGRMF MFVPSVPPVL WQSGRFCCGR RAVRRVPRQL 

151 RDSRRRGRAR ENRRRSAYRV CLRRADGFPV RTHCRCRLKR RTPRGGQCLP 

201 PYRLDNRSNG GGSACRTTHK TLRPYARPQR RVCSFAAAAA RRRHRAWGCR 

2 51 LKACRTALPN LAPRRCRYAV R* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 129>: 

m036. seq 

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTTGCGTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCCg . CGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCATGC GGACGGGTTG 

301 CAGACGGCAT CGAGTGCGGC CAGCTCCTCA CAATCGGCAC AAACGGCACG 
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351 GCGGATGTTC ACGGGCGCGC TCTCCGTTCG 

401 GGCGATTTTG TTGCGGACGT AGAGCAAACC 

4 51 CAGGATAACC GCCCTTGGCT GCCAATGCGA 

501 ATATCCGGTC TGCCTGAGAA CGGCGGAGCT 

551 GCCTATGCCG TCTGAAAAGG CGCATCCCTC 

601 CCCGCCCGAC CTGATAA7CG CTCAAACGGT 

651 CAT G CAT AAA ACACTTCGCC CATACGAGCG 

701 GCTTTGCGGC GGCGGCAGCG AGGCGGCGGC 

751 TTAAGGGGGT ATCAAACGGC GTTGCCAAAC 

801 ATACGCAGTC CGGTAA 

This corresponds to the amino acid sequence <SEQ ID 130; ORP 036>: 

m036 .pep 

1 MLKPCAVYSA CAAVL PARTS SSRRCVSSGR CVNQYSSRAD AI PWRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA IPXSFSARKT CSDGETSADS NWRCVHADGL 

101 QTASSAASSS QSAQTARRMF TGALSVRPVL WQSGRFCCGR RANRRVRHGR 

151 QDNRPWLPMR ESRRQSAYPV CLRTAELLPA RTRCLCRLKR RIPPAAGCLP 

201 PARPDNRSNG GSSAYRTMHK TLRPYERP*R QGCSFAAAAA RRRHRARVRR 

2 51 LRGYQTALPN PELHRCRYAV R* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 13 1>: 

a036 . seq 

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTTGCGTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCCGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

2 51 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCACGC GGACGGGTTG 

301 CAGACGGCAT CGAGCGCGGC GAGTGCGGCG CAATCGGCAT AAACGGCGCG 

351 GCGGATGTTC ACAGGCGCGC CCTCCGTTCC GCCTGTTCTT TGGCAGTCAA 

4 01 GGCGATTTTG TTGCGGACGT AGAGCAGCTC GGCGTGTGCC GCAGCGACGG 

4 51 CGGGAAAACC GCCTTCAGCC GCCAGATTGA GGAAGTCGGC GGCGGTCGGC 

501 ATATCGGGTT TGCCTGAGAA GGGCGGACGG TTTTCCAGCG CGAACGCATT 

551 GCCGATGCCG TCTGAAAAGG CGCATCCTTC CGGCAGCCGG ATGTCTGCCG 

601 CCCGACCGAC CTGATAATCG CTCAAACGGC GGCGGTTCAG CGTGTCGAAC 

651 CATGCATAAA ACACTTCGCC CATACGTGCG TCCGCAGCGG CAAGGATGCA 

7 01 GCTTTGCGGC GGCGGCAGCG AGGCGGCGGC ATCGAGCGAG GGTACGCCGA 

7 51 TTAAAGGAGT ATCAAACGGC GTTGCCAAAC CTTGCGCCAC GCCGATGCCG 

801 ATACGCAGTC CCGTAA 

This corresponds to the amino acid sequence <SEQ ID 132; ORF 036. a>: 

a036 . pep 

1 MLKPCAVYSA CAAVLPARTS SSRRCVSSGR CVNQYSSRAD AIPWRRHSGA 

51 VAIRCSSDSS GRFCQTIKAA IPPSFSARKT CSDGETSADS NWRCVHADGL 

101 QTASSAASAA QSA+TARRMF TGAPSVPPVL WQSRRFCCGR RAARRVPQRR 

151 RENRLQPPD* GSRRRSAYRV CLRRADGFPA RTHCRCRLKR RILPAAGCLP 

201 PDRPDNRSNG GGSACRTMHK TLRPYVRPQR QGCSFAAAAA RRRHRARVRR 

251 LKEYQTALPN LAPRRCRYAV P* 

m036/a036 85.6% identity over a 270 aa overlap 

10 20 30 40 50 60 

m03 6.pep MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAIPWRRHSGAVAIRCSSDSS 
I | M 1 I I I I I I I I M I I I I I I I I I I I ! I I t M I I I i ( I I I I I t 1 I I I ! M II I 1 i I i I I I 
a03 6 MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAI PWRRHSGAVAIRCSSDSS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m036.pep GRFCQTIKAAI PXSFSARKTCSDGETSADSNWRCVHADGLQTASSAASSSQSAQTARRMF 

i I I I I I i i I II I I I I I I 1 I I I ! I M I I I ! I I I I I I I I I I I I I I I I I i : : M I MINI 
a 0 3 6 GRFCQTI KAA I P P S F S ARKT C S DG ET S A D S N W RC VH A DG LQT A S S AA S AAQ S AX TAP. RM F 

70 30 90 100 110 120 



GCCTGTTCTT 
GGCGTGTGCG 
GAAAGTCGGC 
TCTTCCAGCG 
CGGCAGCCGG 
GGCAGTTCAG 
TCCGTAGCGG 
ATCGAGCGAG 
CCTGAGCTAC 



TGGCAGTCAG 
GCATGGACGG 
GGCAGTCGGC 
CGAACGCGCT 
ATGTCTGCCG 
CGTATCGAAC 
CAAGGATGCA 
GGTACGCCGA 
ACCGATGCCG 



WO 99/57280 



PCTAJS99/09346 



221 



130 140 150 160 170 180 

m0 3 6. pep TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 
III M 111:1! I I I I ! I It III : I : : ! I I II I : I I I lit) I : : I I 
a03 6 TGAPSVPPVLWQSRRFCCGRRAARRVPQRRRENRLQPPDXGSRRRSAYRVCLRRADGFPA 

130 140 150 160 170 ISO 

190 200 210 220 230 240 

m036.pep RTRCLCRLKRRI PPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPXRQGCSFAAAAA 

i I : I I I I I I II I ! I i I II I 1 I I II I II I : i I I I I I M I ! I I II I I I I I I I I I I j 
a036 R T H C RCR LKR R I L P AAG C L P P DR P DN R S N G GG S AC RTM H KT L R P Y V R PQRQGC S FAAAAA 

190 200 210 220 230 24 0 

250 260 270 

m03 6.pep RRRHRARVRRLRGYQTALPNPELHRCRYAVRX 

I I I II ! I I 1 I i : II I I I I I : I I I I I I 
a03 6 RRRHRARVRRLKEYQTALPNLAPRRCRYAVPX 

250 260 270 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 036 shows 74.9% identity over a 271 aa overlap with a predicted ORF (ORF 036. ng) 
from N. gonorrhoeae: 

m036/g036 

10 20 30 40 50 60 

m03 6.pep MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAI PWRRHSGAVAIRCSSDSS 

Mill I I I i I M : I I I I I I i M i I I MM: M I M I M 1 I M M M M I M I II I 
g036 MLKPCLVYSACAAALPARTSSSRRCVPSGRCAYQYSSRADATPRRRHSGAVAIRCSSDSS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m036 .pep GRFCQTIKAAIPXSFSARKTCSDGETSADSNWRCVHADGLQTASSAASSSQSAQTARRMF 
M I I M II I M I I M I M M I M I M M I M II I M I M M II I M : M I I ill 
g036 GRFCQTIKAAILPSFSARKTCSDGETSADSNWRCVHADGLQTVSSAASAAQSDGEAGRMF 

70 80 90 100 110 120 

130 140 150 160 170 180 

m0 36.pep TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 
: II M I M I M I II M M III : MM I I M I M 1 I MM I : M : 

g03 6 MFVPSVPPVLWQSGRFCCGRRAVRRVPRQLRDSRRRGRARENRRRSAYRVCLRRADGFPV 

130 140 150 160 170 180 

190 200 210 220 230 240 

m03 6.pep RTRCLCRLKRRI PPAAGCLPPARPDNRSNGGSSAYRTMKKTLRPYERPXRQGCS FAAAAA 

MM I I M M I : : MM ! M M M I M I II M I I I M II I : M I M II I 
g03 6 RTHCRCRLKRRTPRGGQCLPPYRLDNRSNGGGSACRTTHKTLRPYARPQRRVCS FAAAAA 

190 200 210 220 230 240 

250 260 270 

mO 3 6 . pep RRRHRARVRRLRG YQTALPNPELHRCRYAVRX 

I II I II II:: M II I I : II I II M I 
g0 36 RRRHRAWGCRLKACRTALPNLAPRRCRYAVRX 

250 260 270 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 133>: 

m036-l . seq 

1 ATGCTGAAGC CGTGCGCCGT GTACAGTGCC TGTGCGGCGG TGTTGCCTGC 

51 ACGGACTTCG AGCAGCAGGC GTT~~GTGTC TTCGGGCAGA TGTGTGAACC 

101 AATATTCGAG CAGGGCGGAC GCAATTCCTT GGCGGCGGCA TTCGGGCGCG 

151 GTGGCAATCA GGTGCAGTTC GGATTCGTCG GGCAGGTTCT GCCAAACGAT 

201 AAAGGCGGCA ATCCCGCCGT CTTTTTCCGC AAGGAAAACC TGTTCGGACG 

251 GCGAAACCAG TGCGGACTCA AATTGGCGTT GCGTCCATGC GGACGGGTTG 

301 CAGACGGCAT CGAGTGCGGC CAGCTCCTCA CAATCGGCAC AAACGGCACG 
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351 
401 
451 
501 
551 
601 
651 



GCGGATGTTC 
GGCGATTTTG 
CAGGATAACC 
ATATCCGGTC 
GCCTATGCCG 
CCCGC^CGAC 
CATGCATAAA 



ACGGGCGCGC 
TTGCGGACGT 
GCCCTTGGCT 
TGCCTGAGAA 
TCTGAAAAGG 
CTGATAATCG 
ACACTTCGCC 



TCTCCGTTCG 
AGAGCAAACC 
GCCAATGCGA 
CGGCGGAGCT 
CGCATCCCTC 
CTCAA ^CGGT 
CATACGAGCG 



GCCTGTTCTT 
GGCGTGTGCG 
GAAAGTCGGC 
TCTTCCAGCG 
CGGCAGCCGG 
GGCAGTTCAG 
TCCGTAG 



TGGCAGTCAG 
GCATGGACGG 
GGCAGTCGGC 
CGAACGCGCT 
ATGTCTGCCG 
CGTATCGAAC 



This corresponds to the amino acid sequence <SEQ ID 134; ORF 0036-1 >: 

m036-l .pep 

1 MLKPCAVYSA CAAVL PARTS SSRRCVSSGR CVNQYSSRAD AIPWRRKSGA 

51 VAIRCSSDSS GRFCQTIKAA IPPSFSARKT CSDCETSADS NWRCVHADGL 

101 QTASSAASSS QSAQTARRMF TGALSVRPVL WQSGRFCCGR RANRRVRHGR 

151 QDNRPWLPMR ESRRQSAYPV CLRTAELLPA RTRCLCRLKR RIPPAAGCLP 
201 PARPDNRSNG GSSAYRTMHK TLRPYERP* 

m036-i/g036 76.8% identity in 228 aa overlap 

10 20 30 40 50 60 

m03 6-l .pep MLKPCAVYSACAAVLPARTSSSRRCVSSGRCVNQYSSRADAI PWRRHSGAVAIRCSSDSS 
Mill I I I 1 I i I : I I I I 1 I 1 M I M Mil: I I I M II ! I I I I I M ! I II I ! I I M 
g 0 3 6 MLKPCLVYSACAAALPARTSSSRRCVPSGRCAYQYSSRADATPRRRHSGAVAIRCSSDSS 

10 20 30 40 50 60 

7 0 SO 90 100 110 12 0 

m03 6-l .pec GRFCQTIKAAI PPSFSAKKTCSDGETSADSNWRCVHADGLQTAS3AASSSQSAQTARRMF 
M I I I I I II M I M ! I I I 1 I M M M II M I I I II M M I I : M I I I : : I I I Ml 
g 03 6 GRFCQTIKAAI LPSFSARKTCSDGETSADSNWRCVHADGLQTVSSAASAAQSDGEAGRMF 

70 80 90 100 110 120 

130 140 150 160 170 180 

m0 3 6-l .pep TGALSVRPVLWQSGRFCCGRRANRRVRHGRQDNRPWLPMRESRRQSAYPVCLRTAELLPA 
: M M II M I I I II M I I III : : 1 : I M : ! I : I I i I I I I I : : 1 : 

g 03 6 MFVPSVPPVLWQSGRFCCGRRAVRRVPRQLRDSRRRGRARENRRRSAYRVCLRRADGFPV 

130 140 150 160 170 180 

190 200 210 220 229 

m036-l .pep RTRCLCRLKRRIPPAAGCLPPARPDNRSNGGSSAYRTMHKTLRPYERPX 

MM i I I M I I : : Mil I I M I II I : I I II I I M I I I II 
g036 RTHCRCRLKRRTPRGGQCLPPYRLDNRSNGGGSACRTTHKTLRPYARPQRRVCSFAAAAA 
190 200 210 220 230 240 



g036 



RRRHRAWGCRLKACRTALPNLAPRRCRYAVRX 
250 260 270 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 135>: 

g038 .seq 

1 ATGACTGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTTA CCACCAAAGC CGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTC AACGACGGCG CGTCCACGCT GCAACTGGCA 

151 AAATTCTATG CACAATCCAT CATTGAAAGC GGCATCCGAT TCGATATGCT 

2 01 GTTCGGCCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

2 51 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC GCGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG ACGTGATTTC CGCCGGCACA TCCGTACGCG 

4 01 AATCAATCAA ACTGATTGAA GCGGAGGGTG CAACCCCCGC CGGTGTCGCC 

4 51 ATCGCGCTCG ACCGCATGGA AAAAGGCACG GGTAAATTGT CCGCCGTTCA 

501 GGAAGTGGAA AAACAATACG GCCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATCCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GCACCTACCG CCGGCAGTAC GGCGTAGAAT AA 
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This corresponds to the amino acid sequence <SEQ ID 1 36; ORF 03S.ng>: 

q03S . pep 

1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGASTLQLA 

51 KFYAQSIIES GIRFDMLFGP AYKG 1 1 LAAA TAMMLAEKGV NVPFAYNRKE 

101 AKDRGEGGVL VGAPLKGRVX IIDDVISAGT SVRESIKLIE AEGATPAGVA 

151 IALDRMEKGT GKLSAVQEVE KOY GLPVAPI ASLNDLFILL QNNPEFGQFL 

201 EPVRTYRRQY GVE* 

The following partial DNA sequence was identified in A . meningitidis <SEQ ID 137>: 

m038 .seq 

1 ATGACCGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTTA CCACCAAGGC AGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTT AACGACGGCT TGTCCACGCT GCAACTGGCA 

151 AAATTTTACG CACAATCCAT CATTGAAAGC GGCATCCGAT TCGATATGCT 

201 GTTCGGTCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC ACGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG AC GT GATTTC CGCCGGCACA TCCGTACGCG 

401 AATCGATCAA ACT GATTGAA GCGGAGGGTG CAACCCCcGC CGGTGTCGCC 

451 ATCGCGCTCG ATCGCATGGA AAAAGGCACG GGTGAATTGA GCGCGGTTCA 

501 GGAAGTGGAr AAACAATACG GkCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATTCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GAGCCTACCG TCGGCAGTAC GGCGTAGAAT AA 

This corresponds to the amino acid sequence <SEQ ID 138; ORF 038>: 

m038 . pep 

1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGLSTLQLA 

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYNRKE 

101 AKDFGE GGVL VGAPLKGRVX IIDDVISAGT SVRESIKLIE AEGATPAGVA 

151 IALDRMEKGT GELSAVQEVE KOY GLPVAPI ASLNDLFILL QNNPEFGQFL 

201 EPVRAYRRQY GVE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 139>: 

a038 . seq 

1 ATGACCGATT TCCGCCAAGA TTTCCTCAAA TTCTCCCTCG CCCAAAATGT 

51 TTTGAAATTC GGCGAATTCA CCACCAAAGC CGGACGGCGG TCGCCCTATT 

101 TCTTCAATGC CGGCCTCTTT AACGACGGCT TGTCCACGCT GCAACTGGCA 

151 AAATTTTACG CACAATCCAT CATTGAAAGC GGCATCCGAT TCGATATGCT 

2 01 GTTCGGCCCC GCCTACAAAG GCATTATTTT GGCGGCGGCA ACCGCGATGA 

251 TGCTGGCGGA AAAAGGCGTG AACGTCCCGT TTGCCTACAA CCGCAAAGAA 

301 GCCAAAGACC ACGGCGAAGG CGGCGTGTTG GTCGGCGCGC CGCTTAAAGG 

351 GCGCGTGCTG ATTATCGACG ACGTGATTTC CGCCGGCACA TCCGTACGCG 

401 AATCGATCAA ACTGATTGAA GCGGAGGGTG CAACCCCCGC CGGTGTCGCC 

4 51 ATCGCGCTCG ACCGCATGGA AAAAGGCACG GGTGAATTGA GCGCGGTTCA 

501 GGAAGTGGAA AAACAATACG GCCTGCCCGT CGCCCCCATC GCCAGCCTGA 

551 ACGATTTGTT TATTCTGTTG CAAAACAACC CCGAATTCGG ACAGTTCCTC 

601 GAACCCGTCC GAGCCTACCG TCGGCAGTAC GGCGTAGAAT AA 

This corresponds to the amino acid sequence <SEQ ID 140; ORF 038.a>: 

a038 . pep 

1 MTDFRQDFLK FSLAQNVLKF GEFTTKAGRR SPYFFNAGLF NDGLSTLQLA 

51 KFYAQSIIES GIRFDMLFGP AYKGIILAAA TAMMLAEKGV NVPFAYNRKE 

101 AKDHGEGGVL VGAPLKGRVL IIDDVISAGT SVRESIKLIE AEGATPAGVA 

151 IALDRMEKGT GELSAVQEVE KQY GLPVAPI ASLNDLFILL QNNPEFGQFL 

201 EPVRAYRRQY GVE- 

m038/a038 100.0% identity over a 213 aa overlap 

10 20 30 40 50 60 

m038 .Dec MT D FRQDFLKFS LAQN V LK FGE FTT KAGRRS P Y F FN AG LFN DG LST LQLAKFYAQS 1 1 ES 
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! I I I I i I I I I I M I i I 1 I I I I I I I ! I i ! M 1 I i I I I I I I 1 I II M I i i II i I I I i I I t i I 
a0 3 8 MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSIIES 
10 20 30 40 50 60 

70 80 90 100 110 120 

m038 . pep GIRFDMLFGPAYKGI ILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL 

I I I I I I i I ! t [ ! ! I i ! i I 1 ! I ! I I I I I ! ! I f I I i I ! I I ! M I ! I I I i I I If I I I I I I M I 
a 038 GI RFDMLFGPAYKGI ILAAATAMMLAEKGVNVPFAYNRKEAKDKGEGGVLVGAPLKGRVL 

70 SO 90 100 110 120 

130 140 150 160 170 180 

m03 8 . pep I I DDVI SAGTSVRES I KL I EAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAP I 

I i I I I ! I I I I I I i I i M I I I ! ! I I ! i ! I ! i I I I I M I I M I I I i I i I I I I ! ! ! M I I I I i 
a03 8 I I DDVI SAGTSVRES IKLIEAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAPI 

130 140 150 160 170 180 

190 200 210 

m038 . pep ASLNDLFILLQNNPEFGQFLEPVRAYRRQYGVEX 
I I I I I M I I I I ! I I I I I I I 1 ! i I I I I I I I I I I t i 
a038 ASLNDLFILLQNNPEFGQFLEPVRAYRRQYGVEX 

190 200 210 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 038 shows 98. 1 % identity over a 2 1 3 aa overlap with a predicted ORF (ORF 038.ng) 
from N, gonorrhoeae: 

in038/g038 

10 20 30 40 50 60 

m038.pep MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFNDGLSTLQLAKFYAQSI IES 

I M I I I I t I I I I I I 1 I I I M I I I I I ! I I M I I I I I I I I I I i I i II I i I I I i I It I I I I I 
g038 , MTDFRQDFLKFSLAQNVLKFGEFTTKAGRRSPYFFNAGLFN DGASTLQLAKFYAQSI IES 

10 20 30 40 50 60 

70 80 90 100 110 120 

m03 8.pep GIRFDMLFGPAYKGI ILAAATAMMLAEKGVNVPFAYNRKEAKDHGEGGVLVGAPLKGRVL 

I I I I I I I I I I I I I I I I I i I M I i I ! I I I M I I I I I I I M I I i I : I I M I I I I I II i I I i | 
g038 GIRFDMLFGPAYKGI ILAAATAMMLAEKGVNVPFAYNRKEAKDRGEGGVLVGAPLKGRVL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m038.pep II DDVI SAGTSVRES I KLIEAEGATPAGVAIALDRMEKGTGELSAVQEVEKQYGLPVAP I 

I I I i I II II II I I I I M II I I M I M II I I I I I I I I I I I I i : I t I I I I I i I! I II I I I I I 
g038 1 1 DDVI SAGTSVRES I KLIEAEGAT PAGVAIALDRMEKGTGKLSAVQEVEKQYGLPVAPI 

130 140 150 160 170 180 

190 200 210 

m038 . pep AS LN DL FI LLQNN PE FGQFLE PVRAYRRQY GVEX 

I I M I I I M I I I I I I I I I I I M I I : i I I I I I I I I 
gO 3 8 AS LNDLFI LLQNNPE FGQFLE PVRTYRRQYGVEX 

190 200 210 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 141 >: 

g039 . seq 

1 ATGCCGTCCG AACCACCTGC CGCTTCAGAC GG CATC AAA C CGACACACAC 

51 CGAGAAAACA TCATGCCCGC CTGTTTCTGT CCGCACTGCA AAACCCGCCT 

101 CTGGGTCAAA GAAAcccagC TCAAcgtCgC ccaagGCTTC GTCGTCTgcc 

151 aaAAAtgcga agGGCTgttt aaAgccaaaG accAtctggc aaGcacGAAA 

201 gaacctatat tcaacgattg gcccgaagct gtttcgggat gTcaaaCTCG 



WO 99/57280 PCT/US99/09346 

225 



251 TCcaccgcaT cggcacgcac gccattagca aGAaacagat gtcccgcgac 

301 gaaatCgccg atatcctcaa cggcggtaca acCCTGCACG ATACGCCGCC 

3 51 CGCAACCGCC GCTGCCGCac ctGCCGCCGC ACCGCaggTT TCCGTACCGC 

4 01 CCGCCCGTCA GGAAGGGCTC AACTGGACTA TTGCAACCCT GTTCGCACTT 
451 ATCGTCCTCA TTATGCAGCT TTCCTACCTC TTCATCCTAT GA 



This corresponds to the amino acid sequence <SEQ ID 142; ORF 039.ng>: 

g0 3 9 .pep 

1 MPSEPPAASD GIKPTHTEKT SCPPVSVRTA KPASGSKKPS STSPKASSSA 

51 KNAKGCLKPK TIWQARKNLY STIGPKLFRD VrdVHRIGTH AISKKQMSRD 

101 EIADILNGGT TLH?TPPATA AAAPAAAPQV SV7PARQEGL NWT I ATLFAL 

151 IVLIMQLSYL FIL* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 143>: 



m03 9 . seq 












1 


ATGCCGTCCG 


AACCGCCTTA 


CGCCTCAGAC 


GGCATCAAAC 


CTGACACACA 


51 


CGAGGAAATA 


CCATGCCCGC 


CTGTTTCTGC 


CCCCACTGCA 


AAACCCGTCT 


101 


CTGGGTCAAA 


GAAACCCAAC 


TCAATGTCGC 


CGnnnnnnnn 


nnnnnnnnnn 


151 


nnnnnnnnnn 


nnnr nnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


201 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnCCC 


GAGGCTGTTT 


251 


CGGATGTCAA 


ACTCGTTCAC 


CGTATCGGCA 


CGCGCGCCAT 


CGGCAAGAAA 


301 


CAGATTTCCC 


GTGACGAAAT 


CGCCGGCATC 


CTCAACGGCG 


GTACAACCCA 


351 


GCCCGATATT 


CCGCCCGCAA 


CCGCCGCCAC 


CCCTGCTGCC 


GCACCGCAGG 


401 


TTACCGTACC 


GCCCGCCGCG 


CCCGCCCGTC 


AGGATGGGTT 


CAACTGGACG 


451 


ATTGCAACCC 


TGTTTGCCCT 


TATCGTCCTC 


ATTATGCAGC 


TTTCCTACCT 


501 


CGTCATCCTA 


TGA 









This corresponds to the amino acid sequence <SEQ ID 144; ORF 039>: 

m03 9 .pep 

1 MPSEPPYASD GIKPDTHEEI PCPPVSAPTA KPVSGSKKPN SMSPXXXXXX 

51 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXP EAVSDVKLVH RIGTRAIGKK 

101 QISRDEIAGI LNGGTTQPDI PPATAATPAA APQVTVPPAA PARQDGFNWT 

151 IATLFALIVL IMQL5YLVIL * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 145>: 

a039 . seq 

1 ATGCCGTCTG AACCGCCTTA CGCCTCAGAC GGCATCAAAC CTGACACACA 

51 CGAGGAAATA CCATGCCCGC CTGTTTCTGC CCCCACTGCA AAACCCGTCT 

101 CTGGGTCAAA GAAACCCAAC TCAATGTCGC CCAAGGCTTC GTCGTCTGCC 

151 AAAAATGCGA AGGAATGTTT AAAGCCAAAG ACCATCTGGC AAGCACGAAA 

2 01 GAACCCATAT TCAACGATT . TGCCCGAAGC TGTTTCGGAT GTCAAACTCG 

2 51 TTCACCGCAT CGGCACGAGC GCCATCGGCA AGAAACAGAT TTCCCGTGAC 

301 GAAATCGCCG GCATCCTCAA CGGCGGCACA ACCCAGCCCG ATATTCCGCC 

351 CGCAACCGCC GCCACCCCTG CTGCCGCACC GCAGGTTACC GTACCGCCCG 

4 01 CCGCGCCCGC CCGTCAGGAT GGGTTCAACT GGACGATTGC AACCCTGTTT 

4 51 GCCCTTATCG TCCTCATTAT GCAGCTTTCC TACCTCGTCA TCCTATGA 

This corresponds to the amino acid sequence <SEQ ID 146; ORF 039.a>: 

a039 .pep 

1 MPSEPPYASD GIKPDTHEEI PCPPVSAPTA KPVSGSKKPN SMSPKASSSA 

51 KNAKECLKPK TIWQARKNPY STIXPEAVSD VKLVHRIGTS AIGKKQISRD 

101 EIAGILNGGT TQPDI PPATA ATPAAAPQVT VPPAAPARQD GFNWTIATLF 

151 ALIVLIMQLS YLVI L* 



m039/a039 79.4% identity over a 1 70 aa overlap 

10 20 30 40 50 60 

m039.pep MPSEPPYASDGIKPDTHEEI PCPPVSAPTAK PVSGSKKPNSMSPXXXXXXXXXXXXXXXX 

I I I i I I I I I I I I I I I I I I I I I I I I I i i I It I 1 I I 1 | | ] | | | | | ! 
a03 9 MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPNSMSPKASSSAKNAKECLKPK 
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10 20 30 ^0 50 60 

70 80 90 100 110 120 

m03 9. pep XXXXXXXXXXXXXXXXXXXPEAVSDVKLVHRIGTRAIGKKQI SRDEIAGI LNGGTTQPDI 
: : I ! I I I ! I ! 1 I I I I ! j 1 I I ! I I I I I I I I f I I i I I I | I | | I | | 

a 0 3 9 T I v. Q AR K N P Y S T I X ? E A V 5 Tv KLV HRIGTSAIGKKQI SRDE I AGI LNGGTTQPDI 

70 80 90 100 110 

130 140 150 160 170 

mO 3 9 . p e p P PATAAT P AAA PQVT V P PAA PARQDG FNWTIATLF ALIVLIMQLS Y L V I LX 

I I I I f I I I I I t I I I I I I I M I I f I I ! I I I I I I I | 1 I II | | | 1 | | | | l | | M 
a03 9 ?? AT AATP AAA PQVT VPPAA PARQDG ENWT IATLFALI VLIMQLSYLVILX 

120 130 140 Ho 160 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted QRF from N. gonorrhoeae 

ORF 039 shows 60.8% identity over a 1 71 aa overlap with a predicted ORF (ORJF 039. ng) 
from N. gonorrhoeae: 

m039/g039 

10 20 30 40 50 60 

m03 9 . pep MPSEPPYASDGIKPDTHEEIPCPPVSAPTAKPVSGSKKPN5M3PXXXXXXXXXXXXXXXX 

hum mini h Huh ■■! .;■: || 

g03 9 MPSEPPAASDGIKPTHTEKTSCPPVSVRTAKPASGSKKPSSTSFKASS3AKNAKGCLKPK 
10 20 30 40 50 60 

70 80 90 100 110 120 

m03 9 .pep XXXXXXXXXXXXXXXXXXX P E AVS DVKLVHR I GTRMGKKQ I SRDEIAGI LNGGTTQPDI 

h IMMMMhlhllhlMIM llllMI | 

903 9 TIWQARKNLYSTIG PKLFRDVKLVHRIGTHAI SKKQMSRDEI ADI LNGGTTLHDT 

70 80 90 100 110 

130 140 150 160 170 

m03 9 .pep P PATAAT - PAAAPQVTVPPAAPARQDGFNWT I ATLFALI VL IMQLS YLV I LX 

IMIIh IIMMhllll IhhMIIIIIIIIMMIMIM Ml 

g03 9 PPATAAAAPAAAPQVSVPPA RQEGLNWTI ATLFALI VLIMQLSYLF I LX 

120 130 140 150 160 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 147>: 

g040 . seq 

1 ATGAACGCGC CCGACAGCTT TGTCGCCCAC TTCCGCGAAG CCGCCCCCTA 

51 CATCCGCCAA ATGCGCGGCA CGACACTGGT CGCCGGCATA GAcggCCGCC 

101 TGCTCGAAGG CGGCACCTTA AATAAGCTCG CCGCCGACAT CGGGCTGTTG 

151 TCGCAACTGG GCATCCGACT CGTCCTCATC CACGGCGCGT ACCACTTCCT 

201 CGAccgCCTC GCCGCCGCGC AAGgccGCAC GCCGCATTAT TGCCGgggtt 

251 tGCGCGTTAC CGACGaAACc tcGctcgGAC AGGCGCAGCA GtttGCCGGC 

301 AccgTCCGCA GCCGTTTTGA agcCGCATTG tgcggcagCG tttcaggatt 

351 cgcgCGCGCG CCTTCCGTCC CGCTCGTAtc gggcaacttc ctgacCGCCC 

401 GTCcgatggg cgtgactgac ggaACCGata tggaatacgc gggggttatc 

4 51 cgcaaaaccg ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT 

501 CGTCTGGATG CCGCCGCTCG GGCATTCCTA CGGCGGCAAA ACCTTCAATC 

551 TCGATATGGT GCAGGCCGCC GCTTCCGTCG CCGTCTCGCT TCAGGCCGAA 

601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 

651 GCTCGCCGAA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 

7 01 CCGCCAGCGA AACCCGACGA CTGATTTCGT CCGCCGTTGC CGCGCTCGAA 
751 GGCGGCGTGC ATCGCGTCCA AATCCTCAAC GGGGCCGCCG ACGGCAGCCT 
801 GCTGCAAGAA CTCTTCACCC GCAACGGCAT CGGCACGTCC ATTGCCAAAG 

8 51 AAGCCTTCGT CTCCATCCGG CAGGCGCACA GCGGCGACAT CCCGCACATC 
901 GCCGCCCTCA TCCGCCCGCT GGAAGAACAG GGCGTCCTAT TGCACCGCAG 
951 CCGCGAATAC CTCGAAAACC ACATTTCCGA ATTT7CCATC CTCGAACACG 
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1001 ACGGCGACCT GTACGGCTGT GCCGCACTCA AAACCTTTGC CGAAGCCGAT 

1051 TGCGGCGAAA TCGCCTGCCT TGCCGTCTCG CCGCAGGCAC AGGACGGCGg 

1101 ctACGGCGAA CGCCTGCTTG CCCACATTAT CGATAAGGCG CGCGGCATAG 

1151 GCATAAGCAG GCTGTTCGCA CTGTCCACAA ATACCGGCGA ATGGTTTGCC 

1201 GAACGCGGCT TTCAGACGGC ATCGGAAGAC GAGCTGCCCG AAACGCGGCG 

12 51 CAAAGACTAC CGCAGCAACG GACGAAACCC GCATATTCTG GTGCGTCGCC 

1301 TGCACCGCTG A 

This corresponds to the amino acid sequence <SEQ ID 148; ORF 040. ng>: 

g04 0 . pep 

1 MNAPDSFVAH FREAAPYIRQ MRGTTLVAGI _V-r,lLEGGTL NKLAADIGLL 

51 SQLGIRLVLI 11GAYHFLDRL AAAQGRTPHY CRGLRVTDET SLGQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPMGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWM PPLGHSYGGK TFNLDMVQAA ASVAVSLQAE 

201 KLVYLTLSDG I SRPDGTLAE TLSAQEAQSL AEHAASETRR LISSAVAALE 

251 GGVHRVQILN GAADGSi^QE LFTRNGIGTS IAKEAFVSIR QAHSGDI PHI 

301 AALIRPLEEQ GVLLHRSREY LENHISEFSI LEKDGDLYGC AALKTFAEAD 

351 CGEIACLAVS PQAQDGGYGE RLLAHI IDKA RGIGISRLFA LSTNTGEWFA 

4 01 ERGFQTASED ELPETRRKDY RSNGRNPHIL VRRLHR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 149>: 

m040 . seq 

1 ATGAGCGCGC CCGACCTCTT TGTCGCCCAC TTCCGCGAAG CCGTCCCCTA 

51 CATCCGCCAA ATGCGCGGCA AAACGCTGGT CGCCGGCATA GACGACCGCC 

101 TGCTCGAAGG TGATACCTTA AACAAGCTCG CCGCCGACAT CGGGCTGTTG 

151 TCGCAACTGG GCATCAGGCT CGTCCTCATC CACGGCGCGC GCCACTTCCT 

201 CGACCGCCAC GCCGCCGCTC AAGGCCGCAC GCCGCATTAT TGCCGGGGCT 

2 51 TGCGCGTTAC CGACGAAACC TCGCTCGAAC AGGCGCAgCA GTTTGCCGGC 

301 ACCGTCCGCA GCCGTTTTGA AGCCGCATTG TGCGGCAGCG TTTCCGGGTT 

351 CGCGCGCGCG CCTTCCGTCC CGCTCGTATC GGGCAACTTC CTGACCGCCC 

4 01 GTCr^ATA^G TGTGATTGAC GGAACCGATA TGGAATACGC GGGCGTTATC 

4 51 CGCAAAACCG ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT 

501 CGTCTGGCTG CCGCCGCTCG GACATTCCTA CAGCGGCAAG ACCTTCTATC 

551 TCGATATGCT TCAAACCGCC GCCTCCGCCG CCGTCTCGCT TCAGGCCGAA 

601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 

651 GCTCGCCGAA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 

7 01 CCGGCGGGCA AACGCGACGG CTGATTTCGT CCGCCGAACT CTTCACCCGC 

7 51 AACGGCATCG GCACGTCCAT TGCCAAAGAA GCCTTCGTCT CCATCCGGCA 

801 rGCGCAywgG G . CGACATCC CGCACATCGC CGCCCTCATC CGCCCGCTGG 

851 AAGAACAGGG CATCCTGCTG CACCGCAs . c GCGAATACCT CGAAAACCAC 

901 ATTTCCGAAT TTTCCATCCT CGAACACGAC GGCAACCTGT ACGGTTGCGC 

951 CGCCCTGAAA ACCTTTGCCG AAGCCGATTG CGGCGAAATC GCCTGCCTTG 

1001 CCGTCTCGCC GCag . cACAG GACGGCGGCT ACGGCGAACG CnTGCTTGCC 

1051 CACATTATCG ATAAGGCGCG CGGCATAGGC ATAAGCAGGC TGTTCGCACT 

1101 GTCCACAAAT ACCGGCGAAT GGTTTGCCGA ACGCGGCTTT CAGACGGCAT 

1151 CGGAAGACGA GTTGCCCGAA ACGCGGCGCA AAGACTACCG CAGCAACGGA 

1201 CGGAACTCGC ATATTCTGGT ACGTCGCCTG CACCGCTGA 

This corresponds to the amino acid sequence <SEQ ID 150; ORF 040>: 

m04 0 . pep 

1 MSAPDLFVAH FREAVPYIRQ MRGKTLVAGI DDRLLEGDTL NKL AADIGLL 

51 SQLGIRLVLI HGARHFLDRH AAAQGRTPHY CRGLRVTDET SLEQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPIGVID GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWL PPLGHSYSGK TFYLDMLQTA ASAAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAE TLSAQEAQSL AEHAGGQTRR LISSAELFTR 

251 NGIGTSIAKE AFVSIRQAHX XDIPHIAALI RPLEEQGILL HRXREYLENH 

301 ISEFSILEHD GNLYGCAALK TFAEADCGEI ACLAVSPQXQ DGGYGERXLA 

351 HI IDKARGIG ISRLFALSTN TGEWFAERGF QTASEDELPE TRRKDYRSNG 

4 01 RNSHILVRRL HR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 15 1>: 

a040.seq 
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I ATGATCGTGC CCGACCTCTT TGTCGCCCAC TTCCGCGAAG CCGCCCCCTA 

51 CATCCGCCAA ATGCGCGGCA AAACGCTGGT CGCCGGCATA GACGACCGCC 

101 TGCTCGAAGG TGATACCTTA AACAAGTTCG GCGCCGACAT CGGGCTTTTG 

151 TCGCAACTGG GCATCAGGCT CGTCCTCATC GkCGGCGCGC GCCACTTCCT 

201 CGACCGCCAC GCCGCCGCGC AAGGCCGCAC GCCGCATTAT TGCCGGGGCT 

251 TGCGCGTTAC CGACGAAACC TCGCTCGAAC AGGCGCAGCA GTTTGCCGGC 

3 01 ACCGTCCGCA GCCGTTTTGA AGCCGCATTG TGCGGCAGCG TTTCCGGGTT 
351 CGCGCGCGCG CCTTCCGTCC CGCTCGTATC GGGCAACTTC CTGACCGCCC 

4 01 GTCCGATAGG TGTGATTGAC GGAACCGATA TGGAATACGC GGGCGTTATC 
4 51 CGCAAAACCG ACACCGCCGC CCTCCGTTTC CAACTCGACG CGGGCAATAT 
501 CGTCTGGCTG CCGCCGCTCG GACATTCCTA CAGCGGCAAG ACCTTCCATC 
551 TCGATATGCT TCAAACCGCC GCCTCCGTCG CCGTCTCGCT TCAGGCCGAA 
601 AAACTCGTTT ACCTGACCCT TTCAGACGGC ATTTCCCGCC CCGACGGCAC 
651 GCTCGCCGTA ACCCTCTCGG CACAGGAAGC GCAATCGCTG GCGGAACACG 
7 01 CCGGCGGCGA AACGCGACGG CTGATTTCGT CCGCCGTTGC CGCGCTCGAA 

7 51 GGCGGCGTGC ATCGCGTCCA AATCCTCAAC GGAGCCGCCG ACGGCAGCCT 
801 GCTGCAAGAA CTCTTCACCC GCAACGGCAT CGGCACGTCC ATTGCCAAAG 

8 51 AAGCCTTCGT CTCCATCCGG CAGGCGCACA GCGGCGACAT CCCGCACATT 
901 GCCGCCCTCA TCCGCCCGCT GGAAGAACAG GGCATCCTGC TGCACCGCAG 
951 CCGCGAATAC CTCGAAAACC ACATTTCCGA ATTTTCCATC CTCGAACACG 

1001 ACGGCAACCT GTACGGTTGC GCCGCCCTGA AAACCTTTGC CGAAGCCGAT 

1051 TGCGGCGAAA TCGCCTGCCT TGCCGTCTCG CCGCAGGCAC AGGACGGCGG 

1101 CTACGGCGAA CGCCTGCTTG CCCACATTAT CGATAAGGCG CGCGGCATAG 

1151 GCATAAGCAG GCTGTTCGCA CTGTCCACAA ATACCGGCGA ATGGTTTGCC 

1201 GAACGCGGCT TTCAGACGGC ATCGGAAGAC GAGTTGCCCG AAACGCGGCG 

1251 CAAAGACTAC CGCAGCAACG GACGGAACTC GCATATTCTG GTGCGTCGCC 

1301 TGCACCGCTG A 

This corresponds to the amino acid sequence <SEQ ID 1 52; ORF 040.a>: 

a040 . pep 

1 MIVPDLFVAH FREAAPYIRQ MRGKTLVAGI DDRLLEGDTL NKF AADIGLL 

51 SQLGIRLVLI HGARHFLDRH AAAQGRTPHY CRGLRVTDET SLEQAQQFAG 

101 TVRSRFEAAL CGSVSGFARA PSVPLVSGNF LTARPIGVIQ GTDMEYAGVI 

151 RKTDTAALRF QLDAGNIVWL PPLGHSYSGK TFHLDMLQTA ASVAVSLQAE 

201 KLVYLTLSDG ISRPDGTLAV TLSAQEAQSL AEHAGGETRR LISSAVAALE 

251 GGVHRVQILN GAADGSLLQE LFTRNGIGTS IAKEAFVSIR QAHSGDIPHI 

301 AALIRPLEEQ GILLHRSREY LENHISEFSI LEHDGNLYGC AALKTFAEAD 

351 CGEIACLAVS PQAQDGGYGE RLLAHIIDKA RGIGI SRLFA LSTNTGEWFA 

4 01 ERGFQTASED ELPETRRKDY RSNGRNSHIL VRRLHR* 

m040/a040 91.5% identity in 436 aa overlap 

10 20 30 40 50 60 

MSAPDLFVAHFREAVPYIRQMRGKTLVAGIDDRLLEGDTLNKLAADIGLLSQLGIRLVLI 
f = I I I i ! M I 1 I I : 1 I I ] [ I I I 1 | [ | ] I | | | I N I I I ! I I I : j ! I I | | M I I i I I I I ! i 
MIVPDLFVAH FREAAPYIRQMRGKTLVAGIDDRLLEGDTLNKFAADIGLLSQLGIRLVLI 
10 - 20 30 40 50 60 

"70 SO 90 100 110 120 

HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQ FAGTVRSRFEAALCGSVSGFARA 
I I ! I I I I I I f I I I I I I I I I I I I i | | | | | | | | | | | | | | j | | | | | | | | | | | | | | | | | | ( | ; ( 

HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQ FAGTVRSRFEAALCGSVSGFARA 
7 0 80 90 100 110 120 

130 140 150 160 170 130 

PSVPLVSGNFLTARPIGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHSYSGK 
M I I I I I t i I I ! I I I I I Ei! | | ! i | | | | | | | ] | | | | j | } j i | ] j 

PSVPLVSGNFLTARPIGVI DGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHSYSGK 
130 1^0 150 160 170 180 

190 200 210 220 230 240 

TFYLDMLQTAASAAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAGGQTRR 

M : i I M I M ! I : I I I I I I ! I I I I I I I I I M I i I I I I [ j I t I I I 1 I I ! i ! : ! I ! 

TFKLDHLQTAASVAVSLQAEKLVYLTLSDGISRPDGTLAVTLSAQEAQSLAEHAGGETRR 



m04 0 . pep 
a040 

m04 0 , pep 
a040 

m040 . pep 
a040 

m04 0 . pep 
a040 
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190 200 210 220 230 240 



250 260 270 

m04 0. pep LISSA ELFTRNGIGTS IAKSAFVS IRQAHXXDI PHI 

I I I I I I I I I I i I I II I I : I I I I I I I I I I f I I i ! I 

a 0 4 0 L I S S A V AA LE GG V H R VQ I L N G AA D G S L LQE L FT RNG I GT S I AKE A FV S I RQAK SGDIPK. 

250 260 270 280 290 300 

280 290 300 310 320 330 

m04 0 .pep AALIRPLEEQGILLHRXREYLENHISEFSILEHDGNLYGCAALKTFAEADCGEIACLAVS 

I I I I I I M I I I ( I I ! I I I I I I I I I ! I M I I I ' : i 1 I I I I ! ! I I i I I I I i I I I I I I I I I 
a04 0 AALIRPLEEQGILLHRSREYLENHISEFSILLK JGNLYGCAALKTFAEADCGEIACLAVS 

310 320 330 340 350 360 

340 350 360 370 380 390 

m04 0 .pep PQXQDGGYGERXLAHI IDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 

II I I I I I I M M I I M II I I i I I I I I I 1! I I I I i I I I I M I I I II I 1 I I I i I I I I I I I 
a 04 0 PQAQDGGYGERLLAHI IDKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 

370 330 390 400 410 420 

400 410 
m04 0.pep RSNGRNSHILVRRLHRX 
I M t I II I I I I I I II I I 

a0 4 0 RSNGRNSHILVRRLHRX 
430 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from M. gonorrhoeae 

ORF 040 shows 88.3% identity over a 436 aa overlap with a predicted ORF (ORF 040.ng) 
from N. gonorrhoeae: 

m040/g040 



m04 0.pep MSAPDLFVAHFREAVPYIRQMRGKTLVAGI DDRLLEGDTLNKLAADIGLLSQLGIRLVLI 60 

1:111 I I I M I I I : I I M I I I I I I I ! I I I I ! I ! I I I I M It I M ! I ! I I I I I M I f 

g04 0 MNAPDSFVAHFREAAPYIRQMRGTTLVAGIDGRLLEGGTLNKLAADIGLLSQLGIRLVLI 60 

m04 0 . pep HGARHFLDRHAAAQGRTPHYCRGLRVTDETSLEQAQQFAGTVRSRFEAALCGSVSGFARA 120 

Ml I I I I I II I I I i I M I I I I I I I I! II I! I I I i I ! I I I! I If I I ! I I I I I I I I I! I 

g04 0 HGAYHFLDRLAAAQGRTPHYCRGLRVTDETSLGQAQQFAGTVRSRFEAALCGSVSGFARA 120 

m04 0.pep PSVPLVSGNFLTARPIGVIDGTDMEYAGVIRKTDTAALRFQLDAGNIVWLPPLGHSYSGK 180 

I I I I I I II 1 I I M ! I : I I I I M I I I I I I I I I I I I I I I I I I I i I I I I I I I : I I I I I I I : I I 

g04 0 PSVPLVSGNFLTAR PMGVI DGTDMEYAGVIRKTDTAALRFQLDAGNIVWMPPLGHSYGGK 180 

m04 0 . pep TFYLDMLQTAASAAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAGGQTRR 24 0 

II II I : I : I I I : I I I I I I I I I I I I I II I I I I I I II M I !! I I ! I I I I I I I I i I ::: I I I 

g04 0 TFNLDMVQAAASVAVSLQAEKLVYLTLSDGISRPDGTLAETLSAQEAQSLAEHAASETRR 24 0 

mO 4 0 . pep LISSA ELFTRNGIGTS I AKEAFVS IRQAHXXDI PHI 27 6 

I I I I i M I I I I I I I 1 I I I I I I I I I I I I I i I I M I 

g04 0 LI SSAVAALEGGVHRVQILNGAADGSLLQELFTRNGIGTS IAKEAFVSIRQAHSGDI PHI 300 

m040.pep AALIRPLEEQGILLHRXREYLENHISEFSILEHDGNLYGCAALKTFAEADCGEIACLAVS 336 

I I I ! I I I I I I I : I I I I I I I I I I I I I I I II I I I I I : i I I I II I I I I I I I I I I I I I I I I I 1 

g04 0 AALIRPLEEQGVLLHRSREYLENHI SE FS I LEHDGDLYGCAALKTFAEADCGE IACLAVS 3 60 

m04 0 . pep PQXQDGGYGERXLAHI I DKARGIGISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 3 9 6 

II I I I I I M I I I I I I I I I I I I I I I I t I I M M I M I I I I I I I I I I t t I ! I I I I I I I I | 
g040 PQAQDGGYGERLLAHI I DKARG- " ISRLFALSTNTGEWFAERGFQTASEDELPETRRKDY 420 

m040.pep RSNGRNSHILVRRLHRX 413 

I I I I I I I I I I I M I I I 
g040 RSNGRNPHILVRRLHRX 4 37 
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The following partial DNA sequence was identified in A', gonorrhoeae <SEQ ID 153>: 

g04 1 . seq 

1 ATGAGTTCGC CCAAACACAT CGGCTT3CAG GGCGGCAGCA ACGGCGGCCT 

51 GATTACCGCC GCCGCCTTCG TGCGCGAACC GCAAAGCATC GGTGCGCTGG 

101 TGTGCGAAGT ACCGCTGACC GATATGATCC GTIATCCGCT GCTGTCCGCC 

151 GGTTCAAGTT GGACGGACGA ATACGGCAAT CCGCAGAAAT ACGAAGCCTG 

201 CAAACGCCGG CTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA 

2 51 TCGATTATCC GCCCGCACTC ATTACCACCA GCCTCAGCGA CGACCGCGTC 
301 CATCCCGCCC ACGCGCTCAA ATTCTACGCC AAACTGCGCG AAACCTCGCC 

3 51 GCAATCTTGG CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

4 01 CCCAACGCGA ATCCGCCGAC AAACTCGCCT GCGTGTTGCT GTTTTTGAAA 
4 51 GAATTTTTGG GATAA 

This corresponds to the amino acid sequence <SEQ ID 154; ORF 041.ng>: 

g04 1 . pep 

1 MSSPKH IGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA 
51 GSSWT DEYGN PQKYEACKRR LGELSPYHNL SDGIDYPPAL 1TTSLSDDRV 
101 HPAHALKFYA KLRETSPQSW LYSPDGGGHT GNGTQRESAD KLACVLLFLK 
151 EFLG * 

The following partial DNA sequence was identified in A ; . meningitidis <SEQ ID 155>: 

CCGAACACAT CGGCTTGCAG GGCGGCAGCA ACGGCGGACT 
GCCGCCTTCG TGCGCGAACC GCAAAGCATC GGCGCGCTGG 
GCCGCTGACC GACATGATCC GTTATCCGCT GCTCTCCGCC 
GGACAGACGA ATACGGCAAT CCGCAAriAAT ACGAAGTCTG 
TTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA 
GCCCGCGCTC ATTACCACCA GCCTGTCCGA CGATCGCGTC 
ACGCGCTCAA GTTCTACGCC AAACTGCGCG AAACCTCCGC 
CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 
ATCCGCCGAC GAACTCGCCT GCGTCTTGCT GTTTTTGAAA 
GCTAA 

This corresponds to the amino acid sequence <SEQ ID 156; ORF 04 1>: 

m04 1 . pep 

1 ISSPEHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA 
51 GSSWT DEYGN PQKYEVCKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV 
101 HPAHALKFYA KLRETSAQSW LYSPDGGGHT GNGTQRESAD ELACVLLFLK 
151 EFLG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 157>: 



m04 1 . seq 




1 


ATCAGTTCGC 


51 


GATTACTGCC 


101 


TGTGCGAAGT 


151 


GGTTCAAGCT 


201 


CAAACGCCGG 


251 


TCGATTATCC 


301 


CATCCCGCCC 


351 


GCAATCTTGG 


401 


CCCAACGCGA 


451 


GAGTTTTTGG 



a04 1 . seq 




1 


ATCAGTTCGC 


51 


GATTACTGCC 


101 


TGTGCGAAGT 


151 


GGTTCAAGCT 


201 


CAAACGCCGG 


251 


TCGATTATCC 


301 


CATCCCGCCC 


351 


GCAATCTTGG 


401 


CGCAGCGCGA 


451 


GAGTTTTTGG 



TTGGGCGAAT TGTCGCCGTA TCACAATCTT TCAGACGGCA 

GCCCGCGCTC ATTACCACCA GCCTGTCCGA CGATCGCGTC 

ACGCGCTCAA GTTCTACGCC AAACTGCGCG AAACCTCGCC 

CTCTACTCGC CTGACGGCGG CGGCCATACC GGCAACGGCA 

AGCCGCCGAC GAACTCGCCT GCGTGTTGCT GTTTTTGAAA 
GCTAA 

This corresponds to the amino acid sequence <SEQ ID 158; ORF 041. a>: 

a04 1 . pep 

1 ISSPEHIGLQ GGSNGGLITA AAFVREPQSI GALVCEVPLT DMIRYPLLSA 

SI GSSWTDEYGN PQKYEVCKRR LGELSPYHNL SDGIDYPPAL ITTSLSDDRV 

101 HPAHALKFYA KLRETSPQSW LYSPDGGGHT GNGTQREAAD ELACVLLFLK 
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151 EFLG* 

m04i/a04i 98.7% identity over a 154 aa overlap 

10 20 30 -;0 50 60 

in04 1 . pep ISSPEHIGLQGGSNGGLI T AAA FVRE ?Q S I G A L V C E V ? LT DM I R Y ? L L SAG S S W T D E Y GN 

t i I ! I i II I I I I f I I I I I I I I I f I 1 I I i I I I I I I I I I I I i t I I I I I I I I I I i I M I I I |[ 
a0 4 1 ISSPEHIGLQGGSNGGLITAAAFVREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 

10 20 30 40 50 60 

70 80 90 100 110 120 

mO'll . pep PQKYEVCKRRLGELSPYHNLSDGI DYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSW 

I i I I I I I I 1 M i I I I I I I I I I ( I I I I I I II II I ! I I I I I ! I I I I I 1 I I | | U I M I III 
a0 41 PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSW 

70 80 90 100 110 120 

130 140 150 

m041 .pep LYSPDGGGHTGNGTQRESADELACVLLFLKEFLGX 
I I I I i I I I I I I I ! I I I I : I I II I I I I I M II I I I 1 
a0 41 LYSPDGGGHTGNGTQREAADELACVLLFLKEFLGX 
130 140 150 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 041 shows 96.8% identity over a 154 aa overlap with a predicted ORF (ORF 041 .ng) 
from N. gonorrhoeae: 

m041/g041 

10 20 30 40 50 60 

m041 .pep ISSPEHIGLQGGSNGGLITAAAFVREPQS IGALVCEVPLTDMIRYPLLSAGSSWTDEYGN 

: ( I I : I I I I i I I I I I I I I I I ! II I I I ( i I i I I I ! I I I I M I I I M I I I I i I II II I I I I f 
g041 MSSPKHIGLQGGSNGGLITAAA FVRE PQSIGALVCEVPLTDMIRYPLLS AGS SWTDEYGN 

10 20 30 40 50 60 

70 30 90 100 110 120 

ra041 .pep PQKYEVCKRRLGELSPYHNLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSW 
I I I I I : I I I I I I I I I I I I I I I I I I I I I i I I I I II I I I ! I M ! I I I 1 M M I I I M I III 
g0 41 PQKYEACKRRLGELSPYHMLSDGIDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSW 

70 80 90 100 110 120 

130 140 150 

m04 1 . pep LYSPDGGGHTGNGTQRESADELACVLLFLKEFLGX 
I I I I I I I I I I I I I I I 1 I I I I : I I I I I I I I I ! I I I I 
g04 1 LYSPDGGGHTGNGTQRESADKLACVLLFLKEFLGX 

130 140 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 159>: 

g041-l . seq 

1 ATGAAATCCT 

51 CGAAACGCAA 

101 TTTTAAACAA 

151 ATGCAGGACA 

201 GTACCATTTC 

2 51 GTACGGCGGC 

301 TCGGTGGCGG 

351 CGTGTCGCAC 

401 AATCGGGCGG 

4 51 GAATTGGTAG 

501 GTGGCGCGAT 

551 AGTTGACCGA 



ACCCCGACCC CTACCGCCAT 
AACTTCGCTG CTGAAGCGAA 
CGACAAGGCG CGCGCACTTT 
CGCGGCAGAT TCCGTTTTGT 
CATCAGAATG CGGAATATCC 
GACCTACCGT TCCGGCTATC 
ATTTCGATGA GTTGCTCGGC 
TTGGTGGAGC AGCCCAACCG 
CGATACGGCG TATACGCTGG 
AGGGCGGTTT TCACTTTCCG 
GAAAACAGCG TGTGGGTGTG 
ATCGGGCTAT CCGCGCGAAG 



TTTGAAAACC TCGATTCCGC 
TGCCGAAACG CGCGCGCGTT 
CAGACGGCAT TTTGAATCAA 
CAGGAACACC GCGCGCGGAT 
GAAGGGCGTG TACCGCATGT 
CCGAGTGGAA AATCCTGTTT 
GACGATGTGT ATTTGGGCGG 
CGCGCTGCTG ACTTTGAACA 
AAGTGGATTT GGAAGCAGGG 
GCAGGCAAAA ACCATGTGTC 
TCCGGCTTGG GACGAACGCC 
TGTGGCTGGT GGAACGCGGC 
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601 AAGAGTTTCG AGGAAAGCCT GCCGGCGTAC CAAATCGATA AAGGCGCGAT 

651 GATGGTAAAC GCGTGGCGTT ACCTCGATCC GCAGGGTTCG CCGATTGATT 

701 TGATTGAAGC GTCGGACGGT TTTTACACCA AGACGTATTT GCAGGTGTCG 

7 51 TCCGAAGGCG GGGCGAAACC GTTGAACCTG CCTAATGATT GCGATGTGGT 

801 CGGCTATCTG GCGGGACATC TTTTGCTGAC GCTGCGCAAG GACTGGCACC 

851 GCGCGAACCA AAGCTATCCG AGTGGCGCGT TGGTGGCGGT GAAACTGAAT 

901 CGGGGCGAAC TCGGGGCGGC GCAGCTTTTG TTTGCGCCCG ATGAAACGCA 

9 51 GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTGGTG GCAAGCCTGC 

1001 TGGAGAATGT ACAAGGCCGT CTGAAAGCGT GGCGGTTTGC CG A C AG C AAA. 

1051 TGGCAGGAAG CCGAGTTGCC GCACCTGCCC TCGGGCGCGT TGGAAATGAC 

1101 CGACCAACCG TGGGGCGGCG ACGTGGTTTA TCTTGCCGCC AGCGATTTCA 

1151 CCACGCCGCT GACGCTGTTT GCGCTGGATT TGAACGTGAT GGAACTGACC 

1201 GTCATGCGCC TCCAGCCGCA GCAGTTTGTT TCAGACGGCA TCGAAGTGCG 

1251 GCAGTTTTGG GCGGTGTCGT CCGACGGCGA ACGCATTCCT TATTTCCACG 

1301 TCGGCAAAAA CGCCGCGCCC GACACGCCGA CCTTAGTCTA TGCTTACGGA 

1351 GGTTTCGGCA TTCCTGAATT GCCGCATTAT CTGGGCAGCG TCGGCAAATA 

14 01 TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCAAACATC CGCGGCGGCG 

14 51 GAGAATTCGG CCCGCGCTGG CATCAGGCGG CGCAGGGAAT CAGCAAACAC 

1501 AAAAGCGTTG ATGATTTGTT GGCAGTCGTG CGTGATTTGT CCGAACGCGG 

1551 CATGAGTTCG CCCAAACACA TCGGCTTGCA GGGCGGCAGC AACGGCGGCC 

1601 TGATTACCGC CGCCGCCTTC GTGCGCGAAC CGCAAAGCAT CGGTGCGCTG 

1651 GTGTGCGAAG TACCGCTGAC CGATATGATC CGTTATCCGC TGCTGTCCGC 

1701 CGGTTCAAGT TGGACGGACG AATACGGCAA TCCGCAGAAA TACGAAGCCT 

17 51 GCAAACGCCG GCTGGGCGAA TTGTCGCCGT ATCACAATCT TTCAGACGGC 

1801 ATCGATTATC CGCCCGCACT CATTACCACC AGCCTCAGCG ACGACCGCGT 

1851 CCATCCCGCC CACGCGCTCA AATTCTACGC CAAACTGCGC GAAACCTCGC 

1901 CGCAATCTTG GCTCTACTCG CCTGACGGCG GCGGCCATAC CGGCAACGGC 

1951 ACCCAACGCG AATCCGCCGA CAAACTCGCC TGCGTGTTGC TGTTTTTGAA 

2 001 AGAATTTTTG GGATAA 

This corresponds to the amino acid sequence <SEQ ID 160; ORF 041-l.ng>: 

g04 1-1 . pep 

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLNNDKA RALSDGILNQ 

51 MQDTRQIPFC QEHRARMYHF HQNAEYPKGV YRMCTAATYR SGYPEWKILF 

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLNKSGGDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW DERQLTESGY PREVWLVERG 

201 KSFEESLPAY QIDKGAMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLQVS 

251 SEGGAKPLNL PNDCDWGYL AGHLLLTLRK DWHRANQSYP SGALVAVKLN 

301 RGELGAAQLL FAPDETQALE SVETTKRFVV ASLLENVQGR LKAWRFADSK 

351 WQEAELPHLP SGALEMTDQP WGGDVVYLAA SDFTTPLTLF ALDLNVMELT 

401 VMRLQPQQFV SDGIEVRQFW AVSSDGERIP YFHVGKNAAP DTPTLVYAYG 

451 GFGIPELPHY LGSVGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH 

501 KSVDDLLAVV RDLSERGMSS PKHIGLQGGS NGGLITAAAF VREPQSIGAL 

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEACKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSPQSWLYS PDGGGHTGNG 

651 TQRESADKLA CVLLFLKEFL G* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 161>: 

m04 1-1 . seq 

1 ATGAAATCCT ACCCCGACCC CTACCGCCAT TTTGAAAACC TCGATTCCGC 

51 CGAAACGCAA AACTTCGCTG CTGAAGCGAA TGCCGAAACG CGCGCGCGTT 

101 TTTTAGAAAA CGACAAGGCG CGCGCGCTTT CAGACGGCAT TTTGGCGCAG 

151 TTGCAGGACA CGCGGCAGAT TCCGTTTTGT CAGGAACACC GCGCGCGGAT 

201 GTACCATTTC CATCAGGACG CGGAGTATCC GAAGGGCGTG TACCGCGTGT 

2 51 GTACCGCGGC GACGTATCGT TCCGGCTATC CCGAGTGGAA AATCCTGTTT 

301 TCGGTGGCGG ATTTCGACGA ATTGCTTGGC GACGATGTGT ATTTGGGCGG 

351 CGTGTCGCAC TTGGTGGAAC AGCCCAACCG CGCGTTGTTA ACACTGAGCA 

4 01 AATTGGGCAG CGATACGGCG TACACGCTGG AAGTGGATTT GGAAGCAGGG 

4 51 GAGTTGGTCG AAGGCGGTTT TCACTTTCCG GCAGGCAAAA ACCATGTGTC 

501 GTGGCGCGAT GAAAACAGCG TGTGGGTGTG TCCGGCTTGG AACGAACGCC 

551 AGTTGACCCA ATCGGGCTAT CCGCGCGAAG TATGGCTGGT GGAACGCGGC 

601 AAGAGTTTCG AGGAAAGCCT GCCTGTGTAT CAAATCGGCG AAGACGGCAT 

651 GATGGTGAAC GCGTGGCGTT ATCTCGATCC GCAGGGTTCG CCGATTGATT 

7 01 TGATTGAAGC GTCGGACGGT TTTTACACCA AAACCTATTT GCGGGTCTCA 
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7 51 GCCGAAGGCG AGGCGAAACC GTTAAACCTG CCCAACGATT GCGACGTGGT 
801 CGGCTATCTG GCGGGGCATC TTTTGCTGAC GCTGCGCAAG GACTGGAACC 

8 51 GCGCGAACCA AAGCTATCCG AGCGGCGCGC TGGTGGCGGT GAAGCTGAAT 
901 CGGGGCGAAC TCGGGGCGGC GCAGCTTTTG TTTGCGCCCG ATGAAACGCA 
951 GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTGGTG GCGAGCCTGT 

1001 TGGAGAACGT ACAAGGCCGT CTGAAAGCAT GGCGGTTTGC CGACGGCAAA 

1051 TGGCAGGAAG TCGAATTGCC GCGCCTGCCT TCGGGCGCGT TGGAAATGAC 

1101 CGACCAACCT TGGGGCGGCG ACGTGGTTTA CCTTGCCGCC AGCGATTTCA 

1151 CCACGCCGCT GACGCTGTTT GCGCTGGATT TGAACGTGAT GGAACTGACC 

1201 GTCATGCGCC GCCAGCCGCA GCAGTTTGAT TCAGACGGCA TTAACGTGCA 

1251 GCAGTTTTGG ACGACTTCGG CTGACGGCGA GCGCATTCCT TATTTCCACG 

1301 TCGGCAAAAA CGCCGCGCCC GACATGCCGA CC 2 ."GGTCTA TGCCTACGGC 

13 51 GGTTTCGGCA TTCClGAATT GCCGCATTAT CTGGGCAGCA T7GGCAAATA 

14 01 TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCGAACATC CGCGGCGGCG 
14 51 GCGAGTTCGG CCCGCGCTGG CATCAGGCGG CGCAGGGAAT CAGCAAACAT 
1501 AAAAGCGTTG ATGATTTATT GGCAGTCGTG CGCGATTTGT CCGAACGCGG 
1551 TATCAGTTCG CCCGAACACA TCGGCTTGCA GGGCGGCAGC AACGGCGGAC 
1601 TGATTACTGC CGCCGCCTTC GTGCGCGAAC CGCAAAGCAT CGGCGCGCTG 
1651 GTGTGCGAAG TGCCGCTGAC CGACATGATC CGTTATCCGC TGCTCTCCGC 
1701 CGGTTCAAGC TGGACAGACG AATACGGCAA TCCGCAAAAA TACGAAGTCT 

17 51 GCAAACGCCG GTTGGGCGAA TTGTCGCCGT ATCACAATCT TTCAGACGGC 

18 01 ATCGATTATC CGCCCGCGCT CATTACCACC AGCCTGTCCG ACGATCGCGT 
18 51 CCATCCCGCC CACGCGCTCA AGTTCTACGC CAAACTGCGC GAAACCTCCG 
1901 CGCAATCTTG GCTCTACTCG CCTGACGGCG GCGGCCATAC CGGCAACGGC 
1951 ACCCAACGCG AATCCGCCGA CGAACTCGCC TGCGTCTTGC TGTTTTTGAA 
2001 AGAGTTTTTG GGCTAA 

This corresponds to the amino acid sequence <SEQ ID 162; ORF 041 -1>; 

mO 4 1 - 1 .pep 

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLENDKA RALSDGILAQ 

51 LQDTRQIPFC QEHRARMYHF HQDAEYPKGV YRVCTAATYR SGYPEWKILF 

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLSKLGSDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW NERQLTQSGY PREVWLVERG 

201 KSFEESLPVY QIGEDGMMVN AWRYLDPQGS PIDLIEASDG FYTKTYLRVS 

251 AEGEAKPLNL PNDCDVVGYL AGHLLLTLRK DWNRANQSYP SGALVAVKLN 

301 RGELGAAQLL FAPDETQALE SVETTKRFVV ASLLENVQGR LKAWRFADGK 

351 WQEVELPRLP SGALEMTDQP WGGDVVYLAA SDFTTPLTLF ALDLNVMELT 

401 VMRRQPQQFD SDGINVQQFW TTSADGERIP YFHVGKNAAP DMPTLVYAYG 

451 GFGI PELPHY LGSIGKYWLE EGNAFVLANI RGGGEFGPRW HQAAQGISKH 

501 KSVDDLLAVV RDLSERGISS PEHIGLQGGS NGGLITAAAF VREPQSIGAL 

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEVCKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSAQSWLYS PDGGGHTGNG 

651 TQRESADELA CVLLFLKEFL G* 

m041-l/g041-l 94.6% identity in 671 aa overlap 

10 20 30 40 50 60 

mO 4 1 - 1 . pep MKS YPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGI LAQLQDTRQI PFC 
I I I I ! I I I I I I ! I I I I I I M 11 I I M I I 1 I I [ i I I : I I I I II I I I I 1 I 1:111111111 
g041-l MKSYPDPYRH FENLDSAETQNFAAEANAETRARFLNNDKARALSDG I LNQMQDTRQI PFC 

10 20 30 40 50 60 

70 80 90 100 110 120 

m041-l .pep QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWK I LPS VADFDELLG DDVYLGGVSH 
I I i I I I 1 I I I i I : I I I I I I I I I : I I I I I I I M II I I I I M I I I I I I I I 1 I I I I I I I I I I I 
g041-l QEHRARMYHFHQNAEYPKGVYRMCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 

70 80 90 100 110 120 

130 140 150 160 170 180 

m04 1- 1 . pep LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 
I I I I I ! M I I I I : I I : I I I I I M M I I I I 1 II I I I I I t I I I M I I I I I 1 I M M I I I i I 
g04l-l LVEQPNRALLTLNKSGGDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 

130 140 150 160 170 180 

190 200 210 220 230 240 

m04i-l .pep NERQLTQSGYPREVWLVERGKSFEESLPVYQIGEDGMMVNAWRYLDPQGS PI DLIEASDG 
: I I I I I : I I I I I I I I M | | I | I I I I 11 I :! I i : : I j I I I I ! i I I M ! I 1 ! 1 t I I I I I I 
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g 04 1-1 DER0LTESGY?REV.."_7ERGKSFEESLPAYQIDKG.^MVNAWRYLDPQGSPIDLIEASDG 
190 200 210 220 230 240 

250 260 270 280 290 300 

m04 1-1 .pep FYTKTYLRVSAEGEAKPLNLPNDCDWGYLAGHLLLTLRKDWNRANQSYPSGALVAVKLN 
M I I 1 i I : I I : ! I I i I I i I I i i 1 1 I I M 1 f I I l I ! I I I I M : I I 1 I I I ! 1 |j I 1 I I I I I 
g0 41-l F Y T KT Y LQ V S S E GG AK P LN L PN D C D V VG Y LAG H L L L 7 LR K DW H RAN Q S Y P SG A L V A VK LN 

250 260 270 280 290 300 

310 320 330 340 350 360 

m04 1-1 .pep RGELGAAQLLFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFADGKWQEVELPRLP 
! i I I ! i I I I I [ ! M I I I I I M I i I I I f I ! I I I ! I I I I I i I j I I j j M I : i I [ I : I I I : M 
g041-l RGELGAAQLLFAPDETOALESVETTKRFVVASLLE XT V0GRU^AWRFADSKWQEAELPHLP 
310 320 330 340 350 360 

370 380 390 400 410 420 

m04 1-1 . pep SGALEMTDQPWGGDVVYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW 
I I f I I I I I I I I I I I I I I I I I I I I I I M I I I I I i 1 I I I I I I I I i Mill I If I : I : I IS 
g041-l SGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMRLQPQQFVSDGIEVRQFW 

370 380 390 400 410 420 

430 440 450 460 470 480 

m041-l .pep TTSADGERIPYFHVGKNAAPDMPTLVYAYGGFGIPELPKYLGSIGKYWLEEGNAFVLANI 
: : I : I I I I I I I I I I I I I M ! I I I M I I! I I I ! i i I I i M 1 I I : I I I i I I I j | | j | i j | | 
g041-l AVSSDGERIPYFHVGKNAAPDTPTLVYAYGGFGIPELPHYLGSVGKYWLEEGNAFVLANI 

430 440 450 460 470 480 

490 500 510 520 530 540 

m041-l .pep RGGGEFGPRWHQAAQGI SKHKSVDDLLAVVRDLSERGISSPEHIGLQGGSNGGLITAAAF 
M I I ! I II I I If I I i M I I I II I I I i I j II ! I I I 1 I I : M I : I I ! I M i ! i j I I i ! I | M 
g04 1-1 RGGGEFGPRWHQAAQGISKHKSVDDLLAVVRDLSERGMSSPKHIGLQGGSNGGLITAAAF 

490 500 510 520 530 540 

550 560 570 580 590 600 

m041-l .pep VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 
M I I I I I I I I I i I II I I M ! I I M I I I I I I i I I I i I I I i I M : I I I M I I i 1 I I I | | | | | 
g041-l VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEACKRRLGELSPYHNLSDG 

550 560 570 580 590 600 

610 620 630 640 650 660 

m041-l . pep IDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSWLYSPDGGGKTGNGTQRESADELA 
I I M I I I I M I I I I I I M I M I I M I I I I I I M S I i I I I I I I I I | I I | | | I | | M | : M 
g041-l IDYPPALITTSLSDDRVHPAHALKFYAKLRETSPQSWLYSPDGGGHTGNGTQRESADKLA 

610 620 630 640 650 660 



670 

m041-l .pep CVLLFLKEFLGX 
I I I I ! I I I I I I i 
g 0 4 1 - 1 CVLLFLKEFLGX 
670 



m041-l/P55577 

sp| P55S77! Y4NA_RHISN PROBABLE PEPTIDASE Y4NA >gi|2182536 (AE0000S6) Y4nA [Rhizobium sp. 
NGR234] Length = 726 
Score - 370 bits (940), Expect = e-101 

Identities = 217/682 (31%), Positives = 331/682 (47%) , Gaps = 22/682 (3%) 

'PDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGILAOLQDTROIPFCQ 61 
DP + +D + + N T + ++ + L LQ T +1 



R M + F QD + +G++R T +YRSG P+W+ + 



Query : 


2 


Sbjct : 


42 


Query : 


62 


Sbjct : 


102 



Query: 121 LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSKRDENSVWVCPAW 180 
L N L+ LS G D E D+ GE V+ GF P GK V+W DEN + + + V W 
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Sbjct: 162 LPPTSNLCLIRLSDGGKDADVVREFDIAKGEFVKEGFVLPEGKQSVTWVDEMTI YVTREW 221 

Query: 181 N ERQLTQSGY PRE VWLVERGKS FEES LPVYO IGEDGMM- -VNAWRYLDPQGSPI 232 

++T SGY ^V+RG+S + + + + +++ E G++ *D * 

Sbjct: 222 TPGEVTSSGYAYVTKVVKRGQSLDQAVEI FRGQKKDVSAERGVLRDI DGKYVMDTS YRGL 281 

Query: 233 DLIEASDGFYTKTYLRVSAEGEAKPLNLPNDCDVVGYLAGHLLLTLRKDWNRAHQS-YPS 291 

D FY + + L LP GY G + L+ DW A + + + 

Sbjct: 282 DFFNTELAFYPNGH PDTRKVVLPLPTTAVFSGY YKGQAI YWLKSDWTSAKGTVFHN 337 

Querv: 292 GALVAVKLNRGELGAAQL LFAPDETQALESVETTKRFVVASLLENVQGRLKAWRFA 34*7 

GA++A h A + + LF P + E Q+ + TK + V S + L MV + + ++ F 

Sbjct: 338 GAI IAFDLKAALADPARVEPLVLFMPNEHQSVAGTTQTKNFXVLSII..: T/T3EVRS FDFG 397 

Query: 348 DGKWQEVELPRLPSGALEMTDQPWGGDVVYLAASDFTTPLTLFALDLNVMELTVMRRQPQ 407 

G W -t-L + L +T D +^+ + F P TLF D + P 

Sbjct: 398 KGGWSSFKLALPENSTLSLTSSDDESDQLFVFSEGFLEPSTLFCADAATGQVEKITSTPA 4S7 

Qu«-v: 408 QFDSDGINVQQFWTTSADGERI PYFHVur\HAAP DMPTLVYAYGGFGI PELPHYLGS1 464 

+FD+ G+ 00 FW TS DG ++PYF V + PT++YAYGGF IP P Y + 

Sb]Ct: 458 RFDAGGLQAOQFWATSKDGTKVPYFLVARKDVKLDGTNPTILYAYGGFQIPMQPSYSAVL 517 

Query: 465 GKYWLEEGNAFVLANIRGGGEFGPRWHQAAQGI SKHKSVDDLLAVVRDLSERGI SSPEHI 524 

GK WLE+G A+ LANIRGGGEFGP+WH A +4 + DD AV +DL + + +S H + 
Sbjct: 518 GKLWLEKGGAYALANIRGGGEFGPKWH DAGLKTNRQRVYDDFQAVAQDLI AKKVTSTPHL 577 

Query: 525 GLQGGSNGGLITAAAr VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGMPQKYEVC 584 

G+ GGSNGGL+ ++ P A+V -VPL DM+ + +SAG+SW EYG-P V 

Sb^Ct: 578 GIMGGSNGGLLMGVQMIQRPDLWNAVVIQVPLLDMVNFTRMSAGASWQAEYGSPDD-PVE 636 

Query: 585 KRRLGELSPYHNLSDGI DYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSWLYSPDGG 644 

L +SPYHN+ G+ YP TS DDRV PHAKA + -r Y G 

Sbjct: 637 GAFLRS I S PYHNVKAGVAYPEPFFETSTKDDRVGPVHARKMAALFEDMGLPFYYYENIEG 696 

Query: 64 5 GHTGNGTQRESADELACVLLFL 666 
GH +E A A +++ 

69" ~::AAAANLQEHP.:.^ALZYIYM 718 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 163>: 

a04 1- 1 . seq 

1 ATGAAATCCT ACCCCGACCC CTACCGCCAT TTTGAAAACC TCGATTCCGC 

51 CG7AAACGCAA AACTTCGCTG CTGAAGCG7AA TGCCGAAACG CGCGCGCGTT 

101 TTTTAAACAA CGACAAGGCA CGCGCATTGT CTGACGGCAT TTTGGCGCAG 

151 TTGCAGGACA CGCGGCAAAT TCCGTTTTGT CAGGAACACC GCGCGCGGAT 

2 01 GTACCATTTC CATCAAGATG CGGAATATCC GAAAGGCGTG TACCGCGTGT 

2 51 GTACCGCGGC GACTTACCGT TCGGGCTATC CTGAGTGGAA AATCCTGTTT 
301 TCGGTGGCGG ATTTCGACGA ATTGCTCGGT GACGATGTAT ATCTAGGCGG 

3 51 CGTGTCGCAC CTGGTGGAAC AGCCCAACCG CGCGTTGTTA ACACTGAGCA 

4 01 AATCGGGCGG CGATACCGCG TACACGCTGG AAGTGGATTT GGAAGCAGGG 
4 51 GAGTTGGTAG AAGGCGGTTT TCACTTTCCG GCAGGCAAAA ACCATGTGTC 
501 GTGGCGCGAT GAAAACAGCG TGTGGGTGTG TCCGGCTTGG GACGAACGCC 
551 AGTTGACCGA ATCGGGCTAT CCGCGCGAGG TGTGGCTGGT GGAACGCGGC 
601 7AAGAGTTTCG AGGAAAGCCT GCCGGTGTAC CAAATTGCTG AAGACGGCAT 
651 GATGGTGAAC GCGTGGCGTT ACCTCGATCC GCAGGGTTCG CCGATTGATT 
7 01 TGATTG7AAGC GTCTGACGGT TTTTACACCA AAACCTATTT GCAGGTCTCA 

7 51 GCCGAAGGCG AAGCGAAACC GTTAAACCTG CCCAACGATT GCGACGTAGT 

8 01 CGGCTATCTG GCCGGACATC TTTTGCTGAC CTTGCGTAAA GACTGGCACC 
851 GCGCGAACCA AAGCTATCCG AGTGGCGCAT TGGTAGCAGT AAAATTAAAC 
901 CGCGGCGAAT TGGGCGCGGC GCAGCTTTTG TTTGCGCCCA ATGAAACGCA 
951 GGCATTGGAA AGCGTGGAAA CGACCAAGCG TTTTGTCGTG GCGAGCCTGC 

1001 TGGAAAACGT ACAGGGTCGT CTGAAAGCGT GGCGTTTTAC TGATGGCAAA 

1051 TGGCAGGAAA CCGAGTTGCC GCGCCTGCCT TCGGGCGCGT TGGAAATGAC 

1101 CGACCAACCG TGGGGGGGCG ACGTAGTTTA CCTTGCCGCC AGCGATTTCA 

1151 CCACGCCGCT GACGCTGTTT GCATTGGATT TGAACGTGAT GGAACTGACC 

1201 GTCATGCGCC GCCAGCCGCA GCAGTTTGAT TCAGACGGCA TTAACGTGCA 

12 51 GCAGTTTTGG ACGACTTCGG CTGACGGCGA GCGCATTCCT TATTTCCACG 

1301 TCGGCAAAAA CGCCGCGCCC GACATGCCGA CGCTGGTCTA TGCCTACGGC 

1351 GGTTTCGGCA TTCCCGAATT GCCGCATTAT CTGGGCAGCA TTGGCAAATA 

14 01 TTGGCTGGAA GAGGGCAATG CCTTTGTATT GGCGAACATC CGCGGCGGCG 
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1/151 GCGAGTTCGG CCCGCGCTGG CATCAGGCGG CGCAGGGAAT CAGCAAACAT 

1501 AAAAGCGTTG AT GAT TT ATT GGCAGTCGTG AGCGATTTGT CCGAACGCGG 

1551 TATCAGTTCG CCCGAACACA TCGGCTTGCA GGGCGGCAGC AACGGCGGAC 

1601 TGATTACTGC CGCCGCCTTC GTGCGCGAAC CGCAAAGCAT AGGCGCGCTG 

1651 GTGTGCGAAG TGCCGCTGAC CGACATGATC CGTTATCCGC TGCTCTCCGC 

1701 CGGTTCAAGC TGGACAGACG AATACGGCAA TCCGCAAAAA TACGAAGTCT 

17 51 GCAAACGCCG GTTGGGCGAA TTGTCGCCGT ATCACAATCT TTCAGACGGC 

1801 ATCGATTATC CGCCCGCGCT CATTACCACC AGCCTGTCCG ACGATCGCGT 

1851 CCATCCCGCC CACGCGCTCA AGTTCTACGC CAAACTGCGC GAAACCTCGC 

1901 CGCAATCTTG GCTCTACTCG CCTGACGGCG GCGGCCATAC CGGCAACGGC 

1951 ACGCAGCGCG AAGCCGCCGA CGAACTCGCC TGCGTGTTGC TGTT7TTGAA 

2001 AGAGTTTTTG GGCTAA 

This corresponds to the amino acid sequence <SEQ ID 164; ORF 041-l.a>: 

a041-l .pep 

1 MKSYPDPYRH FENLDSAETQ NFAAEANAET RARFLNNDKA RALSDGILAQ 

51 LQDTRQI PFC QEHRARMYHF HQDAEYPKGV YRVCTAATYR 3GYPEWKILF 

101 SVADFDELLG DDVYLGGVSH LVEQPNRALL TLSKSGGDTA YTLEVDLEAG 

151 ELVEGGFHFP AGKNHVSWRD ENSVWVCPAW DERQLTESGY PREVWLVERG 

201 KSFEESLPVY QIAEDGMMVN AWRYLDPQGS PI DLIEASDG FYTKTYLQVS 

251 AEGEAKPLNL PNDCDVVGYL AGHLLLTLRK DWHRANQSYP SGALVAVKLN 

301 RGELGAAQLL FAPNETQALE SVETTKRFVV ASLLENVQGR LKAWRFTDGK 

351 WQETELPRLP SGALEMTDQP WGGDVVYLAA SDFTTPLTLF ALDLNVMELT 

401 VMRRQPQQFD SDGINVQQFW TTSADGERI P YFHVGKNAAP DMPTLVYAYG 

4 51 GFGIPELPHY LGSIGKYWLE EGNAFVLANI RGGGEFGPRW HQAAOGISKH 

501 KSVDDLLAVV SDLSERGISS PEHIGLQGGS NGGLITAAAF VREPQSIGAL 

551 VCEVPLTDMI RYPLLSAGSS WTDEYGNPQK YEVCKRRLGE LSPYHNLSDG 

601 IDYPPALITT SLSDDRVHPA HALKFYAKLR ETSPQSWLYS PDGGGHTGNG 

651 TQREAADELA CVLLFLKEFL G + 



a041-l/m041-l 97.9% identity in 671 aa overlap 



10 20 30 40 50 60 

a041-l - pep MKSYPDPYRHFENLDSAETQN FAAEANAETRARFLNNDKARALSDGILAQLQDTRQIPFC 
I | | | I M I 1 I I M I li I I I I I I I I I I I M i I I I I I : II I M I I I I I I I I I i I I I I I I I i I 
m041-l MKSYPDPYRHFENLDSAETQNFAAEANAETRARFLENDKARALSDGI LAQLQDTRQIPFC 

10 20 30 40 50 60 



70 80 90 100 110 120 

a041-l .pep QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 
I M 11 I I I I I I I I I I i I I I ! i i M I M I II I , i i I I I I I I i I i I I I 11 I I I M I I I I M ! 
ra041-l QEHRARMYHFHQDAEYPKGVYRVCTAATYRSGYPEWKILFSVADFDELLGDDVYLGGVSH 

70 30 90 100 110 120 



130 140 150 160 170 180 

a041-l .pep LVEQPNRALLTLSKSGGDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 
1 II | I I M I I I I I I I : I I I I I I I I I I I I II I I M I I 11 I M i I i I I I I I I I M I I t II I 
m041-l LVEQPNRALLTLSKLGSDTAYTLEVDLEAGELVEGGFHFPAGKNHVSWRDENSVWVCPAW 
130 140 150 160 170 180 

190 200 210 220 230 240 

a041-l .pep DERQLTESGYPREVWLVERGKSFEESLPVYQIAEDGMMVNAWRYLDPQGS PI DLIEASDG 
: | | S | | : I I I I I i ! ! I I I I i I I I I I I I I I I I i : l I I I I I I ! ! I I ! i M I i I I I I i I i I I I 
m041-l NERQLTQSGYPREVWLVERGKSFEESLPVYQ I GEDGMMVNAWRYLDPQGS PI DLIEASDG 

190 200 210 220 230 240 



250 260 270 280 290 300 

aO 4 1-1 .pep FYTKTYLQVS AEGEAKPLNLPNDCDVVGYLAGHLLLTLRKDWHRANQSYPSGALVAVKLN 
I | M I I I : I I I j ] I I I I I I i I t I ! I I I I I I M I I i f M 1 II I : I II M 11 I I I I I I I i S I 
m0 41-l FYTKTYLRVSAEGEAKPLNLPNDCDVVGYLAGHLLLTLRKDWNRANQSYPSGALVAVKLN 
250 260 270 280 290 300 



310 320 330 340 350 360 

a04 1-1 . pep RGELGAAQLLFAPNETQALESVETTKRFVVASLLENVQGRLKAWRFTDGKWQETELPRLP 
i I I I I I I I I I I I I : I I I I t I I I I I I I I I I i I I I I I I I I f I i I M I i : I I I I I 1 : I I i t I I 
m041-l RGELGAAQLL FAPDETQALESVETTKRFVVASLLENVQGRLKAWRFADGKWQEVELPRLP 

310 320 330 340 350 360 
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370 380 390 400 410 420 

a 04 1-1 . pep SGALEMTDQPWGGDVVYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQQFW 
M | n I I ) I f j I f M I I I I I J I ) I 1 I I M 1 I I I ) I M I I I I I I i I I i I I I 1 I I ! I M ) I I 
m04 1-1 SGALEMTDQPWGGDWYLAASDFTTPLTLFALDLNVMELTVMRRQPQQFDSDGINVQOFW 

370 380 390 400 410 420 



430 440 450 460 470 480 

a041-l .pep TTSADGERIPYFHVGKNAAPDMPTLVYAYGGFGIPELPHYLGS IGKYWLEEGNAFVLANI 
I I j I [ I I I I i I 1 I I I I M I I I I I I I I I i I I I ! I 1 1 M It I 1 i 1 I ! I I 1 I I I I I I 1 I I I I ! 
m041-l TTSADGERIPYFKVGKNAAPDMPTLVYAYGGFG I PELPHYLGS IGKYWLEEGNAFVLANI 

430 440 450 4 60 470 480 



490 500 510 520 530 540 

a041-l -pep RGGGEFGPRWHQAAQGISKHKSVDDLLAVVSDLSERGISSPEHIGLQGGSNGGLITAAAF 
1 I M j 1 I I i I t I ! 1 I II i i ! 1 I ! I I i I I I I I I M i I I I I I I i i I I I I ! M I I II I I It I 
m041-l RGGGEFGPRWHQAAQGISKHKSVDDLLAVVRDLSERGISSPEHIGLQGGSNGGLITAAA.F 

490 500 510 520 530 540 



550 560 570 580 590 600 

a041-l .pep VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 
I I M I I I I I I I i ! i I II I I I I ( I I f t I t I i I t I I II II I I I I I II I I I I I I I I I i I II I I 
m041-l VREPQSIGALVCEVPLTDMIRYPLLSAGSSWTDEYGNPQKYEVCKRRLGELSPYHNLSDG 

550 560 570 580 590 600 



610 620 630 640 650 660 

aO 4 1 - 1 . pev I DYPPALITTSLSDDRVHPAHALKFYAKLRETS PQSWLYS PDGGGHTGNGTQRSAADELA 
I | i 1 | | | | I | I I 1 I I I I I I I I I I I I I I II i I I I i ! I I ! M M I I I 1 1 I I II I I : I I I i I 
m041-l IDYPPALITTSLSDDRVHPAHALKFYAKLRETSAQSWLYSPDGGGKTGNGTQRESADELA 

610 620 630 640 650 660 



670 

a041-l.pep CVLLFLKEFLGX 
I f I I I M ! I M ) 
m041-l CVLLFLKEFLGX 
670 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 165>: 

g042 . seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAagcG TTCGTGCCGC ATACCAGCGC 

51 GTTATCCAAC ACTTCCACGG CAGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TGCGGTCGAT GATGAAAATC CAGCCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACGG GCTGCCCGTG CCCTTCGTTG CGTAAAGATT CGTCCACGGG 

2 01 CGGCAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GATTGCGTGC 

2 51 CGAAGGCGGA CACCTTGTTG CCTGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG cggCTTCGCG CTTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 ATTCAATTCC GCCACGCGCG CTTCCTTACC GAAAATCCGC GACAGGGTCT 

401 CCATCTGCTT CTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAAA 

4 51 TCTATGgtgG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCACCCGG 

501 CCCGCCGGTA ATGACAAACT GCGGATTGTG GCGGTGCAGG GATTCGCAAT 

551 CGGGCTCAAA CAGCGTCCCC ACCGTTGCCG CCTTGTCAAA TGCAGGCTGC 

601 AAATAG 



This corresponds to the amino acid sequence <SEQ ID 166; ORF 042.ng>: 

g042 .pep 

1 MTMICLRFQA FVPHTSALSN TSTAAGPSCP MAAVRSMMKI QPGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRFWA NSASICAFNS ATRASLPKIR DR VSICFSPL VRILPLSTV K 

151 SMVVAFFANC SYASAPGPPV MTNCG^WRCR DSQSGSNSVP TVAALSNAGC 

201 K + 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 167>: 
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m042 . seq 



I 


ATGACGATGA 


TTTGCTTGCG 


51 


GTTATCCAmT 


ACTTCGACAG 


101 


TACGGTCGAT 


GATGAAAATC 


151 


AAGGAAACAG 


GCTGCCCGTG 


201 


CGGTAGGC-G 


ATGTCGCCGT 


251 


CGAAGGCGGA 


CACCTTGTTG 


301 


TTGCCTTTGG 


CGGCTTCGCG 


351 


CTTCAATTCC 


GCCGCGCGCG 


401 


CCATCTGCTT 


TTCGCCGCTG 


451 


TCTATGGTGG 


TCGCGTTTTT 


501 


CCCGCCGGTA 


ATGACAAGCT 


551 


CGGGCTCGAA 


CAGCGTCCCC 


601 


AAATAA 





CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 
CCGcCGGCCy TTCyTGCCCG ATGGCGGCGG 
CAATCGGGGT TTTTCTCTTT GATGTATTCG 
CCCCTCGTTG CGTAAAGATT CGTCTACAGG 
GTATCCAACT TGCCAACCGC GACTGCGTGC 
CCCGTAA^CG ACAGCACCAG CCCGCGTCCT 
CGTTTGGGCG AACAGCGCG7 CAATCTGCGC 
CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 
GTGCGGATAT TGCCGTTGTC CACCGTCAGA 
CGCTAACTGT TCATACGCTT CCGCGCCCGG 
GAGGATTGTA GCGGTGCAGG GCTTCGTAAT 
ACCGTTGCCG CCTTG7CAAA TGCAGGCTGC 



This corresponds to the amino acid sequence <SEQ ID 168; ORF 042>: 

m042 .pep 

1 MTMICLRFQA FVPRTSALSX TSTAAGXSCP MAAVRSMMKI QSGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRVWA NSASICAFNS AARASLPKIR AK VSICFSPL VRILPLSTV R 

151 SMWAFFANC SYASAPGPPV MTSXGLXRCR ASXSGSNSVP TVAALSNAGC 

201 K + 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 169>: 

a042 . seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 

51 GTTATCCAAT ACTTCGACAG CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG 

2 01 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC 

2 51 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT 
301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC 

3 51 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 

4 01 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA 

4 51 TCTATGGTGG TCGCGTTTTT CGCCAACTGT TCATACGCTT CCGCGCCCGG 
501 CCCGCCGGTA ATGACAAGCT GAGGATTGTA GCGGTGCAGG GCTTCGTAAT 

5 51 CGGGCTCGAA CAGCGTCCCC ACCGTTGCCG CCTTGTCAAA TGCAGGCTGC 
601 AAATAA 



This corresponds to the amino acid sequence <SEQ ID 170; ORF 042. a>: 

a042.pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRVWA NSASICAFNS AARASLPKIR AK VSICFSPL VRILPLSTV R 

151 SMWAFFANC SYASAPGPPV MTS*GL + RCR ASXSGSNSVP TVAALSNAGC 

201 K* 



m042/a042 99.0% identity over a 201 aa overlap 



10 20 30 40 50 60 

m042 . pep MTMICLRFQAFVPRTSALSXTSTAAGXSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I ! I I I ! I i M I I M I ! i I I I I t I I I I I 1 I I t M I ! i I i I I I 1 I I I t I I I ! I I I M ; I I 
a042 MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 



70 80 90 100 110 120 

m042 .pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
I I I It I I I t I II I I 1 ! I i I t I I I I I I I N II I I i I 1 I | ] M i f I II I! I I II t II I I I I I 
a04 2 RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
"70 80 90 100 110 120 



m042 . pep 



130 140 150 160 170 130 
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I I I I M I j I I M I I I i I I I I I I I I I ! I I I I 1 I I I I i I M I I I I I ! I i I I I I I i I I I I I ! I 
a0 4 2 AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVKTSXGLXRCR 
130 140 150 160 170 180 

190 200 
mO'J 2 . pep ASXSGSNSVPTVAALSNAGCKX 
I M I I i I I I I i I M i i I I i M i 
a04 2 ASXSGSNSVPTVAALSNAGCKX 

190 200 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 042 shows 93.0% identity over a 201 aa overlap with a predicted ORF (ORF 042.ng) 
from N. gonorrhoeae: 

m042/g042 

10 20 30 40 50 60 

m04 2 .pep MTMICLRFQAFVPRTSALSXTSTAAGXSCPMAAVR3MMKIQSGFFSLMYSKETGCPCPSL 
| | | | i 1 I | ! I I I I : I I I I I Mill! I I t I I I 1 i I i I I II I I II I I I I II I I I I I I I I 
g 04 2 MTMICLRFQAFVPHTSALSNTSTAAGPSCPMAAVRSMMKIQPGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 2 .pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
i | | i | I i I i M i II I I I I I II M II I It I I I I ! I I I M I 1 I I I I I f ! I I I I I I I i i I I ! 
g 04 2 RKDSSTGGRPMS PCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRFWANSASICAFNS 

70 80 90 100 110 120 

130 140 150 160 170 180 

m042 .pep AARASLPKIRAKVSICFSPLVRILPLSTVRSMWAFFANCSYASAPGPPVMTSXGLXRCR 
I : I ! 1 ' 1 I I I : ! I I I I ! II I I I I I I M I : I II I I i II ! I I I I I II M i I I I : i I III 
g 04 2 ATRASLPKIRDRVSICFSPLVRI LPLSTVKSMVYAFFANCSYASAPGPPVMTNCGLWRCR 

130 140 150 160 170 180 

190 200 
m04 2 . pep ASXSGSNSVPTVAALSNAGCKX 
I I 1 I I I I I I I i I I I I M ! I I 
g04 2 DSQSGSNSVPTVAALSNAGCKX 
190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 171>: 

m042-l . seq 

1 ATGACGATGA TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 

51 GTTATCCAAT ACTTCGACAG CCGCCGGCCC TTCCTGCCCG ATGGCGGCGG 

101 TACGGTCGAT GATGAAAATC CAATCGGGGT TTTTCTCTTT GATGTATTCG 

151 AAGGAAACAG GCTGCCCGTG CCCCTCGTTG CGTAAAGATT CGTCTACAGG 

201 CGGTAGGCCG ATGTCGCCGT GTATCCAACT TGCCAACCGC GACTGCGTGC 

2 51 CGAAGGCGGA CACCTTGTTG CCCGTAACCG ACAGCACCAG CCCGCGTCCT 

301 TTGCCTTTGG CGGCTTCGCG CGTTTGGGCG AACAGCGCGT CAATCTGCGC 

351 CTTCAATTCC GCCGCGCGCG CTTCCTTGCC GAAAATCCGC GCCAAGGTCT 

4 01 CCATCTGCTT TTCGCCGCTG GTGCGGATAT TGCCGTTGTC CACCGTCAGA 

4 51 TCTATGGTGG TCGCGTTTTT CGCTAACTGT TCATACGCTT CCGCGCCCGG 

501 CCCGCCGGTA A 

This corresponds to the amino acid sequence <SEQ ID 172; ORF 042-1 >: 

m042-l . pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFFSLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRVWA NSASICAFNS AARASLPKIR AK VSICFSPL VRILPLSTV R 

151 SMVVAFFANC SYASAPGPPV MTS* 



m042-l/g042 95.4% identity in 173 aa overlap 
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10 20 30 40 50 60 

m04 2-l .pep MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I | I t I I I I 1 I II I : I ! I I I I I I I I I I I I I I I 1 i I I I I I I i I I I i ! 1 I I i I I I I I I I I i 1 
g 0 4 2 MTMICLRFQAFVPHTSALSNTSTAAGPSCPMAAVR5HMKIQPGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 2-l . pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRWJANSASICAFNS 
i I I I I ! I I I i I I I I I I I j I I 1 I ! M 1 M I I i I I 1 I I ! I I i ! ! I M ! i I I I I I I I I I II I 
g 04 2 RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRFWANSASICAFNS 

70 80 90 100 110 120 

130 140 150 160 170 

m04 2-l .pep AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSX 

I : I [ I 1 I i I I : I i I II I I II ( I I I I I t I : I I I I M I I I I I I I I U t I I I I I : 
g 04 2 ATRASLPKIRDRVSICFSPLVRILPLSTVKSMVVAFFANCSYASAPGPPVMTNCGLWRCR 

130 140 150 160 170 180 

g04 2 DSQSGSNSVPTVAALSNAGCKX 
190 200 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 173>: 

a042-l . seq 

TTTGCTTGCG CTTCCAAGCG TTCGTGCCGC GTACCAGCGC 



1 


AT G AC GAT G A 


51 


GTTATCCAAT 


101 


TACGGTCGAT 


151 


AAGGAAACAG 


201 


CGGTAGGCCG 


251 


CGAAGGCGGA 


301 


TTGCCTTTGG 


351 


CTTCAATTCC 


401 


CCATCTGCTT 


451 


TCTATGGTGG 


501 


CCCGCCGGTA 



This corresponds to the amino acid sequence <SEQ ID 174; ORF 042-1 .a>: 

a042-l .pep 

1 MTMICLRFQA FVPRTSALSN TSTAAGPSCP MAAVRSMMKI QSGFF SLMYS 

51 KETGCPCPSL RKDSSTGGRP MSPCIQLANR DCVPKADTLL PVTDSTSPRP 

101 LPLAASRVWA NSASICAFNS AARASLPKIR AKVSICFSPL VRILPLSTVR 

151 SMVVAFFANC SYASAPGPPV MTS* 



m042-l/a042-l 100.0% identity in 173 aa overlap 

10 20 30 40 50 60 

m04 2-l .pep MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 
I I I I I M I ! i I I I I I I I I I I I I 1 I I I t I I I I I I I I M S I I I I I ! I I i I I M I I I II 1 ! I i 
a04 2-l MTMICLRFQAFVPRTSALSNTSTAAGPSCPMAAVRSMMKIQSGFFSLMYSKETGCPCPSL 

10 20 30 40 50 60 

70 80 90 100 110 12C 

m04 2-l . pep RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAFNS 
I ! | j I 1 II I I I I I I I I I I M i I t I I I I I ! I i I I I I I I I I ! I i I I I I I I I I 1 II I I I N M 
a04 2-l RKDSSTGGRPMSPCIQLANRDCVPKADTLLPVTDSTSPRPLPLAASRVWANSASICAF!'S 

70 80 90 100 110 120 

130 140 150 160 170 

m04 2-l .pep AARASLPKIRAKVSICFSPLVRILPLSTVRSMWAFFANCGYASAPGPPVMTSX 
I I I I I I ! ! I t I I I I I I I I I II I ( II I i I I I f I I I I I I I I I I f I I I I I I I I I I I I 
a04 2-l AARASLPKIRAKVSICFSPLVRILPLSTVRSMVVAFFANCSYASAPGPPVMTSX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 175>: 



WO 99/57280 



PCT/US99/09346 



241 



gCM 3 . seq 

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GTCGGCCCAT CAGCACTTTT 

51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GCCTGAACGC TTCgttgaAC 

101 CGTCCCGCGT ggcggtagcc gcAAAAGTGC ATcGCGGCTT GGATGGTGCT 

151 GCCCGATTCG ATGAGGGcga gcGCGTGTTC CAGCCGCAGG CGGCGCAGGC 

201 GTCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 

251 CATTCGTTCA GCCCGACGCG GCGGGCGA7T TCGGCGATGG TCAGCGGGCG 

301 GGCGAATTCG CTGTTCAAAA TATCGGCGGC TTCGTCTATG CGCCGGCGGC 

351 GGTAGCCGTT GTCGTGGCGG CGGAAGGTGA AGCGTAA 



This corresponds to the amino acid sequence <SEQ ID 1 76; ORF 043.ng>: 

g04 3 . pep 

1 MVVSNQNIYA VGPSALFHIR RQKSVMPPER FVEPSRVAVA AKVHRGLDGA 

51 ARFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQPDA AGDFGDGQRA 

101 GE FA VQN I GG FVYAPAAVAV VVAA EGEA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 77>: 

m04 3 . seq 

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GCCGGCCCCT CAGCACTTCT 

51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GTCTGAACGC TTCGTTGAAC 

101 CGTCCCGCGT GGCGGTAGCC GCAAAAGTGC ATGGCGGCTT GGACGGTGCT 

151 GCCGGATTCG ATGAGGGCGA GCGCGTGTTC CAGCCGCAGG CGGCGCAgGC 

201 ATCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 

2 51 CATTCGTTCA GTCCGACGCG GCGGGCGATT TCGGCGATGG TCAGCGGACG 

301 GGCGAATTCG TGTTGCAGGA TGTCGGCGGC TTCGTCTATG CGCCGACGGC 

351 GGTAACCGTT GTCGTGGCGG CGGAAGGTGA AGCGCAATAA 

This corresponds to the amino acid sequence <SEQ ID 178; ORF 043>: 

m04 3 . pep 

1 MVVSNQNIYA AGPSALLHIR RQKSVMPSER FVEPSRVAVA AKVKGGLDGA 

51 AGFDEGERVF QPQAAQASGD GFAGLRFEIA FQVAFVQSDA AGDFGDGQRT 

101 GEFVLQDVGG FVYAPTAVTV VVAAEGEAQ* 

Computer analysis of the amino acid sequences gave the following results: 
Homology with a predicted ORF from N. meningitidis menA with menB 
ORF 043 shows 89.8% identity over a 128 aa overlap with a predicted ORF (ORF043.a) from 
N. gonorrhoeae: 

m043/g043 

m0 4 3 -pep 
g043 



10 20 30 40 50 60 

MVVSNQNIYAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 
I i I ! 11 t I t I : I I 1 M : I I I I i i I I I I I M I I I M I I I I I I I I Hill! I i I I I 1 I i 
MVVSNQNIYAVGPSALFHIRRQKSVMPPERFVEPSRVAVAAKVHRGLDGAARFDEGERVF 
10 20 30 40 50 60 



70 80 90 100 110 120 

m04 3 .pep QPQAAQASGDGFAGLRFE I AFQVAFVQSDAAGDFGDGQRTGEFVLQDVGG FVYAPTAVTV 

I I I I I I I t I I I 1 I I M I 1 I I I I I I I I I I I i 1 I I It ! ! t : I I I : : I : : ! M I I I I : I I : | 
g 0 4 3 QPQAAQASG DG FAG LR FE I A FQVA FVQPDAAG D FG DGQRAGE FA VQN I GG FVYA PAAVAV 

70 80 90 100 110 120 



130 

m04 3.pep VVAAEGEAQX 

I I I I I I I I 
g04 3 VVAAEGEAXX 
130 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 79>: 

a04 3 . seq 

1 ATGGTTGTTT CAAATCAAAA TATCTATGCC GCCGGCCCCT CAGCACTTCT 

51 TCACATCCGA AGGCAAAAAT CCGTAATGCC GTCTGAACGC TTCGTTGAAC 

101 CGTCCCGCGT GGCGGTAGCC GCAAAAGTGC ATGGCGGCTT GGACGGTGCT 
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151 GCCGGATTCG ATGAGGG^-. GCGCGTGTTC CAGCCGCAGG CGGCGCAGGC 

201 ATCCGGCGAC GGTTTCGCCG GTTTGCGCTT TGAAATAGCG TTTCAGGTAG 

2 51 CATTCGTTCA GTCCGACGCG GCGGGCGATT TCGGCGATGG TCAGCGGACG 

301 GGCGAATTCG TGTTGCAGGA TGTCGGCGGC TTCGTCTATG CGCCGACGGC 

351 GGTAACCGTT GTCGTGGCGG CGGAAGGTGA AGCGCAATAA 

This corresponds to the amino acid sequence <SEQ ID 180; ORF 043. a>: 

a043 . oep 

1 MVVSNQN I YA AGPSALLHIR RQKSVMPSER FVEPSRVAVA AKVHGGLDGA 

51 AGFDEGERVF QPQAAQASGD GFAGLRFEI A FQVAFYQSDA AGDFGDGQRT 

101 GEFVLQD VGG FVYAPTAVTV VVAA EGEAQ* 

100.0% identity in 129 aa overlap 

10 20 30 40 50 60 

MVVSNQNI YAAGPSALLH I RRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 
I | U M i I M I ! I I M i 1 1 I U I I M I I I I I H I I i I I I M M I I I I i I I M I I ! I 1 I 1 I 
MVVSNQNI YAAGPSALLHIRRQKSVMPSERFVEPSRVAVAAKVHGGLDGAAGFDEGERVF 
10 20 30 40 50 60 

70 80 90 100 110 120 

QPQAAQASGDG FAG LRFE I AFQVAFVQSDAAGDFGDGQRTGEFVLQDVGG FVYAPTAVTV 
| j I I i I I I 1 I I I I I I 1 I I 1 I I I I 1 I 1 I I I I ' I ! I ! 1 I I ! I i I I ! i I ! 1 I I ' i I I ! t i i i ! 
QPQAAQASGDGFAGLRFEIAFQVAFVQSDAAGDFGDGQRTGEFVLQDVGG FVYAPTAVTV 
70 80 90 100 110 120 

130 

VVAAEGEAQX 
I I I I I I i I t I 
VVAAEGEAQX 
130 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 181>: 

g044 . seq 

1 ATGCTGCCCG ACCAGAGCGT CGAGTTCTTG CC AC AAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGCGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCGTTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGCCC TTCGATAACG GCGGTCAGCT 

201 CCATGCGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 CGGCTGCCGT AGCGCATTAA 

This corresponds to the amino acid sequence <SEQ ID 182; ORF 044.ng>: 

g044 .pep 

1 MLPDQSVEFL PQ VWFDGLF GGGFPAVAL P TVYPVFHAVF DVLRVGADDD 
51 GAAAFERFQP FDNGGQLKAV VGGLRFAAEK FFFAAAVAH* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 1 83>: 

m044 . seq 

1 ATGCCGTCCG ACTAGAGCGT CGAGTTCTTT CCAGAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGAGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCATTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGCAGCGG CGTTTGAGCG ATTTCAGTCC TTCGATGACG GCAGTCAGTT 

201 CCATGCGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

251 TGGCTACCGT AGCGCAyTAa 

This corresponds to the amino acid sequence <SEQ ID 184; ORF 044>: 

m044 .pep 

1 MPSDXSVEFF PE VWFDGLF GGGFPAVAL P TVYPVFHAIF DVLRVGADDD 
51 GAAAFERFQS FDDGSQFHAV VGGLRFAAEK FFFVATVAH* 



ra043/a043 

m0<3 3 . pep 
a043 

m04 3 . pep 
a043 

m04 3 . pep 
a043 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 185>: 

a04 4 . seq 

1 GTGCCGTCCG ACCAGCGCGT CGAGTTCTTT CCACAAGTCG TCGTTTTTGA 

51 CGGGCTGTTT GGCGGCGGTT TTCCAGCCGT TGCGCTTCCA ACCGTGTATC 

101 CAGTTTTCCA TGCCGTTTTT GACGTATTGC GAGTCGGTGC AGATGATGAC 

151 GGTGGhGCGG CGTTTGAGCG ATTTCAGTCC TTCGATGACG GCGGTCAGTT 

201 CCATACGGTT GTTGGTGGTT TGCGCTTCGC CGCCGAAAAG TTCTTTTTCG 

2 51 TGGCTGCCGT AGCGCATTAA 

This corresponds to the amino acid sequence <SEQ ID 186; ORF 044. a>: 

a04 4 . pep 

1 VPSDQRVEFF PQ VVVFDGLF GGGFPAVAL P TVYPVFHAVF DVLRVGADDD 
51 GAAAFERFQS FDDGGQFHTV VGGLRFAAEK FFFVAAVAK * 

m044/a044 91.0% identity over a 89 aa overlap 

10 20 30 40 50 60 

m04 4 . pep MPSDXSVEFFPEWVFDGLFGGGFPAVALPTVYPVFHAIFDVLRVGADDDGAAAFERFQS 
: I I I I I I I I : i I ! IS I i I I I I I! I I I I I 1 I I I I I I I : I I I ! I I f I I I I I I I I 1 I I I I I 
a0 4 4 VPSDQRVEFFPQVVVFDGLFGGGFPAVALPTVYPVFHAVFDVLRVGADDDGAAAFERFQS 

10 20 30 40 50 60 

70 80 90 

m04 4 . pep FDDGSQFHAWGGLRFAAEKFFFVATVAHX 
I I I I : I M : I I i I I ! I I I 1 I i I I I ! : I I i I 
a04 4 FDDGGQFHTVVGGLRFAAEKFFFVAAVAHX 

70 80 90 

Computer analysis of this ammo acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

044 .nows 86.5% identity over a 89 aa overlap with a predicted ORF (ORF 044.ng) 
from N. gonorrhoeae: 

m044/g044 



m044 . pep 
g044 



m04 4 .pep 
g044 

70 80 9 



10 20 30 40 50 60 

MPSDXSVEFFPEVWFDGLFGGGFPAVALPTVYPVFHAIFDVLRVGADDDGAAAFERFQS 

I I illhMIIIMIilMlllillllilllllhllllilllllilMMIIII 

MLPDQSVEFLPQVWFDGLFGGGFPAVALPTVYPVFHAVFDVLRVGADDDGAAAFERFQP 
10 20 30 40 50 60 

70 80 90 

FDDGSQFHAWGGLRFAAEKFFFVATVAHX 

IhhhllllllllMMIMhhim 

FDNGGQLHAWGGLRFAAEKFFFAAAVAHX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 187>: 

g046 . seq 

1 ATGTCGGCAA TGCTGCGTCC GACAAGCAGC CCGCCGCgcc gCGCCTGTAT 

51 GATGACCATC CGCACGCGGT CGTCTGCAAA ACGTAAAACC TGCAATGCGC 

101 CCGGGCAGTC TATCAGGCCG GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 

151 CTGATGGTTT CGGTTATGCC gaATATGGAA AGGCTGCCGt TTTcGTTGTT 

201 TTCGAGCTTG GGGCTGAGGT ATTCGAGGTA TtcgctGGAA CGGACGCGCG 

251 CGATGCGGCC GGGGATGTTG AACAGGTCGG CGGCAACTTT GCAGGCGACG 

3 01 ATGTTGGTTT CGTCGCTGCG GGagaGCGCG AGcagcaagt cggcatcttC 
351 CgcgccggcG Cgttataatg tgAAGGGGGA TGCGccgctg ccgaAAACGG 

4 01 TTTGGacatC gaggcggctg CCTGTTTCCT GCAATGCTTT TTCGTCGATG 
4 51 TCGATAAcgg TTACGTCCTT GTTGGTGATG GCGGCAAGGT TTTGCGCGAC 
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501 GGTAGAACCT ACCTGCCCGT TGCCTAAAAT GAGGATTTTC ACGGTATGGG 
551 TCGCCGGGTG A 



This corresponds to the ammo acid sequence <SEQ ID 1SS; ORF 046. ng>: 

g04 6 . pep 

1 MSAMLRPTSS PFRRACMMTI RTRSSAKRKT CNAPGQSIR? ASCSVTSCSG 

51 L-MVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRS AATLQAT 

101 MLVSSLRESA SSKSASSAPA RYNVKGDAPL PKTVWTSRRL PVSCNAFSSM 

151 SITVTSLLVM AARFCATVEP TCPLPKMRIF TVWVAG* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 189>: 

m046 . seq 

1 ATGTCGGCAA TGCTGCGTCC GACAAGCAsT CCGC . r . sGC gCGcCTGTAT 

51 GATGACCATC CGCACGCGGT CGTCTGCAAA ACGTAAAACC TGCAATGCGC 

101 CCGGGCAGTC TATCAGGCCG GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 

151 CTGATGGTTT CGGTTATGCC GAATATGGAA AGGCTGCCGT TTTCGTTGTT 

2 01 TTCGAGCTTG GGGCTGAGGT ATTCGAGGTA TTCGCTGGAA CGGACGCGCG 

251 CGATGCGGCC GGGGATGTTG AACAGGTCGG CGGCAACTTT GCAGGCGACG 

301 ATGTTGGTTT CGTCGCTGCG GGAGAGCGCG AGCAGCAAGT CGGCATCTTC 

351 CGCGCCGGCG CGTTCTAATG TGAAGGGGGA TGCGCCGTTG CCGAAAACGG 

401 TTTGGAGATC GAGGCGGCTG CCTGTTTCCT GCAATGCTTT TTCGTCGATG 

451 TCGATAACGG TTACGTCGTT GTTGGGTATG GCGGCAAGGT TTTGTGCGAC 

5 01 GGTAGAACCT ACCTGTCCGT TGCCTAAAAT GAGGATTTTC ACGGTGTGGG 

551 TCGCCGAGTG A 



This corresponds to the amino acid sequence <SEQ ID 190; ORF 046>: 

m046.pep 

1 MSAMLRPTSX PXXRACMMTI RTRSSAKRKT CNAPGQS1RP ASCSVTSCSG 

51 LMV3VMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RSNVKGDAPL PKTWTSRRL PVSCNAFSSM 

151 SITVTSLLGM AARFCATVEP TCPLPKMRIF TVWVAE* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 191 >: 

a046 . seq 

1 ATGTCGGCAA TGCTGCGTCC GACAAGCAGT CCGCCGCGCC GCGCCTGTAT 

51 GATGACCATC CGCACGCGGT CGTCTGCAAA ACGTAAAACC TGCAATGCGC 

101 CCGGGCAGTC TATCAGGCCG GCAAGCTGTT CGGTAACGAG CTGTTCGGGG 

151 CTGATGGTTT CGGTTATGCC GAATATGGAA AGGCTGCCGT TTTCGTTGTT 

201 TTCGAGCTTG GGGCTGAGGT ATTCGAGGTA TTCGCTGGAA CGGACGCGCG 

2 51 CGATGCGGCC GGGGATGTTG AACAGGTCGG CGGCAACTTT GCAGGCGACG 

301 ATGTTGGTTT CGTCGCTGCG GGAGAGCGCG AGCAGCAAGT CGGCATCTTC 

351 CGCGCCGGCG CGTTCTAATG TGAAGGGGGA TGCGCCGTTG CCGAAAACGG 

4 01 TTTGGACATC GAGGCGGCTG CCTGTTTCCT GCAATGCTTT TTCGTCGATG 

4 51 TCGATAACGG TTACGTCGTT GTTGGGTATG GCGGCAAGGT TTTGTGCGAC 

501 GGTAGAACCT ACCTGTCCGT TGCCTAAAAT GAGGATTTTC ACGGTGTGGG 

551 TCGCCGAGTG A 

This corresponds to the amino acid sequence <SEQ ID 192; ORF 046.a>: 

a04 6 . pep 

1 MSAMLRPTSS PPRRACMMTI RTRSSAKRKT CNAPGQSIRP ASCSVTSCSG 

51 LMVSVMPNME RLPFSLFSSL GLRYSRYSLE RTRAMRPGML NRSAATLQAT 

101 MLVSSLRESA SSKSASSAPA RSNVKGDAPL PKTWTSRRL PVSCNAFSSM 

151 SITVTSLLGM AARFCATVEP TCPLPKMRIF TVWVAE * 



m046/a046 98.4% identity over a 186 aa overlap 



10 20 30 40 50 60 

m04 6.pep MSAMLRPTSXPXXRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 
! I ! I i I I I I ! I I I I I I I I I I I I f I I i I 1 I I I f I I 1 M I I I I I 1 I I M I I I t ! i I I I I 
a04 6 MSAMLRPTSS PPRRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 
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10 20 30 40 50 60 

10 80 90 100 110 120 

m04 6 . pep RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 

I I I i i I I I i I f I I I I I I M I I I I I I I I I I I I I I M I i i I 1 I I I I I I I i I! J i I I f I ! I I I 
a04 6 RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSK5ASSAPA 

70 80 90 100 110 120 

130 140 150 160 170 130 

m04 6 . pep RSNVKGDAPLPKTVWTSRRLPVSCNAFSSMS ITVTSLLGMAARFCATVEPTCPLPKMRI F 

II I I II I t I I I t M I I I I I I I I ! I I I i I I M I I I ! ! I f I I I I I I I I I I i I I I I I i i I I I I 
a04 6 RSNVKGDAPLPKTVWTSRRLPVSCNAFSSMSIT\"7 "LLGMAARFCATVEPTCPLPKMRI F 

130 140 150 160 170 180 



m04 6.pep TVWVAEX 
I I I i I I I 

a04 6 TVWVAEX 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 046 shows 97.3% identity over a 185 aa overlap with a predicted ORF (ORF 046.ng) 
from N. gonorrhoeae: 

m046/g046 

10 20 30 40 50 60 

m04 6 . pep MSAMLRPTSXPXXRACMMTIRTRSSAKRKTCNAPGQSIRPASCSVTSCSGLMVSVMPNME 

IIIIIIIII I IIIIMIIIIIIIIIIIIIIIIIIII1IIUMIIIMIIIIIIIII 

g04 6 MSAMLRPTSSPPRRACMMTIRTRSSAKRKTCNAPG0SIRPASCSVT3C3GLM\ ; S\T^PNME 
10 20 30 40 50 60 

70 80 90 100 110 120 

m04 6 - pep RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLQATMLVSSLRESASSKSASSAPA 

IIIIIIIIIIIIIIIIIMMIIIIIIIMIIIIMIMIIIIIIIIIIIIIIIIMIII 

g04 6 RLPFSLFSSLGLRYSRYSLERTRAMRPGMLNRSAATLOATMLVSSLRESASSKSASSAPA 
70 80 90 100 110 120 

130 140 150 160 170 180 

m04 6 .pep RSNVKGDAPLPKTWTSRRLPVSCNAFSSMSITVTSLLGMAARFCATVEPTCPLPKMRIF 

i miiimiiiiiiimiiiiiimiiuiimmii iiuinimmimiM 

g04 6 RYNVKGDAPLPKTVWTSRRLPVSCNAFSSMSITVTSLLVMAARFCATVEPTCPLPKMRI F 

130 140 150 160 170 180 



m04 6.pep TVWVAEX 
Mill 

gO 4 6 TVWVAGX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 193>: 

g047 . seq 

1 ATGGTCATCA TACAGGCGcg gcGCGGCGGG CTGCTTGTCG GACGCAGCAT 

51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 

101 CCGTTTACCG CAACAACCGC CTCATC . TCC CCGCGCCGCA AACCGTCATC 

151 ATCGAAGGCG ACGAAATCCT GTTTGCCGCC GCCGCCGAAA ACATCGGGGC 

201 GGTCATACCc gaATTGCGCC CCAAAGAAAC CAGCACCCGC CGCATCATGA 

251 TTGCCGGCGG CGGCAAC AT c tgctACCGCC TCGCCAAGCA GCTCGAACAC 



WO 99/57280 



PCT/US99/09346 



246 



3 01 GCATAcaacG TCAAAATCAT CGAATGCCGG CCGCGCca tg ccgaATGGAT 

3 51 AGCCGAAAAC ctcgAcaaCA CCCTCGTCCT GCAAGGTTCG Gcaaccgacg 

4 01 aAaccctgct cgAcaacgaa tacatcgacg aaatcgaCGT ATTCTGCGCC 

4 51 CTGACCAACG ACGACGAAAG CAACATTAtg tCCGCCCTTT TGGCGAAAAA 
501 CCTcggcgCG AAGCgcgtca tcggCATCGT CAACCGCTCA AGCTACGTCG 

5 51 ATTTGCTCGA AGGCAACAAA ATCGACATCG TCGTCTCCCC CCACCTCATC 
601 ACCATCGGCT CGATACTCGC CCACATCCoG CGCGGCGAGA TCGTTGCCGT 
651 CCACCCCATC CGGCGCGGCA CGGCGGAAGC CATCGAAGTC GTCGCGCACG 
701 GCGACAAAAA AACTTCCGCC ATCATCGGCA GGCGCATCAG CGGCATCAAA 

7 51 TGGCCCGAAG GCTGCCACAT TGCCGCCGTC GTCCGCGCCG GAACCGGCGA 

8 01 AACCATTATG GGACACCATA CCGAAACCGT CATCCAAGAC GGTGACCACA 
851 TCATCTTTTT CGTCTCGCGC CGGCGCATCC TGAACGAACT GGAGAAACTC 
901 ATCCAAGTCA AAATGGGCTT TTTCGGATAA 

This corresponds to the amino acid sequence <SEQ ID 194; ORF 047. ng>: 

g047 .pep 

1 MVIIGARRGG LLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 IEGDEILFAA AAENIGAVIP ELRPKETSTR RIMIAGGGNI CYRLAKQLEH 

101 AYNVKI IECR PRRAEWIAEN LDNTLVLQGS ATDETLLDNE YIDEIDVFCA 

151 LTNDDESNIM SALLAKNLGA KRVIGIVNRS SYVDLLEGNK ID IWSPHLI 

201 TIGSILAHI R RGDIVAVHPI RRGTAEAIEV VAHGDKKTSA IIGRRISGIK 

251 WPEGCHIAAV VRAGTGETIM GKHTETVIQD GDKIIFFV3R RRILNELEKL 

3 01 IG/VKMGFFG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 195>: 



m047 . seq 












1 


ATGGTCATCA 


TACAGgCGcG 


C . . syGCGGA 


sTGCTTGTCG 


GACGCAGCAT 


51 


TGCCGACATC 


GCCCAAGATT 


TGCCCGACGG 


GGCCGACTGC 


CAAATCTGCG 


101 


CCGTTTACCG 


CAACAACCGC 


CTCATCGTCC 


CCGCGCCGCA 


AACCGTCATC 


151 


ATCGAAGGCG 


ACGAAATCCT 


ATTTGCCGCC 


GCCGCCGAAA 


ACATCGGCGC 


201 


GGTCATACCC 


GAATT GCGGC 


CCAAAGAAAC 


CCAAAGAAAC 


CAGCcCmgmm 


251 


GcATCATGAT 


TkCCGGCGGC 


GGCAACATCG 


GCTACCGTCT 


CGCCAAGCAG 


301 


CTCGAACACG 


CATACAACGT 


yAAAATCATC 


GAATGCCGGC 


CGCGCCGTGC 


351 


CGAATGGATA 


GCCGAAAACC 


TCGACAACAC 


CCTCGTCyTG 


CAAGGTTCGG 


401 


CAACCGACGA 


AACCCTGCTC 


GACAACGAAT 


ACATCGACGA 


AATCGACGTA 


451 


TTCTGCGCCC 


TGACCAACGA 


CGACGAAAGC 


AACATTATGT 


CCGCCCTTTT 


501 


GGCGAaAAAC 


CTCGGCGCGA 


AGCGCGTCAT 


CGGCATCGTC 


AACCGCTCAA 


551 


GCTACGTCGA 


TTTGCTCGAA 


GGCAACAAAA 


TCGACATCGT 


CGTCTCCCCC 


601 


CACCTCATCA 


CCATCGGCTC 


GATACTCGCC 


CACATCCGGC 


GCGGCGACAT 


651 


CGTTGCCGTC 


CACCCCATCC 


GGCGCGGCAC 


GGCGGAAGCC 


ATCGAAGTCG 


701 


TCGCACACGG 


CGACAAAAAA 


ACTTCCGCCA 


TCATCGGCAG 


GCGCATCAGC 


751 


GGCATCAAAT 


GGCCCGAAGG 


CTGCCACATT 


GCCGCCGTCG 


TCCGCGCCGG 


801 


AACCGGCGAA 


ACCATTATGG 


GACACCATAC 


CGAAACCGTC 


ATCCAAGACG 


851 


GCGACCACAT 


CATCTTTTTC 


GTCTCGCGCC 


GGCGCATCCT 


GAACGAACTG 


901 


GAAAAACTCA 


TCCAGGTCAA 


AATGGGCTTT 


TTCGGATAA 





This corresponds to the amino acid sequence <SEQ ID 196; ORF 047>: 

m047 .pep 

1 MVIIQARXXG XLVGRSIADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 IEGDEILFAA AAENIGAVIP ELRPKETQRN QPXXIMIXGG GNIGYRLAKQ 

101 LEHAYNVKI I ECRPRRAEWI AENLDNTLVL QGSATDETLL DNEYIDEIDV 

151 FCALTNDDES NIMSALLAKN LGAKRVIGIV NRSSYVDLLE GNKID IWSP 

201 HLITIGSILA HI RRGDIVAV HPIRRGTAEA I EW AHGDKK TSAIIGRRIS 

251 GIKWPEGCHI AAWRAGTGE TIMGHHTETV IQDGDHI I FF VSRRRILNEL 

3 01 EKLIQVKMGF FG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 197>: 

a047 . seq 

1 ATGGTCATCA TACAGGCGCG GCGCGGCGGA CTGCTTGTCG GACGCAGCAT 
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51 TGCCGACATC GCCCAAGATT TGCCCGACGG GGCCGACTGC CAAATCTGCG 

101 CCGTTTACCG CAACAACCGC CTCATCGTCC CCGCGCCGCA AACCGTCATC 

151 ATCGAAGGCG ACGAAATCCT ATTTGCCGCC GCCGCCGAAA ACATCGGCGC 

20 i GGTCATACCC GAATTGCGCC CCAAAGAAAC CAGCACCCGC CGCATCATGA 

2 51 TTGCCGGCGG CGGCAACATC GGCTACCGTC TCGCCAAGCA GCTCGAACAC 
301 GCATACAACG TCAAAATCAT CGAATGCCGG CCGCGCCGTG ~CGAATGGAT 

3 51 AGCCGAAAAC CTCGACAACA CCCTCGTCCT GCAAGGTTCG GCAACCGACG 

4 01 AAACCCTGCT CGACAACGAA TACATCGACG AAATCGACGT ATTCTGCGCC 
4 51 CTGACCAACG ACGACGAAAG CAACATTATG TCCGCCCTTT TGGCGAAAAA 
501 CCTCGGCGCG AAGCGCGTCA TCGGCATCGT CAACCGCTCA AGCTACGTCG 
551 ATTTGCTCGA AGGCAACAAA ATCGACATCG TCGTCTCCCC CCACCTCATC 
601 ACCATCGGCT CGATACTCGC CCACATCCGG CGCGGC'GACA TCGTTGCCGT 
651 CCACCCCATC CGGCGCGGCA CGGCGGAAGC CATCGAAGTC GTCGCACACG 
701 GCGACAAAAA AACTTCCGCC ATCATCGGCA GGCGCATCAG CGGCATCAAA 
7 51 TGGCCCGAAG GCTGCCACAT TGCCGCCGTC GTCCGCGCCG GAACCGGCGA 
801 AACCATTATG GGACACCATA CCGAAACCGT CATCCAAGAC GGCGACCACA 
851 TCATCTTTTT CGTCTCGCG^ CGGCGCATCC TGAACGAACT GGAAAAACTC 
901 ATCCAAGTCA AAATGGGCTT TTTCGGATAA 



This corresponds to the amino acid sequence <SEQ ID 198; ORF 047. a>: 

a04 7 . pep 

1 MVI IQARRGG LLVGRS IADI AQDLPDGADC QICAVYRNNR LIVPAPQTVI 

51 IEGDEILFAA AAENIGAVIP ELRPKETSTR RIMIAGGGNI GYRLAKQLEH 

101 AYNVKIIECR PRRAEWIAEM LDNTLVLQGS ATDETLLDNE YIDEIDVFCA 

151 LTNDDESNIM SALLAKNLGA KRVIGIVNRS SYVDLLEGNK ID IVVSPHLI 

201 TIGSILAKI R RGDIVAVKPI RRGTAEAIEV VAHGDKKTSA IIGRRISGIK 

251 WPEGCHIAAV VRAGTGETIM GHKTETVIQD GDKI I FFV5R RRILNELEKL 

301 IQVKMGFFG* 



m047/a047 96.5% identity over a 312 aa overlap 

10 20 30 40 50 60 

mC-17 . pep MVI I QARXXGXLVGRS IADI AQDLPDGADCQICAVYRNNRL I VPAPQTV I IEGDEILFAA 

I I I ! I I I I I I I I I I i I I 1 M 1 i t I II 11 I I I I I ! i 1 I ! I I I I M I I I I i I I I I (I 1 I 
a04 7 MVI IQARRGGLLVGRS IADI AQDLPDGADCQICAVYRNNRLIVPAPQTVI IEGDEILFAA 

10 20 30 40 50 60 



70 80 90 100 110 120 

m04 7.pep AAENIGAVI PELRPKETQRNQPXXIMIXGGGNIGYRLAKQLEHAYNVKI IECRPRRAEWI 
I f I I I I I I I II I I I I I I : : I I i I I I I I I I I I I I II I I I i I I I I I I I I I j | | | | | 
a04 7 AAENIGAVI PELRPKETSTRR IMIAGGGNIGYRLAKQLEHAYNVKI IECRPRRAEWI 

70 80 90 100 110 

130 140 150 160 170 180 

m04 7 .pep AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAKNLGAKRVIGIV 

I I M I I I M I I I I I I I I I I I I I I 1 I | I | M | | | | I I I ! i I I I I I I I I I I ! I I I I I II I I I 
a04 7 AENLDNTLVLQGSATDETLLDNEYIDEIDV FCALTNDDESNIMSALLAKNLGAKRVIGIV 

120 130 140 150 160 170 

190 200 210 220 230 240 

m04 7 . pep NRSSYVDLLEGNKI DI VVSPHLITIGS ILAHIRRGDIVAVHPIRRGTAEAIEVVAHGDKK 

M I II I I I j I I I I I I I I I I I I I M I I I I M I I I I I I I II I I I I I I I I I I I | | I M | | I | | 
a04 7 NRSSYVDLLEGNKI DIVVSPHLITIGS ILAHIRRGDIVAVHPIRRGTAEAIEVVAHGDKK 

180 190 200 210 220 230 



250 260 270 280 290 300 

m047 . pep TSAI IGRRISGIKWPEGCHIAAVVRAGTGETIMGHH7ETVIQDGDHI I FFVSRRRI LNEL 

I i I I M I I I I M I I I M I II I I I I I I I I I I I I I j I I I I I I I M I I I I I I I I I I I I I | | I I 
a04 7 TSAI IGRRISGIKWPEGCHIAAVVRAGTGETIMGHHTETVIQDGDHI I FFVSRRRI LNEL 

240 250 260 270 280 290 



m04 7 . pep 



310 

EKLIQVKMGFFGX 
I I I I I I I I I I I I I 
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a047 EKLIQVKMGFFGX 
300 310 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 047 shows 96.2% identity over a 312 aa overlap with a predicted ORF (ORF 047. ng) 
from A^. gonorrhoeae: 

m047/g045 

m047 .pep mviiqarxxgxlvgrsiadiaqdlpdgadcoicavyrnnrlivpapqt\ ; i:egdeilfaa 60 

iiiiiii i iMiiiiMiiiiiiMiitiUMumiiiMimmiiiiii 

gO 4 7 MVIIOARRGGLLVGRSIADIAQDLPDGADCQICAVYRNNRLIVPAPQTVIIEGDEILFAA 6 0 

m04 7 .pep AAENIGAVIPELRPKETORNQPXXIMIXGGGNIGYRLAKQLEHAYNVKIIECRPRRAEWI 120 

lilllllllillllllh ■ III I ' 
g04 7 AAENIGAVI PELRPKETSTRR IMIAGGGNICYRLAKQLEHAYNVKI I ECRPRRAEW I 117 

m04 7 .pep AENLDNTLVLQGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAKNLGAKRVIGIV 180 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIMIIIIIMIIIIIMIIil 

g04 7 AENLDNTLVLOGSATDETLLDNEYIDEIDVFCALTNDDESNIMSALLAKNLGAKRVIGI V 17 7 

m04 7 . pep NRSSYVDLLEGNKIDI WSPHLI TIGS I LAH I RRGD I VAVHP I RRGTAEAIEWAHGDKK 240 

IMIIIIMIIMMIIIMIIIIIIIIIIMIIIIIiMfllllhliilMIMMII 

g04 7 NRSSYVDLLEGNKIDI WSPKLITIGS I LAHIRRGDIVA VHP I RRGTAEAIEWAHGDKK 23 7 

m04 7 . pep TSAI IGRRISGI KWPEGCHI AAWRAGTGETIMGHHTETVIQDGDK: I FFVSRRRILNEL 3 00 

II I I I I 1 1 I II I 1 1 I I II I 1 1 I 1 1 I II I I I I I I M 1 1 I 1 1 ; I I I I I ! I I ! I 1 1 I I I i I I I 

g 0 4 7 TSAI I GRRI SGI KWPEGCH I AAWRAGTGETIMGHHTETV I CDGDHI I FFVSRRRILNEL 2 97 

m047.pep EKLIQVKMGFFGX 313 

iiniimiiii 

g04 7 EKLIQVKMGFFGX 310 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 199>: 

g04 8 . seq 

1 ATGCTCGACA AAGGCGAGGA GTTGCCCGTC GATTTCACCA ACCGCCTGAT 

51 TTACTACGTc ggcCCcgTCG ATCCGGTCGG CGATGAAGTC GTCGGTCCCG 

101 CAGGTCCGAC CACAGCCACC CGCATGGACA AATTTACCCG CCAAATGCTC 

151 AAACAAACCG GCCTCTTGGG CATGATCGGC AAATCCGagc gcgGcgcggc 

201 cacctGCGAA GCcatCGCCG ACAACAAGGC CGTGTACCTC ATGGCAGTCG 

251 GCGGCGCGGC ATACCTCGTG GCAAAAGCCA TCAAATCTTC CAAAGTCTTG 

3 01 GCGTTCCCCG AATTGGGTAT GGAAGCCGTT TACGAATTTG AAGTCAAAGA 

3 51 TATGCCCGTA ACCGTCGCCG TGGACAGCAA AGGCGAATCC ATCCACGCCA 
401 CCGCCCCGCG CAAATGGCAG GCGAAAATCG GCATCATCCC CGTCGAGTCT 

4 51 TGA 

This corresponds to the amino acid sequence <SEQ ID 200; ORF 048. ng>: 
g04 8 .pep 

1 MLDKGEELPV DFTNRLI YYV GPVDPVGDEV VGPAGPTTAT RMDKFTRQML 

51 KQTGLLGMIG KSERGAATCE AIADNKA VYL MAVGGAAYLV AKAI KSSKVL 

101 AFPELGMEAV YEFEVKDMPV TVAVDSKGES IHATAPRKWQ AKIGIIPVES 

151 * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 20 1>: 

m048 . seq 

1 ATGCTCAACA AAGGCGAAGA ATTGCCCGTC GATTTCACCA ACCGCCTGAT 

51 TTACTACGTC GGCCCCGTCG ATCCGGTCGG CGATGAAGTC GTCGGTCCGG 
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101 CAGGTCCGAC CACAGCCACC CGCATGGACA AATTCACCCG CCAAATGCTC 

151 GAACAAACCG ACCTCTTGGG CATGATCGGC AAATCCGAGC GCGGCGTGGC 

201 CACCTGCGAA GCCATCGCCG ACAACAAAGC CGTGTACCTC ATGGCAGTCG 

251 GCGGCGCGGC GTATCTCGTG GCAAAAGCCA TCAAATCTTC CAAAGTCTTG 

3 01 GCGTTCCCCG AATTGGGCAT GGAAGCCATT TACGAATTTG AAGTCAAAGA 
351 CATGCCCGTA ACCGTCGCCG TAGATAGCAA AGGCGAATCC ATCCACGCCA 

4 01 CCGCCCCGCG CAAATGGCAG GCGAAAATCG GCATCATCCC CGTCGAATCT 
4 51 TGA 



This corresponds to the amino acid sequence <SEQ ID 202; ORF 048>: 

m04 8 . pep 

1 MLNKGEELPV DFTNRLIYYV GPVDPVGDEV VGPAGP'-'TAT RMDKFTRQML 
51 EQTDLLGM1G KSERGVATCE AIADNKA VYL MAVGGAAYLV AKAIK5SKVL 
101 AFPELGMEAI YEFEVKDMPV TVAVDSKGES IHATAPRKWQ AKIG1IPVES 
151 * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 203>: 



a04 8 . seq 












1 


ATGCTCGACA 


AAGGCGAAGA 


ATTGCCCGTC 


GATTTCACCA 


ACCGCCTGAT 


51 


TTACTACGTC 


ggccccgi :g 


ATCCGGTCGG 


CGACGAAATC 


GTCGGCCCAG 


101 


CAGGTCCGAC 


CACCGCCACC 


CGCATGGACA 


AATTCACCCG 


CCAAATGCTC 


151 


GAACAAACCG 


ACCTCTTGGG 


CATGATCGGC 


AAATCCGAGC 


GCGGCGCGGC 


201 


CACCTGCGAA 


GCCATCGCCG 


ACAACAAAGC 


CGTGTACCTC 


ATGGCAGTCG 


251 


GCGGCGCGGC 


GTATCTCGTG 


GCAAAAGCCA 


TCAAATCTTC 


CAAAGTCTTG 


301 


GCGTTCCCCG 


AATTGGGCAT 


GGAAGCCATT 


TACGAATTTG 


AAGTCAAAGA 


351 


CATGCCCGTA 


ACCGTCGCCG 


TAGACAGCAA 


AGGCGAATCC 


ATCCACGCCA 


401 


CCGCCCCGCC 


CCAATGGCAG 


GCGAAAATCG 


GCATCATCCC 


CGTCAAATCT 


451 


TGA 











This corresponds to the amino acid sequence <SEQ ID 204; ORF 048. a>: 

a048 .pep 

1 MLDKGEELPV DFTNRLIYYV GPVDPVGDEI VGPAGPTTAT RMDKFTRQML 
51 EQTDLLGMIG KSERGAATCE AIADNKA VYL MAVGGAAYLV AKAI KS5KVL 
101 AFPELGMEAI YEFEVKDMPV TVAVDSKGES IHATAPPQWQ AKIGIIPVKS 
151 * 

m048/a048 96.0% identity over a 150 aa overlap 

10 20 30 40 50 60 

m04 8 . pep MLNKGEELPVDFTNRLI YYVGPVDPVGDEWGPAGPTTATRMDKFTRQMLEQTDLLGMIG 

I I : M t I I 1 I I t I IS I I I I I i I! I I I II I : I I t ! t I 1 I I I ! I I I I I I I I i I I I i II I ! I i 
a04 8 MLDKGEELPVDFTNRLIYYVGPVDPVGDEIVGPAGPTTATRMDKFTRQMLEQTDLLGMIG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 8 . pep KSERGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAI YEFEVKDMPV 

I i I I I : i II II IS I I I I I I i I I I I i I I I 1 I I I I I I II I i II i II I I I i I I I i t i 1 I I I i i 
a04 8 KSERG AATCE AI ADNKAVYLMAVGGAAYLVAKA I KSSKVLAFPELGME A I YEFEVKDMPV 

70 80 90 100 110 120 

130 140 150 

m04 3 . pep TVAVDSKGES IHATAPRKWQAKIGI I PVESX 

I I I I I I I I I I I I I I! 1 : I I I I M I I I I : ! I 
a04 8 TVAVDSKGESIHATAPPQWQAKIGII PVKSX 

130 140 150 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 048 shows 96.4% identity over a 150 aa overlap with a predicted ORF (ORF 048.ng) 
from N. gonorrhoeae: 
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m048/g04 8 

10 20 30 40 50 60 

m04 8 .peD MLNKGEELPVDFTNRLIY YVGPVDPVGDEWGPAGPTTATRMDKFTROMLEQTDLLGMIG 

■ ■ ; ■ ■ ' i - " : 1 i hum 

gO 4 8 MLDKGEELPVDFTNRLIYYVGPVDPVGDEWGPAGPTTATRMDKFTRQMLKQTGLLGMIG 
10 20 jO 40 50 60 

70 80 90 100 110 120 

m04 8 .peD KSERGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAIYEFEVKDMPV 

llllf:||llllllllllllllll!llMII!l!lll|[||!ll|lll|:|||lllll!i 
g04 8 KSERGAATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAVYEFEVKDMPV 
70 80 90 100 110 120 

130 140 150 

m048 .pep TVAVDSKGESIHATAPRKWQAKIGI I PVESX 

miimmimmiiMiimiM 

g04 8 TVAVDSKGES IHATAPRKWQAKIGI I PVESX 

130 140 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 205>; 

aCGTTTGA TCAACCGTTC GGTCAGCTCC TGTTCGGACA 

:gcgccgg TTGACGGCTT TCGGGTTCAG GATATTGATT 

^ACGCCTC TTCCGCACCG CCTTCGCCGT TTTCCGCAAC 

:gtaccgg attctgccgc ATCGGCGTTT TCCCCGCCCT 



g04 9 . seq 




1 


ATGCGGGCGC 


5 1 


GGCAGAACAC 


101 


TGGACGGGCA 


151 


CCCGTCTGCC 


201 


CAATCTGTGC 


251 


ATTCTCCGCC 


301 


AGCCTgcgcg 


351 


CGACTTCCTC 


401 


TTCTTCAGAT 



This corresponds to the amino acid sequence <SEQ ID 206; ORF 049. ng>: 

g049 -pep 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ DIDLDGHQRL FRTAFAVFRN 

51 PVCRRTGFCR IGVFPALNLC GFKFGTVFFG IEPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFD FL AAIGNGAWF FLPFL QIRL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 207>: 



m04 9 . seq 


(partial ) 










1 


ATGCGGGCGC 


AGGCGTTTGA 


TCAGCCGTTC 


GGTCAGCTCC 


TGTTCGGACA 


51 


GGCAGAACAC 


TTCGCGCCGG 


TTGACGGCTT 


TCGGGTTCAG 


GATATTGATT 


101 


TGGACGGGCA 


TCAACGTTTC 


TTCCGCATCG 


TTTTCCCCGT 


TTTCCGAAAC 


151 


CGCCGGCTCA 


TTCGTGCCGG 


ATTCTGCCTC 


GTCGGCGTTT 


TCCCCGCTTT 


201 


CAATCTGTCC 


GGTTTCAAAT 


TCGACACTGT 


CTTTTTTGGT 


ATCAAACCGG 


251 


ATTCTCCGCC 


GCGATTCGAT 


GTGTTTTTCC 


GAAACCGACA 


TTTGCAGGGA 


301 


AGCCTGCGCG 


TTGAGCCAGT 


TTTCCTGAAG 


GACGATCATC 


GGGTCGGTTT 


351 


CGACTTCCTC 


GCCGCAATCG 


GCAACGGCGG 


CATTGTGTTC 


CTCCTGCCAT 


401 


TTTTTCAGAT 


ACGCCTT. . . 









This corresponds to the amino acid sequence <SEQ ID 208; ORF 049>: 

m049.pep (partial) 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ DIDLDGKQRF FRIVFPVFRN 

51 RRLIRAGFCL VGVFPAFNLS GFKFDTVFFG IKPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFDFL AAIGNGGIVF LLPFFQIRL . . . 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 209>: 

a04 S . seq 
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1 ATGCGGGCGC AGGCGTTTGA TCAGCCGTTC GGTCAGCTCC TGTTCGGACA 

51 GGCAGAACAC TTCGCGCCGG TTGACGGCTT TCGGGTTCAG AATATTGATT 

101 TGGACGGGCA TCAACGCTTC TTCCGCACCG CCTTCGCCGT TTTCCGCAAC 

151 CCCGTCTGCC GCCGTACCCG ATTCTGCCGC ATCGGCGTTT TCCCCGCCTT 

201 CAATCTGTCC GGTTTCAAAT TCGGCACTGT CTTTTTTGGC ATCAAACCGG 

251 ATTCTCCGCC GCGATTCGAT GTGTTTTTCC GAAACCGACA 7TTGCAGGGA 

301 AGCCTGCGCG TTGAGCCAGT TTTCCTGAAG GACGATCATC GGGTCGGTTT 

351 CGACTTCCTC GCCGCAATCG GCAACGGCGG CATTGTGTTC CTCCTGCCAT 

4 01 TTTTTCAGAT ACGCCTT 

This corresponds to the amino acid sequence <SEQ ID 210: ORF 049. a>: 

a04 9 . pep 

1 MRAQAFDQPF GQLLFGQAEH FAPVDGFRVQ NIDLDGHQRF FRTAFAVFRN 

51 PVCRRTRFCR IGVFPAFNLS GFKFGTVFFG IKPDSPPRFD VFFRNRHLQG 

101 SLRVEPVFLK DDHRVGFD FL AAIGNGG1VF LLPFF QIRL 

m049/^049 90.6% identity over a 139 aa overlap 

10 20 30 40 50 60 

m04 9 . pep MRAQAFDQPFGQLLFGQAEHFAPVDGFRVODIDLDGHQRFFRIVFPVFRNRRLIRAGFCL 

I I I I I I M I I I I I I I M I I I I I t [ I I t I M : I I I t I | | f | | | : | ||M | ; \\ 

a04 9 MRAQAFDQPFGQLLFGQAEHFAPVDG FRVQNIDLDGHQRFFRTAFAVFRN PVCRRTRFCR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 9.pep VGVFPAFNLSGFKFDTVFFGIKPDSPPRFDVFFRNRHLQGS1RVE PVFLKDDHRVGFDFL 

: I I I f I M M M M I I I i I I ! I I M M M M M M I M M i I I I I M M M M M M M 
a0 4 9 IGVFPAFNLSGFKFGTVFFGIKPDSPPRFDVFt RNRHLQGSLRVE PVFLKDDHRVGFDFL 

"7 0 80 90 100 110 120 

130 139 
m04 9.pep AAIGNGGIVFLLPFFQIRL 
M I i M I I f I I I I i i M M 
a04 9 AAIGNGGIVFLLPFFQIRL 

130 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 049 shows 86.3% identity over a 139 aa overlap with a predicted ORF (ORF 049.ng) 
from N. gonorrhoeae: 

m049/g049 

10 20 30 40 50 60 

m04 9 . pep MRAQAFDQPFGQLLFGQAEHFAPVDGFRVQDIDLDGHQRFFRIVFPVFRNRRLIRAGFCL 

IMIMMIIIIIIIIIIIIIIlMIIMIIMMIIIhll :| 1111 hill 
904 9 m'raoafdqpfgqllfgqaehfapvdgfrvqdidldghqrlfrtafavfr^pvcrrtgfcr 

10 20 30 40 50 60 

70 80 90 100 110 120 

m04 9 . pep vgvfpafnlsgfkfdtvffgikpdspprfdvffrnrhlogslrvepvflkddhrvgfdfl 
: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 : 1 1 II M 1 1 1 1 ! M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 i M 
g04 9 igvfpalnlcgfkfgtvffgiepdspprfdvffrnrhlogslrvepvflkddkrvgfdfl 

7 0 80 90 100 110 120 

130 139 
m04 9.pep AAIGNGGIVFLLPFFQIRL 
IMII|::||:M|:|||| 

g04 9 aaigngawfflpflqirlx 

130 140 
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The following partial DNA sequence was identified in A ; . gonorrhoeae <SEQ ID 21 1>: 

gOSO . seq 

1 atgggcgCGG GCTGGTGTCC TCCCGGCATC TTGGGCATCG GCATCGGCGg 

51 cacgcccGAA AAAGccgcgc TGATGGcaaA AGAATCCCTG ATGAGCCACA 

101 TCGAcatCca aGaATTGCAG GAAAAAGCCG CGTccggggc ggaattgece 

151 accaccgaAG CCCTGCGCCT cGAACTCTTT GAAAAGGTCA ACGCGCTGGG 

201 CATCGGCGCG CAAGGCTTGG GCGGTCTGAC CACCGTGTTG GACGTGAAAA 

2 51 TCCTCGATTA CCCGACCCAT GCCGCCTCCA AACCGATTGC CATGATTCCC 

301 AACTGTGCcg ccacCCGcca cgncgAATTT GAATTGgACG GCTCAGGt cc 

351 TGTCGAact c acgccGCcgc gtgtCGAAGA CTGA 



This corresponds to the amino acid sequence <SEQ ID 212; ORF 050.ng>: 

g050 . pep 

1 MGAGWCPPGI LGIGIGGTPE KAVLMAKESL MSHIDIQELQ EKAASGAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVED* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 213>: 

m050 . seq 

1 ATGGGCGCGG GCTGGTGTCC TCCCGGCATC TTGGGTATCG GCATCGGCGG 

51 C. .agCCgAA AAAGCCGTGC TGATGGCAAA AGAGTCCCTG ATGAGCCACA 

101 TCGACATTCA AGAATTGCAG GAAAAGGCCG CGTCCGGGGC GgAATTGTCC 

151 ACCACCGAAG CCCTGCGCCT CGAACTCTTT GAAAAAGTCA ACGCGCTGGG 

201 CATCGGCGCA CAAGGCTTGG GCGGACTGAC CACCGTGTTG GACGTGAAAA 

251 TCCTCGATTA TCCGACCCAC GCCGCCTCCA AACCGATTGC CATGATTCCG 

3 01 AACTGCGCCG CCACCCGCCA CGTCGAATTT GAATTGGACG GCTCAGGCCC 

351 TGTCGAACTC ACGCCGCCGC GCGTCGAAGA TGGCCCGATT TGA 



This corresponds to the amino acid sequence <SEQ ID 214; ORF 050>: 

m050 .pep 

1 MGAGWCPPGI LGIGIGGXAE KAVLMAKESL MSHIDIQELQ EKAASGAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVEDGPI * 



The following partial DNA sequence was identified in A'', meningitidis <SEQ ID 21 5>: 

a050 . seq 

1 ATGGGCGCGG GCTGGTGTCC TCCCGGCATC TTGGGCATCG GCATCGGCGG 

51 TACGCCCGAA AAAGCCGTGT TGATGGCGAA AGAATCCCTG ATGAGCCACA 

101 TCGACATCCA AGAATTGCAG GAAAAAGCCG CGTCCGGCGC GGAATTGTCC 

151 ACCACCGAAG CCCTGCGCCT CGAACTCTTT GAAAAAGTCA ACGCGCTAGG 

201 CATCGGCGCG CAAGGCTTGG GCGGTCTGAC CACCGTGTTG GACGTGAAAA 

251 TCCTCGATTA CCCGACCCAC GCCGCCTCCA AACCGATTGC CATGATTCCG 

301 AACTGCGCCG CCACCCGCCA CGTCGAATTT GAATTGGACG GCTCAGGCCC 

3 51 TGTCGAACTC ACGCCGCCGC GCGTCGAAGA CTGGCCC 

This corresponds to the amino acid sequence <SEQ ID 216; ORF 050. a>: 

a050 . pep 

1 MGAGWCPPGI LGIGIGGTPE KAVLMAKESL MSHIDIQELQ EKAASGAELS 

51 TTEALRLELF EKVNALGIGA QGLGGLTTVL DVKILDYPTH AASKPIAMIP 

101 NCAATRHVEF ELDGSGPVEL TPPRVEDWP 



moso/aoso 97.7% identity over a 129 aa overlap 



10 20 30 40 50 50 

m050 . pep MGAGWCPPGILGIGIGGXAEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 
i I H I M I M M I I I ! I : t i I I I! I II 1 I 1 1 ! i I I ! ! ! i | | i | | [ | M ] ! I 1 ! I j I I i j 
a050 MGAGWCPPGILGIGIGGTPEKAVLMAKESLMSH I DIQELQEKAASGAELSTTEALRLELF 
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10 20 30 40 50 60 

70 80 90 100 110 120 

m0 50 . pep EKWALGIGAQGLGGLTTVLDVKI LDYPTHAASKPI AMI PNCAATRHVEFELDGSGPVEL 

I I I I t I I I I I I I II ! f I I I I I ! I ! I t I ! II i I II I f i I I I I I I ! I I I I I | | I | I I | | I i | 
a 0 5 0 EKVNALGI GAQG LGGLTTVLDVKILDY PTHAAS K P I AM I PNCAATR.HVEFELDGSGPVEL 

70 80 90 100 110 120 

130 

m050.pep TPPRVEDGPIX 

I I I I I I I I 
a050 TPPRVEDWP 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 050 shows 98.4% identity over a 127 aa overlap with a predicted ORF (ORF 050.ng) 
from N. gonorrhoeae: 

m050/g050 

10 20 30 40 50 60 

m0 5 0 .pep MGAGWCPPGILGIGIGGXAEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 

IMIIMillMIMlh IIIIIIIIIIIMMIIIIIMMIIMMMIIMIIIII 

g05 0 MGAGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELF 
10 20 30 4C 50 60 

70 80 90 100 110 120 

m050 - pep EKVNALGIGAQGLGGL.TTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVEL 

IMlllllllllllllllllllllllllllllllllllMIMIIMillllilllllil 

g0 50 EKVNALGIGAQGLGGLTTVLDVKILDYPTHAASKPI AMI PNCAATRKVEFELDGSGPVEL 

70 80 90 100 110 120 

130 

mOSO.pep TPPRVEDGPIX 

lllllll 
g050 TPPRVEDX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 21 7>: 

g050-l . seq 

1 ATGACCGTTA TCAAGCAAGA AGACTTTATT CAAAGTATCT GCGATGCCTT 

51 CCAATTCATC AGCTACTACC ATCCAAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AATCCCGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGTAT GTGTGCCGAA AACAACCGCC CCATCTGCCA 

201 AGACACAGGT ATCGCAACCG TCTTCCTCAA AGTCGGTATG GATGTGCAAT 

251 GGGATGCGGA CATGAGCGTG GAAAAGATGG TTAACGAAGG CGTACGCCGC 

301 GCCTACACTT GGGAAGGCAA CACCCTGCGC GCTTCCGTCC TCGCCGATCC 

3 51 GGCCGGCAAA CGCCAAAACA CCAAAGACAA CACCCCCGCC GTCATCCACA 

401 TGAGCATCGT GCCGGGCGGT AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

451 GGCGGCTCTG AAAACAAATC CAAACTCGCT ATGCTCAACC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATCCC GACGATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGC ATCGGCATCG GCGGCAcgcC CGAAAAAGCC 

601 GTGTTGATGG cgaAAGAATC CCTGATGAGC CACATCGACA TCCAAGAATT 

651 GCAGGAAAAA GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

7 01 GCCTCGAACT CTTTGAAAAG GTCAACGCGC TGGGCATCGG CGCGCAAGGC 

7 51 TTGGGCGGTC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTACCCGAC 

8 01 CCATGCCGCC TCCAAACCGA TTGCCATGAT TCCCAACTGT GCCGCCACCC 
e51 GCCACGTCGA ATTTGAATTG GACGGCTCAG GTCCTGTCGA ACTCACGCCG 
901 CCGCGCGTCG AAGACTGACC CGATCTGACT TACAGCCCCG ACAACGGCAA 
951 ACGCGTCGAT GTCGATAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACC^G CCGCGATGCC 

1051 GCGCACAAAC GCCTCGTCAA TATGCTCGAC AAAGGCGAGG AGTTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGATGAAGT CGTCGGTCCC GCAGGTCCGA CCACAGCCAC CCGCATGGAC 

1201 AAATTTACCC GCCAAATGCT CAAACAAACC GGCCTCTTGG GCATGATCGG 
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12 51 CAAATCCGAG CGCGGCGCGG CCACCTGCGA AGCCATCGCC GACAACAAGG 

1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CATACCTCGT GGCAAAAGCC 

1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGTA TGGAAGCCGT 

1401 TTACGAATTT GAAGTCAAAG ATATGCCCGT AACCGTCGCC GTGGACAGCA 

14 51 AAGGCGAATC CATCCACGCC ACCGCCCCGC GCAAATGGCA GGCGAAAATC 

1501 GGCATCATCC CCGTCGAGTC TTGA 

This corresponds to the amino acid sequence <SEQ ID 21 8; ORF 050-1 .ng>: 

g050-l .pep 

1 MTVIKQEDFI QSICDAFQFI SYYHPKDYJD ALYKAWQKEE NPAAKDAMTQ 

51 I LVNSRMCAE NNRPICQDTG IATVFLKVGM DVQWDADMSV EKMVNEGVRR 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGG KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTI PTMG AGWCPPGILG IGIGGTPEKA 

201 VLMAKESLMS H T .DIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVED* PDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKILTGRDA 

351 AHKRLVNMLD KGEELPVDFT NRLI YYVGPV DPVGDEVVGP AGPTTATRMD 

401 KFTRQMLKQT GLLGMIGKSE RGAATCEAI A DNKAVYLMAV GGAAYLVAKA 

4 51 IKSSKVLAFP ELGMEAVYEF EVKDMPVTVA VDSKGESI HA TAPRKWQAKI 

501 GIIPVES* 



g050-l/pl44O7 

spt P14407 | FUMB_ECOLI FUMARATE HYDRATASE CLASS I , ANAEROBIC { FUMARASE ) 
>gi 1 280063 Ipir j I B44511 fumarate hydratase (EC 4.2.1.2) fumB, iron-depenaent - Escncrichia coll 
>gi | 146048 (M27058) anaerobic class I fumarase (EC 4.2.1.2} [Escherichia coli] Length = 548 

Score = 172 bits (432), Expect = 4e-42 

Identities = 138/488 (28%), Positives = 216/488 (43%), Gaps = 22/488 (4*) 

Query: 11 QSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAENIIRPTCQDTG 7 0 
q+ DA + H K L+ E + K O L NS + A+ P CQDTG 
QAFHDASFMLRPAHQKQVAAILHDPEASEND KYVALQFLRNSEIAAKGVLPTCQDTG 103 

IATVFLKVGMDVQWDADMSVEKMVNEGVRRAYTWEGNTLRASVLADPAGKRQNTKDNTPA 130 

A + KG V W E+ +++GV YEN + A K NT N PA 

TAI I VGKKGQRV-WTGGGD-EETLSKGVYNTYI-EDNLRYSQNAALDMYKEVNTGTNLPA 166 

VIHMSIVPGGKVEVTCAAKGGGSENKSKL AMLNPSDNIVDWVLKTI PTMGAGWCP 185 

I + V G + + C AKGGGS NK+ L A+L P + +++++ * T+G CP 



Sbjct : 


53 


Query: 


71 


Sbjct: 


110 


Query : 


131 


Sbjct : 


167 


Query : 


186 


Sbjct : 


226 


Query : 


246 


Sbjct: 


285 


uery : 


302 F 


Sbjct: 


344 


Query : 


359 


Sbjct : 


; 404 


Query : 


: 419 


Sbjct : 


: 464 


Query 


: 478 


Sbjct 


: 524 



LDVKI LDYPTHAASKPIAMI PNCAATRHVEFELDGSG PVELTPP 301 

D++++ P H AS P+ M +C+A R+++ +++ G +E P 



- EVASWKTGDVLLLNGKI LTGRDAAHKRLVNM 358 



+D G+ELP + I YY TTA RMD + 



+A + YL ++GG AA L ++IK + +A+PELGMEA+++ EV+D P 



+ VD KG 
FILVDDKG 531 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 21 9>: 

m050-l . seq 

1 ATGACCGTCA TCAAACAGGA AGACTTTATC CAAAGCATTT GCGATGCCTT 

51 CCAATTCATC AGCTACTATC ATCCCAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AATCCTGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGTAT GTGTGCGGAA AACAACCGCC CCATCTGCCA 

201 AGACACAGGT ATCGCAACCG TCTTCCTCAA AGTCGGTATG AACGTCCAAT 

251 GGGATGCGGA CATGAGCGTG GAAGAGATGG TTA^CGAAGG CGTACGCCGC 
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301 GCC7ACACTT GGGAAGGCAA TACGCTGCGC GCTTCCGTCC TCGCCGATCC 

351 GGCCGGCAAA CGCCAAAACA CCAAAGACAA CACCCCCGCC GTCATCCATA 

401 TGAGCATCGT GCCGGGCGGT AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

451 GGCGGCTCTG AAAACAAATC CAAACTCGCC ATGCTCAATC CTTCCGACAA 

SOI CATCGTCGAT TGGGTATTGA AAACCATCCC GACCATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGT ATCGGCATCG GCGGCACGCC CGAAAAAGCC 

601 GTGCTGATGG CAAAAGAGTC CCTGATGAGC CACATCGACA TTCAAGAATT 

651 GCAGGAAAAG GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAA GTCAACGCGC TGGGCATCGG CGCACAAGGC 

751 TTGGGCGGAC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTATCCGAC 

801 CCACGCCGCC TCCAAACCGA TTGCCATGAT TCCGAACTGC GCCGCCACCC 

851 GCCACGTCGA ATTTGAATTG GACGGCTCAG GCCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGGCC CGA7TTGACT TACAGCCCCG ACAACGT-C'AA 

951 ACGCGTCGAT GTCGACAAGC TGACCAAAGA AGAAGTGGCA AGCTGG^AAA. 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCACACAAAC GCCTCGTCGA TATGCTCAAC AAAGGCGAAG AATTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGATGAAGT CGTCGGTCCG GCAGGTCCGA CCACAGCCAC CCGCATGGAC 

1201 AAATTCACCC GCCAAATGCT CGAACAAAC Z G/.CCTCTTGG GCATGATCGG 

12-1 CAAATCCGAG CGCGGCGTGG CCACCTGCGA AGCCATCGCC GACAACAAAG 

13 01 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CGTATCTCGT GGCAAAAGCC 
1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGCA TGGAAGCCAT 
1401 TTACGAATTT GAAGTCAAAG ACATGCCCGT AACCGTCGCC GTAGATAGCA 

14 51 AAGGCGAATC CATCCACGCC ACCGCCCCGC GCAAATGGCA GGCGAAAATC 
1501 GGCATCATCC CCGTCGAATC TTGA 

This corresponds to the amino acid sequence <SEQ ID 220; ORF 050- 1>: 

a050-l .pep 

1 MTVIKQEDFI QSICDAFQFI SYYHPKDYID ALYKAWQKEE NPAAKDAMTQ 

51 ILVNSRMCAE NNRPICQDTG IATVFLKVGM NVQWDADMSV EEMVNEGVRF, 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGG KVEV71AAKG 

151 GGSENKSKLA HLNPSDNIVD WVLKTIPTMG AGWCPPGILG IGIGGTPEKA 

201 VLMAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

251 LGGLTTVLDV KILDYPTKAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVEDWPDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKI LTGRDA 

351 AHKRLVDMLN KGEELPVDFT NRLI YYVGPV DPVGDEWGP AGPTTATRMD 

■11 Vr'IV^MLEQT DLLG.-iIGKSE RGVATCEAIA DNKAVYLMAV GGAAYLVAKA 

4 51 IKSSKVLAFP ELGMEAI YEF EVKDMPVTVA VDSKGESIHA TAPRKWQAKI 

501 GIIPVES* 

m050-l/g050-l 98.2% identity in 507 aa overlap 

10 20 30 40 50 60 

m050-l .pep MTVIKQEDFIQSICDAFQFISYYHPKDYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 
! I I I I I I I I I ! I I I I I I i I I I I I It I I I t t I I! I I 1 If f I i ! ! ! I ! I I I I ! I I i I I I M [ 
g050-l MTVIKQEDFI QS1CDAFQFI SYYHPKDYIDALYKAWQKEENPAAKDAiMTQI LVN3RMCAE 

10 20 30 40 50 60 

70 80 90 100 110 120 

m050-l .pep NNRPICQDTGIATVFLKVGMNVQWDADMSVEEMVNEGVRRAYTWEGNTLPASVLADPAGK 
1 I i I I t I I II I II t I I I I I t : t I I I i II I I I : M I I I 1 ! I I i I i I M I 1 I I I I II I I I I I 
gO 50 - 1 NNRP I CQDTGIATVFLKVGMDVQWDADMS VEKMVNEGVRRA YTWEGNTLRAS VLADPAGK 

70 80 90 100 110 120 

130 140 150 160 170 180 

m050-l .pep RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 
1 I I I ! I 1 I i I I M I M I I t 1 111 I I 1 i I i I M I I if I i I I I I M M I I I I N I I I I ! I I I 
g05>0-l RQNTKDNTPAVIHMSIVPGGKVEVTCAAKGGGSENKSKLAMLNPSDNIVDWVLKTIPTMG 

130 140 150 160 170 180 

190 200 210 220 230 240 

m0S0-l . pep AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELOEKAASGAELSTTEALRLELFEK 
I i 1 I I I I I I I I ! I ! I I I I II I I I I I i ! I I i I M I I I I I i I I I I I I I I I I I I I I I I I I I I I 
g050-l AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 

190 200 210 220 230 240 

250 260 270 280 290 300 

m050~l . pep VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP 
! i I I I I I ! I I i I i I I I I I I I I I M II I I I I I I I I I I I I I I I I I ! I I I I I I I I I I ! i I I I I 
g050-l VNALGIGAQGLGGLTTVLDVKILDYPTHAASKPIAMIPNCAATRHVEFELDGSGPVELTP 

250 260 270 280 290 300 
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310 320 330 340 350 360 

m050-l .pep PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKI LTGRDAAHKRLVDMLN 
Mil! f 1 I I I I I I I I I I I I I I I I I I I I I I I t I i I I I I I ! ! t I I I I I I ( I II I I I I : I ! : 
g050-l PRVEDXPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVNMLD 

310 320 330 340 350 3d0 



370 380 390 400 410 420 

m050-l . pep KGEELPVDFTNRLI YYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLL-GMIGKSE 
1 I ! i I I I I I I I I I I I I I I M [ I i ! I I I I [ I I I I I i I | i | I ! I I M I f : I I I ' 11 I I I I I 
g0 5 0-l KGEELPVDFTNRLI YYVGPVDPVGDEVVGPAGPTTATFWDKFTRQMLKQTGLLGMIGKSE 

370 380 390 400 410 420 



430 440 450 460 470 4S0 

m050-l . pep RGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAI YEFEVKDMPVTVA 
I f : I I i I M I I I II I I I I ! I I I I I II I I I I I I I I I I I I I I I I I I I I : I I i i i I I I ! I I I I 
g 050-1 RGAATCEAIADNKAVYLHJvVGGA/iYLVAKAIKSSKVLAFPELGMEAV YEFEVKDMPVTVA 

430 440 450 460 470 480 



490 500 
m050-l .pep VDSKGESIHATAPRKWQAKIGI IPVESX 
I I I I I I II I I I I ! I I I I I I I I I I ! I I I I 
g050-l VDSKGESIHATAPRKWQAKIGIIPVESX 

490 500 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 22 1>: 

a050-l . seq 

1 ATGACCGTCA TCAAACAGGA AGACTTTATC CAAAGCATTT GCGATGCCTT 

51 CCAATTCATC AGCTACTACC ATCCCAAAGA CTACATCGAC GCGCTTTATA 

101 AGGCGTGGCA GAAGGAAGAA AACCCCGCCG CCAAAGACGC GATGACGCAG 

151 ATTTTGGTCA ACAGCCGCAT GTGTGCCGAA AACAACCGCC CCATCTGCCA 

201 AGATACCGG^ ATCGCGACCG TGTTTTTGAA AGTCGGTATG GATGTGCAAT 

251 GGGATGCAGA CATGAGCGTC GAAGAGATGG TTAACGAAGG CGTGCGCCGC 

301 GCCTACACTT GGGAAGGCAA TACGCTGCGC GCTTCCGTTC TCGCCGACCC 

351 CGCCGGCAAA CGCCAAAATA CCAAAGACAA CACGCCCGCC GT CATC CAT A 

401 TGAGCATCGT GCCGGGCGAC AAAGTCGAAG TAACCTGCGC GGCAAAAGGC 

4 51 GGCGGTTCTG AAAACAAATC CAAACTCGCC ATGCTCAACC CTTCCGACAA 

501 CATCGTCGAT TGGGTATTGA AAACCATTCC GACCATGGGC GCGGGCTGGT 

551 GTCCTCCCGG CATCTTGGGC ATCGGCATCG GCGGTACGCC CGAAAAAGCC 

601 GTGTTGATGG CGAAAGAATC CCTGATGAGC CACATCGACA TCCAAGAATT 

651 GCAGGAAAAA GCCGCGTCCG GCGCGGAATT GTCCACCACC GAAGCCCTGC 

701 GCCTCGAACT CTTTGAAAAA GTCAACGCGC TAGGCATCGG CGCGCAAGGC 

751 TTGGGCGGTC TGACCACCGT GTTGGACGTG AAAATCCTCG ATTACCCGAC 

801 CCACGCCGCC TCCAAACCGA TTGCCATGAT TCCGAACTGC GCCGCCACCC 

851 GCCACGTCGA ATTTGAATTG GACGGCTCAG GCCCTGTCGA ACTCACGCCG 

901 CCGCGCGTCG AAGACTGGCC CGATTTGACT TACAGCCCCG ACAACGGCAA 

951 ACGCGTCGAT GTCGACAAGC TGACCAAAGA AGAAGTGGCA AGCTGGAAAA 

1001 CCGGCGACGT ATTGCTGTTG AACGGCAAAA TCCTCACCGG CCGCGATGCC 

1051 GCACACAAAC GCCTCGTCGA TATGCTCGAC AAAGGCGAAG AATTGCCCGT 

1101 CGATTTCACC AACCGCCTGA TTTACTACGT CGGCCCCGTC GATCCGGTCG 

1151 GCGACGAAAT CGTCGGCCCA GCAGGTCCGA CCACCGCCAC CCGCATGGAC 

1201 AAATTCACCC GCCAAATGCT CGAACAAACC GACCTCTTGG GCATGATCGG 

12 51 CAAATCCGAG CGCGGCGCGG CCACCTGCGA AGCCATCGCC GACAACAAAG 

1301 CCGTGTACCT CATGGCAGTC GGCGGCGCGG CGTATCTCGT GGCAAAAGCC 

1351 ATCAAATCTT CCAAAGTCTT GGCGTTCCCC GAATTGGGCA TGGAAGCCAT 

14 01 TTACGAATTT GAAGTCAAAG ACATGCCCGT AACCGTCGCC GTAGACAGCA 

14 51 AAGGCGAATC CATCCACGCC ACCGCCCCGC CCCAATGGCA GGCGAAAATC 

1501 GGCATCATCC CCGTCAAATC TTGA 



This corresponds to the amino acid sequence <SEQ ID 222; ORF 050-1 .a>: 

a050-l .pep 

1 MTVIKQEDFI QSICDAFQFI SYYHPKDYID ALYKAWQKEE NPAAKDAMTQ 

51 ILVNSRMCAE NNRPICQDTG IATVFLKVGM DVQWDADMSV EEMVNEGVRR 

101 AYTWEGNTLR ASVLADPAGK RQNTKDNTPA VIHMSIVPGD KVEVTCAAKG 

151 GGSENKSKLA MLNPSDNIVD WVLKTIPTMG AGWCPPGILG IGIGGTPEKA 

201 VLMAKESLMS HIDIQELQEK AASGAELSTT EALRLELFEK VNALGIGAQG 

251 LGGLTTVLDV KILDYPTHAA SKPIAMIPNC AATRHVEFEL DGSGPVELTP 

301 PRVEDWPDLT YSPDNGKRVD VDKLTKEEVA SWKTGDVLLL NGKILTGRDA 

351 AHKRLVDMLD KGEELPVDFT NRLI YYVGPV DPVGDEIVGP AGPTTATRMD 

4 01 KFTRQMLEQT DLLGMIGKSE RGAATCEAIA DNKAVYLMAV GGAAYLVAKA 

4 51 IKSSKVLAFP ELGMEAI YEF EVKDMPVTVA VDSKGESIHA TAPPQWQAKI 

501 GIIPVKS* 
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a050-l/m050-l 98.4% identity in 507 aa overlap 

10 20 30 40 50 60 

a050-i .pep MTVIKQEDFlQSICDAFOFISYYHPKDYIDftLYKAWQKEENPAAKDAMTQILVNSRMCAE 
I ! I I I I I II I M I I M 1 I ! i ! t I I i i I I 1 I M I I I i ! H I I! 11 I M I M I I I M I I 111 
mQ50-l MTVIKQEDFIQSICDAFQFISYYHPKOYIDALYKAWQKEENPAAKDAMTQILVNSRMCAE 
10 20 30 40 50 60 

70 80 90 100 110 120 

aO 50-1 .pep nnrpicqdtgiatvflkvgmdvqwdadmsveemvnegvrraytwegntlrasvladpagk 

I I I I I I f I I I I I I I I ! I I i I : II I I I I I I I I I I I I I I I I I I I M I I I i M I M ! I I I I I I 
m050-l NNRPICQDTGIATVFLKVGMNVOWDADMSVEEMVNEGVRRAYTWEGNTLRASVLADPAGK 

70 80 90 100 110 120 

130 140 150 160 170 180 

a050-l .pep rqntkdntpavihmsivpgdkvevtcaakgggsenksklamlnpsdnivdwvletiptmg 

I I I I I I I I i I I I I j I M I i I I I I I I I II I I I I I I 1 I M I I I I M I I I I I f I 1 I I I i I I I 

m050-l rqntkdntpavihmsivpggkvevtcaakgggsenksklamlnpsdnivdwvlktiptmg 

130 140 150 160 170 180 

190 200 210 220 230 240 

a0 5Q-l .pep AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 
I I i I I I I I t I I ! I II I I 1 i I I ! I I I II ( I I I I I M I U ! I I I! I t I I I I t I II i M t 1 i I 
m0 50-l AGWCPPGILGIGIGGTPEKAVLMAKESLMSHIDIQELQEKAASGAELSTTEALRLELFEK 

190 200 210 220 230 240 

250 260 270 280 290 300 

a 0 5 0 - 1 . pep VNALG I GAQGLGGLTTVLDVKI LDYPTHAASKPI AMI PNCAATRHVEFELDGSGPVELT P 
I I i I I 1 I I) ! I i I I I I I i i i M i I 1 it 1 II i i i I ! 1 II I t t I t i I I I I i II I i I ( i I I I t 
m050-l VNALGI GAQGLGGLTTVLDVKI LDYPTHAASKPI AMI PNC AATRHVEFELDGSGPVELT? 

250 260 270 280 290 300 

310 320 330 340 350 360 

a050-l . pep PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVDMLD 
I I I I I II I I I I I I I I I I I I i I I I I I I I I I I I ! I I I I I I I I I I I f I I i II M II ! I 1 I I ! : 
m050-l PRVEDWPDLTYSPDNGKRVDVDKLTKEEVASWKTGDVLLLNGKILTGRDAAHKRLVDMLN 

310 320 330 340 350 360 

370 380 390 400 410 420 

aG5G- 1 .pep KGEELPVDFTNRLIYYVGPVDPVGDEI VGPAGPTTATRMDKFTRQMLEQTDLLGMIGKSE 
I I I ! I I I I I I I I I II I I I I I I I t I I I : i ! i I I I I I I I I I I II I I I I I I I I I 1 i i I I I I I I 
m050-l KGEELPVDFTNRLI YYVGPVDPVGDEVVGPAGPTTATRMDKFTRQMLEQTDLLGMIGKSE 

370 380 390 400 410 420 

430 440 450 460 470 480 

a050-l .pep RGAATCEAI ADNKAVYLMAVGGAA YLVAKA IKS SKVLAFPELGMEA I YEFEVKDMPVTVA 
! I : I I I I I I I i ! I I i I I I i I I II M II I I I I I I I ! t t I I I I 1 I M I i I i I ( I 1 111 I M i 
m05Q-l RGVATCEAIADNKAVYLMAVGGAAYLVAKAIKSSKVLAFPELGMEAI YEFEVKDMPVTVA 

430 440 450 460 470 480 

490 500 
a050-l .pep VDSKGESIHATAPPQWQAKIGI IPVKSX 
I ! t ! I I I 1 I I 1 I I : M I 1 II M I 1 : ! I 
mO50 - 1 VDSKGESI HATAPRKViQAKIGI I PVESX 

490 500 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 223>: 

g052 . seq 

1 ATGGCTTTGG TGGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGCGAGCCG ACGGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCGCCC 

151 AAGGGGTTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCATTTA TATCAGTCGG CGACACGCGG CTCACTCCGA 

251 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 

3 01 AGGCTGCGGC TGGAAACCAC ATGGTCGCCC GCCTGCAGGA AGGTGAAAAA 

351 CGCCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 224; ORF 052.ng>: 

g052 .pep 



WO 99/57280 



PCT/US99/09346 



258 



1 MALVAEETEI SAPCFKGCEP TGDSRLLSTT KSAPMPCANS AXASKSATSP 

51 KGLDGVSKNS SLVLALTAAF HSFI5VG DTR LTPMPMLVTM LLIKPTWPN 

101 RLRLETTYJSP ACRKVKNAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 225>: 

m05 2 . seq 

1 ATGGCTTTGG TGGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGCGAGCCG ACGGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCGCCC 

151 AAGGGGTTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCATTTA TATCAGTCGG CGACACGCGG CTCACTCCGA 

2 51 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 
301 AGGCTGCGGC TGGAAACCAC A7GGTCGCCC GCCTGCAGGA AGGTGAAAAA 

3 51 CGCCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 226; ORF 052>: 

m052 .pep 

1 MALVAEETEI SAPCFKGCEP TGDSRLLSTT KSAPMPCANS AKAS KSATSP 

51 KGLDGVSKNS SLVLALTAAF HSFISVG DTR LTPMPNLVTM LLIKPTWPN 

101 RLRLETTWSP ACRKVKNAA* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 227>: 

a052 . seq 

1 ATGGCTTTGG TCGCGGAGGA AACGGAAATA TCCGCGCCGT GTTTCAAAGG 

51 CTGAGAGCCG ACAGGCGACA GCAGGCTGTT GTCCACCACC AAGAGCGCGC 

101 CGATGCCGTG CGCCAATTCC GCCAAGGCTT CCAAGTCGGC CACTTCTCCC 

151 AAGGGATTGG ACGGCGTTTC CAAAAACAGC AGTTTGGTGT TGGCTTTGAC 

201 GGCGGCTTTC CATTCGTTTA TATCAGTCGG CGACACGTGA CTCACTTCGA 

251 TGCCGAATTT GGTAACGATG TTATTGATAA AGCCGACGGT CGTGCCGAAC 

301 AGGCTGCGGC TGGAAATCAC ATGGTCGCCC GCCTGCAAAA AGGTGAAAAA 

351 CGCCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 228; ORF 052. a>: 

a052 . pep 

1 MALVAEETEI SAPCFKG*EP TGDSRLLSTT KSAPMPCANS AKASKSATSP 

51 KGLDGVSKNS SLVLALTAAF HSFISVG DT* LTSMPNLVTM LLIKPTWPN 

101 RLRLEITWSP ACKKVKNAA* 



m052/a052 95.8% identity over a 1 19 aa overlap 

10 20 30 40 50 60 

m052 . pep MALVAEETEISAPCFKGCEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 
I I M I I ! I I I M I I I I I 1 I I M I I I I I I I 1 I I M i I I I M I It I I I! I I I I I ! I I I I I I 
a0 52 MALVAEETEI SAPCFKGXEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 

10 20 30 40 50 60 



70 80 90 100 110 120 

m052 . pep SLVLALTAAFHSFISVGDTRLTPMPNLVTMLLIKPTVVPNRLRLETTWSPACRKVKNAA:< 
i I M I I M M M M I M I I I I If I I I I I li I ( M I II I M I I! I I I I II : I I I t I I i 
a052 SLVLALTAAFHSFI SVGDTXLTSMPNLVTMLLIKPTWPNRLRLEITWSPACKKVKNAAX 

70 80 90 100 110 120 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 052 shows 95.8% identity over a 1 19 aa overlap with a predicted ORF (ORF 052. ng) 
from N. gonorrhoeae: 

m052/g052 

10 20 30 40 50 60 

m05 2 . peD MALVAEETEISAPCFKGCEPTGDSRLLSTTKSAPMPCANSAKASKSATSPKGLDGVSKNS 

llllMlllllliltlli;illlil]!lllll!llllllll!!llllll!l)li;ll!!! 
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g052 



MALVAEETEISAPCFKGCEPTGDSRLLSTTKSAPMPCA2-3SAKASKSATSPKGLDGVSKNS 
10 20 30 40 50 60 



m052 . pep 



70 80 90 100 110 120 

SLVLALTAAFKSFISVGDTWLTSMPNLATMLLIKPTWPNRLRLEITWSPACKKVKNAAX 



g052 




70 80 90 100 110 12C 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 229>: 

g073 . seq 



1 ATGTGTATGC CATACGCAAT AAGGGTTTCA GACGGCATCT GCCGCATTTT 

51 TCCGCCGATG CCGTCTGAAA CACGCAATCA GCGCGCGAGT GCCTGTTTCA 

101 AATCGTCAAT CAAATCGCCA .\CATATTCCA AACCGACCGA CAGGCGCACC 

151 AGTCCGGGGC GGatacCGGC GGCGAGTTTT TCTTCGGGCT GCATCCTGCC 

201 GTGCGTGGTT GTCCACGGAT TGGTGATGGT CGAGCGCACG TCGCCGAGGT 

2 51 TGGCGGTACG GGAAAAGAGT TCCACGACTT TCCACGCGGC TGCTTGGTCG 

3 01 GCGACTTCAA AACCGATGAC GATGCCGCCG CCGTTTTGCT GTTTGCGGAT 
3 51 AAGCTCCGCC TGCGGATGGT CGGGCAATCC GGTGTAG 



This corresponds to the amino acid sequence <SEQ ID 230; ORF 073, ng>; 

g073 .pep 

1 MCMPYAIRVS DGICRIFPPM PSETRNQRAS ACFKSSIKSP TYSKPTDRRT 

51 SPGRIPAASF SSGCILPCW VHGLVMV SRT SPRLAVREKS STTFHAAAWS 

101 ATSKPMTMPP PFCCLRISSA CGWSGNPV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 23 1>: 



1 ATGTGTATGC CATATAAGAT AAGGGTTTCA GACGGCATCT GCTGTCCAAT 

51 GCCGTCTGAA ACACGCAATC hGCGTGCGAG TGCCTGTTTC AAATCGTCAA 

1C- TCAAATCdCC AACATATTCC AAACCGACCG ACAGGCGCAC CAATCCGGGG 

151 CGGATGTTGG CGGCGAGTTT TTCTTCGGGC TGCATCCTGC CGTGCGTGGT 

201 TGTCCACGGG TGGGTAATGG TCGAGCGCAC GTCACCGAGG TTGGCGGTGC 

251 GGGAAAAGAG TTCCACGCCG TCCACAACTT TCCACGCGGC TTCTTGATCG 

301 GCAACTTCAA AGCCGATGAC GATGCCGCCG CCGTTTTGCT GTTTGCGGAT 

351 AAGCGCCGCC TGAGGATGGT CGGACAATCC GGTGTAG 



This corresponds to the amino acid sequence <SEQ ID 232; ORF 073>: 

m07 3 .pep 

1 MCMPYKIRVS DGICCPMPSE TRNQRASACF KSSIKSPTYS KPTDRRTNPG 
51 RMLAASFSSG C1LPCVWHG WVMVERTSPR LAVREKSSTP STTFHAASXS 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 23 3>: 



1 ACGTGTATGT CATATAAGAT AAGGGTTTCA GACGGCATTT GCGGTGTTTT 

51 TCCGCCGATG CCGTCTGAA . CACGCAATCA GCGCGCGAGT GCCTGTTTCA 

101 AATCGTCAAT CAAATCGCCA ACATATTCCA AACCGACCGA CAGGCGCACC 

151 AATCCGGGGC GGATGTTGGC GGCGAGTTTT TCTTCGGGCT GCATCCTGCC 

201 GTGCGTGGTT GTCCACGGAT GGGTAATGGT CGAGCGCACG TCGCCGAGGT 

251 TGGCGGTACG GGAGAAAAGT TCGACGCCGT CCACGACTTT CCACGCGGCT 

301 GCTTGGTCGG CGACTTCAAA GCCGATGACG ATGCCGCCGC CGTTTTGCTG 

351 TTTGCGGATA AGCTCCGCCT GAGGATGGTC GGGTAATCCG GTGTAA 

This corresponds to the amino acid sequence <SEQ ID 234; ORF 073. a>: 

a073 .pep 

1 TCMSYKIRVS DGICGVFPPM PSEXRNQRAS ACFKSSIKSP TYSKPTDRRT 

51 NPGRMLAASF SSGCILPCW VHGWVMV ERT SPRLAVREKS STPSTTFHAA 

101 AWSATSKPMT MPPPFCCLRI SSA*GWSGNP V* 



m0 73 . seq 



101 ATSKPMTMPP PFCCLRISAA XGWSDNPV* 



a073 . seq 



m073/a073 92.3% identity over a I3C aa overlap 
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10 2 0 30 4 0 50 

mQ7 3 .pep MCMPYKIRVSDGICC---PMPSETRNQRASACFKSSIKSP7YSKPTDRP.TNPGRMLAASF 

1 1 win wn i 1 1 i 1 1 : 1 1 1 1 1 1 1 1 it 1 1 1 1 ; ' I I I 1 ! I f J I J f M I I 1 I J I ! 

a 07 3 TCMSYKIRVSDGICGVFPP^3PSEXRNQRASACFKSSIK5?TYSKPTDRRTNPGRMLAA5F 

10 20 30 4 0 50 60 



60 70 80 90 100 110 

m07 3.pep SSGCILPCVVVHGWVMVERTSPRLAVREKSSTPSTTFHAASXSATSKPf-JTMPPPFCCLRI 
i | i t I M I I I I i I I I I I I I I I I I I I I I M I II I I i I I I I I : I I 1 I I I 1 M i M I ! I I i I 
a 07 3 SSGC I L PCVVVHGWVMVERTS PRLAVP.EKSST PSTTFHAAAWS AT SK PMTM? ? PFCCLRI 

70 30 SO 100 110 120 



120 129 
m07 3 . pep SAAXGWSDNPVX 

I : I I I I I Nil 
a0 n 3 SSAXGWSGNPVX 
130 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 073 shows 87.0% identity over a 131 aa overlap with a predicted ORF (ORF 073.ng) 
from N. gonorrhoeae: 

m073/g073 

10 20 30 40 50 

m073 . pep MCMPYKIRVSDGICC- - - PMPSETRNORASACFKSS I KS PTYSKPTDRRTNPGRMLAASF 

I I I I I lllillll I I I 1 I I I I II I I I I I I I ! I I 1 II 1 M I I I I I I : I I I : I I II 

g0 7 3 MCMPYAIRVSDGI CRI FPPMPSETRNQRASACFKSS I KS PTYSKPTDRRTS PGRI PAASF 

10 20 30 40 50 60 



60 70 80 90 100 110 

m073 ,pep SSGCILPCVWHGWVMVERTSPRLAVREKSSTPSTTFHAASXSATSKPMTMPPPFCCLRI 

IIIIIMIIIIII IMIMIIIIIIIIIIII liiih IMIIIIIIIII1IIIII 

g073 SSGCI LPCVVVHGLVMVERTS PRLAVRE KSST- - - TFHAAAWSATSKPMTMPPPFCCLRI 

70 80 90 100 110 



120 129 
mO 7 3 . pep SAAXGWSDNPVX 

hi III I II I 
gO 7 3 SS ACGWSGNPVX 

120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 23 5>: 



g075 . seq 












I 


ATGCCGCCTT 


ACTTCATCAC 


CCTCTTAACG 


ATGGAAAATA 


CAAAAAGCGC 


51 


GGCGAAAACG 


CCCACTACAA 


TCCAACCGGC 


TTCCATACCG 


TCCGCTTTTG 


101 


CGGCTTCCAA 


AGCGTTTTTT 


GCCGTTTCGG 


GCAACGCTGC 


GTTTGCCTGT 


151 


GCCGCCAAAG 


CCAGCGGGGC 


GGCTGTTACA 


ACAGCCAGTT 


TTGCGCCGTA 


201 


TTTACGGCAG 


GTGTTAATAA 


ATTTCATGAT 


ATTTTCCTTT 


ACGAAATTTT 


251 


TAAAAAAATG 


TGTTTGCGGG 


CTTTGTGAAG 


GTTTTAGAGA 


CCGCCTGCCG 


301 


GGCCTCTTAA 


ACTTAATCTT 


CTTTTTCGTA 


GAATCCGAAA 


ATTACAAATT 


351 


CCCCGCCTAT 


CTCTTCCAAT 


GCCGAGCTAA 


AAGCGTCTTC 


ATAGCTGTCA 


401 


TATTTACCGG 


CTGA 









This corresponds to the amino acid sequence <SEQ ID 236; ORF 075.ng>: 

g07 5 .pep 

1 MPPYFITLLT MENTKSAAXT PTTIQPASIP SAFAASKAFF AVSGNAAFAC 

51 AAKASGAAVT TASFAPYLRQ VLINFMI FSF TKFLKKCVCG LCEGFRDRLP 

101 GLLNLIFFFV ESENYKFPAY LFQCRAKSVF IAVIFTG* 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 237>: 



m075 . seq 












2 


ATGCCGTCTT 


ACTTCATCAC 


TCTCTTAACG 


ATGGAAAATA 


CAAAAAGCGC 


51 


GGCGAAAATG 


CCCACTACAA 


TCCAACCGGC 


TTCCATACCG 


TCCGCTTTTG 


101 


CGGCTTCCAA 


AGCGTTTTTT 


GCCGTATCGG 


GCAACGTTGC 


ATTTGCATGT 


151 


GCGGCCAAAG 


CCAGGGGAGC 


AGCTGTTACA 


ACAGCCAGTT 


TTGCGCCGTA 


201 


TTTACGGCAG 


GTGTTAATAA 


ATTTCATGAT 


ATTTTCCTTC 


AAAAAGTGTT 


251 


TGGCGGTAAT 


GGATGGAGCG 


TTTTTCAGAC 


GACCGCCGAA 


CATCCGAAAA 


301 


TCAGTCTTTC 


AAAAATCCGA 


ATACGACAAA 


TTCGTATTGG 


TTGCCGATTT 


351 


CTTCCAAACC 


TGCGTTAATC 


GCTTCTTCGA 


AGTCGTAGAA 


ATAATCGGCA 


401 


TTGGTGATTA 


A 









This corresponds to the amino acid sequence <SEQ ID 23S; ORF 075>: 

m075 . pep 

1 MPSYF ITLLT MENTKSAAKM PTTIQPASIP SAFAASKAFF AVSGNVAFAC 

51 AAKARGAAVT TASFAPYLRQ VLINFMI FSF KKCLAVMDGA FFRRPPNIRK 

101 SVFQKSEYDK FVLVADFFQT CVNRFFEWE IIGIGD* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 075 shows 65.7% identity over a 137 aa overlap with a predicted ORF (ORF 075. ng) 
from N. gonorrhoeae: 

m075/g075 

10 20 30 40 50 60 

MPSYFITLLTMENTKSAAKMPTTIOPASIPSAFAASKAFFAVSGNVAFACAAKARGAAVT 

II Mil HUM I! Mil 1 1 1 M M 1 1 M M M M M I M I M M M I i M i MM! 

MPPYFITLLTMENTKSAAKTPTTIQPASIPSAFAASKAFF AVSGNAAFACAAKASGAAVT 
10 20 30 40 50 60 

70 30 90 100 110 

TASFAPYLRQVLINFMIFSF KKCLAVMDGAFFRRPPNIRKSVFQKSEYDKFVLVAD 

M 1 1 M I M I M M 1 1 1 M I MM = M 1 M= = • I I : : : : I 

TASFAPYLRQVLINFMIFSFTKFLKXCVCGLCEGFRDRLPGLLNLIFFFVESENYKFPAY 
70 80 90 100 110 120 

120 130 
FFQTCVNRFFEWEI IGIGDX 

Ml - (MM 
LFOCRAKS VFI AVI FTGX 
130 



m07 5 .pep 
g075 

m075 -pep 
g075 

m07 5 .pep 
g075 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 239>: 

a075 . seq 

1 ATGCCGTCTT ACTTCATCAC TCTCTTAACG ATGGAAAAGA CAAAAAGCGC 

51 GGCGAAAACG CCCACTACAA TCCAACCGGC TTCCATACCG TCCGCTTTTG 

101 CGGCTTCCAA AGCGTTTTTT GCTGTATCGG GCAACGTTGC ATTTGCATGT 

151 GCGGCCAAAG CCAGGGGAGC AGCTGTTACA ACAGCCAGTT TTGCGCCGTA 

201 TTTACGGCAG GTGTTAATAA ATTTCATGAT ATTTTCCTTC AAAAAGTGTT 

251 TGGCGGTAAT GGATGGAGCG TTTTTCAGAC GACCGCCGAA CATCCGAAAA 

301 TCAGTCTTTC AAAAATCCGA ATACGACAAA TTCGTATTGG TTGCCGATTT 

351 CTTCCAAACC TGCGTTAATC GCTTCTTCGA AGTCGTAGAA ATAATCGGCA 

401 TTGGTGATTA A 

This corresponds to the amino acid sequence <SEQ ID 240; ORF 075. a>: 

a075 .pep 

1 MPSYFITLLT MEKTKSAAKT PTTIQPASIP o.-.FAASKAFF AVSGNVAFAC 

51 AAKARGAAVT TASFAPYLRQ VLIMFMIFSF KKCLAVM DGA FFRRPPNIRK 

101 SVFQKSEYDK FVLVADFFQT CVNRFFEVVE IIGIGD* 
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m075/a07s 98.5% identity over a 13* overlap 

10 20 30 40 50 60 

m075 . pep MPSYFITLLTMENTKSAAKMPTTIQPASIPSAFAASKAFFAVSGNVAFACAAKARGAAVT 
j 1 I I I I ! II ! ! I : ! II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I f ) I I 1 } I I I I I I 
a07 5 MPSYFITLLTMFKTKSAAKTPTTIQPASIPSAFAASKAFFAVSGNVAFACAAKARGAAVT 

10 20 30 40 50 60 



70 80 90 100 110 120 

m07 5.pep TASFAPYLRQVLINFMIFSFKKCLAVMDGAFFRRPPNIRKSVFOKSEYDKFVLVADFFQT 
I I I I I I I I I i I I 1 1 I I I i I I I I I I It I I I I I t I I I I I I If ! i I I i I I ! I I I t i I 1 ! I 1 t I 
a07 5 TASFAPYLRQVLINFMIFSFKKCLAVMDGAFFRRPPNIRKSVFQKSEYDKFVLVADFFQT 
70 80 90 100 110 120 



130 

m07 5.pep CVNRFFEWEI IGIGDX 

I I I I I I I I I 1 I I II 1 i I 
a075 CVNRFFEVVEI IGIGDX 

130 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 24 1>: 

g080 . seq 

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTGGTTTAT 

151 TCCGATAAGA AGGCATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA 

201 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAAGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA TACGGTTGAG 

3 01 GTCGTCCTGA CCGAGCGCAA GCCGGTTGCA CGTTGGGGCG ACCATGCCTT 
351 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGCTTGGAC AGACCCGGAA 

4 01 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 

4 51 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 
501 GATGACCTAT ACGGCACGTT CGGCGTGGAA TGTCGTTTTG GACAACGGCA 

5 51 TCACCGTCAG GCTCGGACGG GAAAAcgaGA TGAAACGCCT CCgGCTTTTT 
601 ACcgAAGCGT GGCAGCATCT gttgcGTAAG AATAAAAATC GGTTATCCTA 
651 TGTGGATATG Aggtataagg acggatttTC agtcccccat gctCCCGACG 
701 GTTTACCCGA AAAAGAATcc gAAGAATatt gggaacaggt ttgggacata 
751 ttacggcctg gcgtcggaaa cggttcgacg caaatttcaa tcagttatAA 
801 GGGCAGacga acaatggaac AGcagtaa 

This corresponds to the amino acid sequence <SEQ ID 242; ORF 080.ng>: 

g080 .pep 

1 MWDNAEAMSR LTRWLLVMMA MLLAASGLW? FYNSNHLPVK QVSLKGNLVY 

51 SDKKALGSLA KEYIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE 

101 WLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR 

151 YDEFSTVLAK QGLGIKEMTY TARSAWNWL DNGITVRLGR ENEMKRLRLF 

201 TEAWQHLLRK NKNRLSYVDM RYKDGFSVPH APDGLPEKES EEYWEQVWDI 

251 LRPGVGNGST QISISYKGRR TMEQQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 243>: 

m080 . seq 

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTGGTTTAT 

151 TCCGATAAGA AGACATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA 

201 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAGGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA CACGGTTGAG 

301 GTCGTCCTGA CCGAGCGCAA GCCGGTCGCG CGTTGGGGCG ACCATGCCTT 

3 51 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGCTTGGAC AGACCCGGAA 

4 01 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 
4 51 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 
501 GATGACCTAT ACGGCACGTT CGGCGTGGAT TGTCGTTTTG GACAACGGCA 
551 TCACCGTCAG GCTCGGACGG GAAAACGAGA TGAAACGCCT CCGGCTTTTT 
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601 ACCGAAGCGT GGCAGCATCT GTTGCGTAAA AATAAAAATC GGTTATCCTA 
651 TGTGGATATG AGGTATAAGG ACGGATTTTC AGTCCGCTAT GCTTCCGACG 
701 GTTTACCCGA AAAAGAATCC GAAGAATAG 

This corresponds to the amino acid sequence <SEQ ID 2441 ; ORF 080>: 

m080 .pep 

1 MWDNAEAMER LTRWLLVMMA MLLAA SGLWi FYN5NHLPVK QVSLKGNLVY 

51 SDKKTLGSLA KEYIHGNILR TDINGAOEAY RRYPWIASVM VRRRFPDTVE 

101 WLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR 

151 YDEFSTVLAK QGLGIKEMTY TARSAWIWL DNGITVRLGR ENEMKRLRLF 

201 TEAWQHLLRK NKNRLSYVDM RYKDGr SVRY ASDGLPEKES EE* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 080 shows 97.9% identity over a 242 aa overlap with a predicted ORF (ORF 080.ng) 
from N. gonorrhoeae: 

~^8Q/g08O 

10 20 30 40 50 60 

MWDNAEAMERLTRWLLVMMAMLLAAS^ 

MlllMllllllflMllMIIIIIIMIIIIIIIiliNIIIIIIIIIIMhllill 

MWDNAE AMERLTRWLLVMMAMLLAASGLWJ F YNS NH L P VKQV S LKGN LVY SDKKALG S LA 
10 20 30 40 50 60 

70 80 90 100 110 120 

KEVIHGNILRTDINGAQEAYRRYPWIASVMVRRRFPDTVEWLTERKPVARWGDHALVDG 

|||!I!I!M1I!IIII!!MIIII1IIIIIIPMI!II!MI1!IMI!I!IIM!!! 

KEYIHGNILRTDINGAOEAYRRYPWIASVMVRRRFPDTVEWLTERKPVARWGDHALVDG 
70 80 90 100 110 120 

130 140 150 160 170 180 

EGNV^EA^-DRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKC^LGIKEMTYTARSAWIVVL 

IIIMIIIIilllMlMMIIIIIIIIIIIIIIMilllMllllMIIIMII! Ill 

EGWFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWNVVL 
130 140 150 160 170 180 

190 200 210 220 230 240 

DNGITVRLGRENEMKRLRLFTEAWQKLLRKNKNRLSYVDMRYKDGFSVRYASDGLPEKES 

!!llilllll!!lllllllllll!MIMIII!il!IIIIMMIIII : I llllllll 

DNGITVRLGRENEMKRLRLFTEAWOHLLRKNKNRLSYVDKRYKDGFSVPHAPDGIiPEKES 
190 200 210 220 230 240 



EEX 
!! 

EEYWEQVWDI LRPGVGNGSTQ IS I S YKGRRTMEQQX 
250 260 270 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 245>: 

a080 . seq 

1 ATGTGGGATA ATGCCGAAGC GATGGAACGG CTGACGCGCT GGCTGCTTGT 

51 CATGATGGCG ATGCTGCTTG CTGCGTCCGG GCTGGTTTGG TTTTACAATT 

101 CGAATCATCT GCCCGTCAAG CAGGTGTCGC TGAAGGGCAA CCTAGTTTAT 

151 TCCGATAAGA AAGCATTGGG CAGTTTGGCG AAAGAATACA TCCATGGGAA 

201 TATTTTGAGG ACGGACATCA ATGGCGCACA GGAGGCCTAC CGCCGGTATC 

251 CGTGGATTGC GTCGGTCATG GTGCGCCGCC GTTTTCCCGA CACGGTTGAG 

301 GTCGTCCTGA CCGAGCGCAA GCCGGTCGCG CGTTGGGGCG ACCATGCCTT 

3 51 GGTGGACGGC GAAGGCAATG TTTTTGAAGC CCGTTTGGAC AGACCCGGAA 

4 01 TGCCGGTATT CAGAGGCGCG GAAGGAACGT CTGCCGAAAT GCTCCGCCGT 
451 TATGACGAAT TTTCGACTGT TTTGGCAAAA CAGGGTTTGG GCATCAAAGA 
501 GATGACCTAT ACGGCACGTT CGGC jTGGAT TGTCGTTTTG GACAACGGCA 



m080 . pep 
080 

m080 .pep 
080 

T,080.^;r 

080 

m080 .pep 
080 

m080 -pep 
080 
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551 TCACCGTCAG GCTCGGACGG GAAAACGAGA TGAAACGCCT CCGGCTTTTT 

601 ACCGAAGCGT GGCAACATCT GTTGCGTAAA AATAAAAATC GGTTATCCTA 

651 TGTGGATATG AGGTATAAGG ACGGATTTTC AGTCCGCTAT GCTCCCGACG 

701 GTTTACCCGA AAAAGAATCC GAAGAATAG 

This corresponds to the amino acid sequence <SEQ ID 246; ORP 0S0.a>: 

a080 . pep 

1 MWDNAEAMER LTRWLLVMMA MLLA ASGLVW FYNSNHLPVK QVSLKGNLVY 

51 SDKKALGSLA KEiIHGNILR TDINGAQEAY RRYPWIASVM VRRRFPDTVE 

101 VVLTERKPVA RWGDHALVDG EGNVFEARLD RPGMPVFRGA EGTSAEMLRR 

151 YDEFSTVLAK QGLGIKEMTY TARSAWIVVL DNGITVRLGR ENEMKRLRLF 

201 TEAWQHLLRK NJKNRLSYVDM RYKDGFSVRY APDGLPEKES EE* 

moso/aoso 99.2% identity over a 242 aa overlap 

10 20 30 40 50 60 

m08 0 . pep MWDNAEAMERLTRWLLVMMAMLLAASGLVWFYNSNHLPVKQVSLKGNLVYSDKKTLGSLA 
I I I i I ! I I I 1 ! I ( ( I I I ! I I ! I I I ( I ! I M ( I I ! \ i I I II I I I I I I i I I ! I I I f : I i i I i 
a 080 MWDNAEAMFRLTRWLLVMMAMLLAASGLVWFYNSNHLPVKQVSLKGNLVYSDKKALGSLA 
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a080 KEYIHGNILRTDINGAQEAYRRYPWIASVMVRRRFPDTVEVVLTERKPVARWGDHALVDG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m080 . pep EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWIVVL 
i I ! I If J I ! I I I I f I f I I I I I I I I I! I 1 I I I I ! I I I J I ) i I ! I M i I M ! ! i 1 1 \ i 1 M 1 
a080 EGNVFEARLDRPGMPVFRGAEGTSAEMLRRYDEFSTVLAKQGLGIKEMTYTARSAWIVVL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m080 . pep DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVRYASDGLPEKES 
I l ! I I I I I t 1 M t I ! i I M M I i I I I I I I I I I I 1 I I I I I i fi I M M M I I I ! I 1 ! I II 
a080 DNGITVRLGRENEMKRLRLFTEAWQHLLRKNKNRLSYVDMRYKDGFSVRYAPDGLPEKES 

190 200 210 220 230 240 



m080.pep EEX 
I I i 

a080 EEX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 247>; 

g081 . seq 

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 

51 GCCGTCTGAA AACAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGATA 

101 TTCGGGAAGG CGATGTGTTT TTCGCATTGG CGGGCGGGCG GTTTGACGCG 

151 CATGATTTTG TTGGAGGCGT ATTGTCTGCG GGCGCGCCGG CGGTTGTGGT 

201 TTCGCGCGAA GATTGCGCGG CTTTGGGCGG CGCGTTGAAA GTCGATGACA 

251 CGCTTGCCGC GTTGCAAACG TTGGCGAAGG CGTGGCGCGA TAATGTGAAC 

301 CCGTTTGTGT TCGGCATTAC CGGTTCGGGC GGCAAGACGA CGGTGAAGGA 

351 GATGCTGGCT GCGGTATTGC GCCGCCGTTT CGGCGATGAT GCCGTTTCGG 

4 01 CGACGGCAGG CAACTTCAAC AACCACAtcg gaTTGCCGCT GACTTTATTG 

4 51 AAATtaaAcg aAAAACACCG CTATGCCGTG ATTGAAATGG GCATGAACCA 

501 TTTTGGcgaa ctggcggtTt taacgcaaaT CGCCAAACCC GATGCCGCTT 

551 TGGtcaACAA CGCCCTGCGC GCCCATGTCG GATGCGGTTt egaeggagtg 

6 01 GGCGATATTG CCAAAGcgaa aagcGAGATT TatgcagGct t ATGTTCAGA 

651 CGGCATGGCA CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA 

701 CGGCAACGTT TAATTTGAAT ACGTGCACTT TCGGCGTCGA TAGCGGCGAT 

751 GTCCGCGCGG AAAATATCGT GCTGAAACCT TTGTCGTGCG AATTTGATTT 
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801 GGTGTGCGGC GACGAGCGCA CTGCCGTGGT GCTGCCTGTT CCCGGCCGCC 

851 ACAA TGTCCA CAACGCCGCC GCTGCCGCCG CGCTGGCTTT GGC7GCCGGT 

901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG CAAGGCTTCA GCAACATCAA 

951 AGGCCGTCTG AACGTCAAAG CCGGCATCAA GGGCGCAACC CTGATTGACG 

1001 ATACTTATAA TGCGAATCCC GACAGTATGA AAGCCGCGGT TGACGTGTTG 

1051 GCGCGTATGC CTGCGCCGCG CATTTTCGTG ATGGGCGATA TGGGCGAACT 

1101 GGGCGAGGAc gaAGCCGCCG CCATGCACGC CGAagtcgGC GCGTACGCCC 

1151 GCGACCAAGG CATCGAAGCG GCTTATTTTG TCGGCGACAA CAGCGTCGAA 

1201 GCGGcggaAA AATTTGGCGC GGACGGTTTG TGGTTCGCCG CCAAAGACCC 

12 51 GTTGATTCAA GTGTTGAGCC ACGATTTGCC CGAACGCGCC ACCGTGTTGG 

13 01 TGAAAGGTTC GCGCTTTATG CAGAtggAAG AAGTGGTCGA GGCATTGGAG 
13 51 GATAAGTga 

This corresponds to the amino acid sequence <SEQ ID 248; ORF 081.ng>: 



g081 . pep 
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MKPLDLNFIC 


QALKLPMPSE 


NKPVSRIVTD 


SRDIREGDVF 


FALAGGRFDA 


51 


HDFVGGVLSA 


GAAAVWSRE 
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DERTAWLPV 


PGRHNVHNAA 


AAAALALAAG 
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AAEKFGADGL 


WFAAKDPLIQ 


VLSHDLPERA 
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The following partial DNA sequence was identified in N, meningitidis <SEQ ID 249>: 

m081 . seq 

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 

51 GCCGTCTGAA AGCAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGACA 

101 TCCGCGCGGG CGATGTGTTT TTCGCATTGG CGGGCGhGCG GTTTGACGCG 

151 CATGATTTTG TTGAAGACGT ATTGGCTGCT GGTGCGGCGG CGGTTGTGGT 

201 TTCGCGCGAA GATTGTGCTG CAATGGATGG CGCGTTG AAA GTCGATGACA 

2 51 CGCTTGCCGC ATTGCAAACG CTGGCAAAGG CGTGGCGTGA AAATGTGAAT 

3 01 CCGTTTGTGT TCGGCATTAC CGGTTCGGGC GGCAAGACGA CGGTGAAGGA 

3 51 AATGCTGGCT GCGGTATTGC GCCgCCGTTT CGGCGATGAT GCCGTGTTGG 

4 01 CGACGGCAGG CAACTTCAAC AACCATATCG GATTGCCGCT GACTTTGTTG 
451 AAGTTAAACG AAAAACACCG CTATGCCGTG ATTGAAATGG GCATGAACCA 
501 TTTCGGCGAA CTGGCGGTTT TAACGCAmAT CGCCAAACCA AATGCCGCAT 

5 51 TGGTCAACAA CGCCATGCGC GCCCATGTCG GCTGCGGTTT CGACGGAGTG 

6 01 GGCGATATTG CCAAAGCGAA AAGCGAGATT TACCAAGGTT TATGTTCAGA 
651 CGGCATTGCA CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA 
701 CGGCAACGCT TAATTTGAAT ACGCGCACTT TCGGCATCGA TAGCGGCGAT 
751 GTTCACGCGG AAAATATTGT GCTGAAACCG TTGTCGTGCG AATTTGATTT 
8 01 GGTGTGCGGC GATGAGCGCG CCGCCGTGGT GCTGCCTGTT CCCGGCCGCC 
8 51 ACAATGTCCA CAACGCCGCC GCTGCCGCCG CGCTGGCTTT GGCTGCGGGT 
901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG AAAGGCTTCA GCAATATCAA 
951 AGGCCGTCTG AACGTCAAAT CCGGAATCAA GGGCGCAACC CTGATTGACG 

1001 ATACTTATAA TGCGAACCCT GACAGCATGA AAGCTGCGAT TGACGTGTTG 

1051 GCGCGTATGC CTGCGCCGCG TATTTTCGTG ATGGGCGATA TGGGCGAACT 

1101 GGGCGAACTG GGCGAGGACG AAGCCGCCGC TATGCACGCC GAAGTCGGCG 

1151 CGTATGCCCG CGACCAAGGC ATCGAAGCGG CTTATTTTGT CGGCGACAAC 

1201 AGCGTCGAAG CGGCGGAAAA ATTTGGCGCG GACGGTTTGT GGTTCGCCGC 

1251 CAAAGACCCG TTGATTCAAG TGTTGCGCCA CGATTTGCCC GAACGCGCCA 

13 01 CCGTGTTGGT GAAAGGTTCG CGCTTTATGC AGATGGAAGA AGTGGTCGAG 

13 51 GCATTGGAGG ATAAGTGA 

This corresponds to the amino acid sequence <SEQ ID 250; ORF 08 1>: 

m081 .pep 

1 MKPLDLNFIC QALKLPMPSE SKPVSF T,r TD SRDIRAGDVF FALAGERFDA 

51 H DFVEDVLAA GAAAVWS RE DCAAMDGALK VDDTLAALQT LAKAWRENVN 

101 PFVFGITGSG GKTTVKEMLA AVLRRRFGDD AVLATAGNFN NHIGLPLTLL 

151 KLNEKHRYAV IEMGMNHFGE LAVLTXIAKP NAALVNNAMR AHVGCGFDGV 
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201 GDI AKAKSEI YQGLCSDGT A LIPQEDANMA VFKTATLNLN TRTFGIDSGD 
251 VKAEMIVLKP LSCEFDLVCG DERAAWLPV PGRKNVHNAA AAAALALAAG 
3 01 LSLNDVAEGL KGFSNIKGRL NVKSGI KGAT LIDDTYNANP DSMKAAIDVL 

3 51 ARMPAPRIFV MGDMGELGEL GEDEAAAMHA EVGAYARDQG IEAAYFVGDN 
401 SVEAAEKFGA DGLWFAAKDP LIQVLRKDLP ERA7VLVKGS RFMQMEEWE 

4 51 ALEDK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 081 shows 94.1% identity over a 455 aa overlap with a predicted ORF (ORF 081 .ng) 
from N. gonorrhoeae: 

m081/g081 



10 20 30 40 50 60 

m081 . pep MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGERFDAHDFVEDVLAA 

MlMllllllilllllllhlllllllMIIII! IMIMIIi M1IIIM 

g081 MKPLDLNFICOALKLPMPSENKPVSRIVTDSRDIREGDVFFAIiAGGRFDAHDFVGGVLSA 

10 20 30 40 50 60 



70 80 90 100 110 120 

m08 1 . pet) GAAAVWSREDCAAMDGALKVDDTLJ^QTLAKAWRE 

11111111111111= iliiliilliliiiil!llhliliiiliillltlli!lillN 

g081 GAAAVWSREDCAALGGALKVDDTLAALQTI^KA^ 

70 80 90 100 110 120 



130 140 150 160 170 180 

m08 1 . pep AVLRRRFGDDAVLATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTXIAKP 
I I I I I I I I I I I I I I I I I I I I I I I I ! 1 I ! I I I I I I I I I I I I I I I I 1 I ! I I I ! I I ! I I I I 
g081 AVLRRRFGDDAVSATAGNFNNHIGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTOIAKP 

130 140 150 160 170 180 



190 200 210 220 230 240 

m081.pep NAALVNNAMRAHVGCGFDGVGDIAKAKSEIYOGLCSDGIALIPQEDANMAVFKTATLNLN 

:|||lili:|lilli!miiilll!illll IliMIMIIUllilllillllhlli 

g081 DAALVNNALRAHVGCGFDGVGDIAKAKSEIYAGLCSDGMALIPQEDANMAVFKTATFNLN 
190 200 210 220 230 240 



250 260 270 280 290 300 

m081 .pep TRTFGIDSGDVHAENIVLKPLSCEFDLVCGDERAAWLPVPGRHNVHNAAAAAALALAAG 

I ll|:IMIMIIIMlMIIMIMIIIIIhllllllll!lllllllllllilll!l 

gO 81 TCTFGVI)SGDVRAENIVLKPLSCEFDLVCGDERTAWLPVPGRHNVHNAAAAAALALAAG 
250 260 270 280 290 300 



310 320 330 340 350 360 

m081 .pep LSLNDVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPDSMKAAIDVLARMPAPRIFV 

IIMMIMhIlllllillllMMIIIIMIIilMllllllhlllllllllllll 

gO 81 LSLNDVAEGLQGFSNI KGRLNVKAGI KGATLIDDTYNANPDSMKAAVDVLARMPAPRI FV 

310 320 330 340 350 360 



370 380 390 400 410 420 

m08 1 . pep MGDMGELGELGEDEAAAMHAEVGAYARDOGIEAAYFVGDNSVEAAEKFGADGLWFAAKDP 

milllll llllliiilliilliiiiMilllliiiiiiilllliiiiimilll 

gO 81 MGDMGELGE DEAAAKHAEVGAYARDQGIEAAYFVGDNSVEAAEKFGADGLWFAAKDP 

370 380 390 400 410 

430 440 450 

mO 81 . pep L I Q V LRHD L P E RATV L VKG S RFMQME E WE ALED KX 

IIIM IIIIIIIIIIIIIIMIIIIIIIIIIIIII 

g 0 8 1 LIQVLSHDLPERATVLVKGSRFMQMEEWEALEDKX 
420 430 440 450 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 251>; 

a081 . seq 

1 ATGAAACCAC TGGACCTAAA TTTCATCTGC CAAGCCCTCA AGCTTCCGAT 

51 GCCGTCTGAA AGCAAACCCG TGTCGCGCAT CGTAACCGAC AGCCGCGACA 

101 TCCGCGCGGG CGATGTGT77 TTCGCATTGG CGGGCGGGCG GTTTGATGCG 

151 CATGATTTTG TTGAAGACGT ATTGGCTGCG GGTGCGGCGG CGGTTGTGGT 

201 TTCGCGCGAA GATTGCGTTG CAATGGATGG CGCGTTGAAA GTCGATGACA 

251 CGCTTACCGC GTTGCAAATG TTGGCGAAGG CGTGGCGCGA GAATGTGAAC 

301 CCGTTTGTGT TCGGTATTAC CGGCTCGGGC GGCAAGACCA CGGTGAAGGA 

351 AATGTTGGCT GCGGTATTGC GCCGCCGTTT CGGCGATA/ T GCCGTTTTGG 

4 01 CGACGGCAGG CAACTTCAAC AACCACATCG GATTGCCGTT GACTTTGTTG 

4 51 AAATTAAACG AAAAACACCG CTATGCCGTG ATTGAAATGG GTATGAACCA 

501 TTTTGGCGAA CTGGCGGTTT TGACACAAAT CGCCAAACCC GATGCCGCAT 

551 TGGTCAACAA CGCCATGCGC GCCCATGTCG GCTGCGGTTT CGACGGAGTG 

601 GGCGATATTG CCAAAGCGAA AAGuGAGATT TATCAAGGCT TATGTTCAGA 

651 CGGCATGGCG CTGATTCCTC AAGAAGATGC CAATATGGCT GTCTTCAAAA 

7 01 CGGCAACGCT TAATTTGAAT ACGCGCACTT TCGGCATCGA TAGCGGCGAT 

7 51 GTCCACGCGG AAAATATCGT GCTGAAACCG TTGTCGTGCG AATTTGATTT 
801 GGTGTGCGGC AACGAGTGCG CAGCCGTGGT TCTGCCCGTT CCCGGCCGCC 

8 51 ACAATGTCCA CAACGCCGCC GCCGCCGCCG CGCTGTCTTT GGCTGCAGGT 
901 TTGAGTTTGA ACGATGTGGC GGAAGGTTTG AAAGGCTTCA GCAATATCAA 

9 51 AGGCCGTCTG AACGTCAAAT CCGGAATCAA GGGCGCAACC CTGATTGACG 
1001 ATACTTATAA TGCGAACCCT GACAGCATGA AAGCTGCGGT TGACGTGTTG 
10 51 GCGCGTATGC CTGCGCCGCG TATTTTCGTG ATGGGCGATA TGGGCGAACT 
1101 GGGTGAGGAC GAAGCCGCCG CCATGCACGC CGAAGTCGGC GCGTACGCCC 
1151 GCGACCAAGG CATCGAAGCG GCTTATTTTG TCGGCG;,3AA CAGCGTCGAA 
1201 GCGGCGGAAA AATTTGGCGC GGACGGTTTG TGGTTCGCCG CCAAAGACCC 
12 51 GTTGATTCAA GTGTTGCGCC ACGATTTGCC CGAACGCGCC ACCGTGTTGG 
1301 TGAAAGGTTC GCGCTTTATG CAGATGGAAG AAGTGGTCGA GGCATTGGAG 
1351 GATAAGTGA 

This corresponds to the amino acid sequence <SEQ ID 252; ORF 081.a>: 

a081 .pep 

1 MKPLDLNFIC QALKLPMPSE SKPVSRIVTD SRDIRAGDVF FALAGGRFDA 

51 HDFVEDVLAA GAAAWVSRE DCVAMDGALK VDDTLTALQM LAKAWRENVN 

101 PFV FGITGSG GKTTVKEMLA AVLRRRFGDN AVLATAGNFN NHIGLPLTLL 

151 KLNEKHRYAV IEMGMNHFGE LAVLTQIAKP DAALVNNAMR AHVGCGFDGV 

201 GDIAKAKSEI YQGLCSDGMA LIPQE DAN MA VFKTATLNLH TRTFGIDSGD 

251 VHAENIVLKP LSCEFDLVCG NECAAWLPV PGRHNVHNAA AAAALSLAAG 

301 LSLNDVAEGL KGFSNIKGRL NVKSGIKGAT LIDDTYNANP DSMKAAVDVL 

351 ARMPAPRIFV MGDMGELGED EAAAMHAEVG AYARDQGIEA AYFVGDNSVE 

4 01 AAEKFGADGL WFAAKDPLIQ VLRKDLPERA TVLVKGSRFM QMEEVVEALE 

451 DK* 



m08i/aOBi 96.7% identity over a 455 aa overlap 



10 20 30 40 50 60 

m081 . pep MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGERFDAHDFVEDVLAA 
( i i 1 I t I I 1 I t ! I I li M ! I i i I I t I I M I ! I I I i 1 I i II I i 1 1 I ! } I i 1 f I 11 I II I I 
a081 MKPLDLNFICQALKLPMPSESKPVSRIVTDSRDIRAGDVFFALAGGRFDAHDFVEDVLAA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m081 . pep GAAAWVSRE DCAAMDGALKVDDTLAALQTLAKAWRENVN P FV FG I TGSGGKTTVKEMLA 

I ! I M I f I I i I I : I I I I I 1 I I I I I I : I I 1 I M I | | | | | M I I I i I I I I I II I M I I II I 
a08 1 GAAAWVSRE DC V AM DGALKVDDTLTALQMLAKAWRENVNPFVFG I TGSGGKTTVKEMLA 

70 80 90 100 110 120 



130 140 150 160 170 130 

m081 . pep AVLRRRFGDDAVLATAGNFliNHIGLPLTLLKLNEKKRYAVIEMGMNHFGELAVLTXIAKP 
I t M I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 1 1 I I I llll 
a081 AVLRRRFGDNAVLATAGNFNNrilGLPLTLLKLNEKHRYAVIEMGMNHFGELAVLTQIAKP 
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130 140 150 160 170 180 

190 200 210 220 230 240 

m081 . nep NAALVNNAMRAHVGCGFDGVGDIAKAKSEIYQGLCSDGIALIPQEDANMAVFKTATLNLN 
: I I I I I I I i I I i I I I I I I I i i ! f f i I I I I I I I I I I I f I : I ! t t I I f I I I I I I I I I I i I I I 
a 0 8 : D AA LVN NAMRA H VG CG F DG VG D I A KAK S E I YQG LC S DOHA L I PQE D ANM A V FK TATLNLN 

190 200 210 220 230 240 



250 260 270 2S0 290 300 

m081 .oep TRTFGIDSGDVHAENIVLKPLSCEFDLVCGDE RAA VV L P V P G R H N V H N AAAAAA LA L AAG 

I I j I I I | M I i I I I i i I I I M I I I I I I I M : I I f I I I I I I I 1 I I I f I 1 j I i f I I : I I I I 
a 0 8 1 TRTFGI DSGDVHAEN I VLKPLSCE F D L V C G N E C AA V V L P V P G R H N V H N AAAAAA L S LAAG 

250 260 270 230 290 300 



310 320 330 340 350 360 

m081 . pep LSLNDVAEGLKGFSNI KGRLNVKSGIKGATLIDDTYNANPDSMKAAI DVLARMPAPRI FV 

I I I I I I j I I i I I I I I II I I I I I I I ! I I I I I ! I I I i I I I I I i I I I M : - I i It I I I I I I I ! 
aOSl LSLNDVAEGLKGFSNIKGRLNVKSGIKGATLIDDTYNANPD3MKAAVDVLARMPAPRI FV 

310 320 330 340 350 360 



370 380 390 400 410 420 

m081 . pep MGDMGELGELGEDEAAAMHAEVGAYARDQGIEAAYFVGDNSVEAAEKFGADGLWFAAKDP 
i j M | I f I I M I I I I I i I I I! I i I I I I M I I I I I I I U I 1 I I I I I I t I ! I M I ! I i I 

a081 MGDMGELGE DE AAAMH AE VGA Y ARDQG I E AA Y FVG DM S VE AAE K FG ADG LW FAAK D P 

370 380 390 400 410 



430 440 450 

m08l . peo LIOVLRHDLPERATVLVKGSRFMQMEEWEALEDKX 

i ! ! I I I I I ! I I I j I I I I I I I I I I I I I I I ! I I I I I ! ! 
a081 LIQVLRHDLPERATVLVKGSRFMQMEEWEALEDKX 

420 430 440 450 



The following partial DNA sequence was identified in vY gonorrhoeae <SEQ ID 253>; 



g062 . seq 












1 


aTGTGGTTGT 


TGAAGTTGCC 


TGCCGTCGCC 


GAAACGGCAT 


CATCGCCGAA 


51 


ACGGCGGCGC 


AATACCGCAG 


CCAGCATCTC 


CTTCACCGTC 


GTCTTGCCGC 


101 


CCGAACCGGT 


AATGCCGAAC 


ACAAACGGGT 


TCACATTATC 


GCGCCACGCC 


151 


TTCGCCAACG 


TTTGCAACGC 


GGCAAGCGTG 


TCATCGACTT 


TCAACGCGCC 


201 


GCCCAAAGCC 


GCGCAATCTT 


CGCGCGAAAC 


CACAACCGCC 


GCCGCGCCCG 


251 


CAGACAATAC 


GCCTCCAACA 


AAATCATGCG 


CGTCAAACCG 


CCCGCCCGCC 


301 


AATGCGAAAA 


ACACATCGCC 


TTCCCGAATA 


TCGCGGCTGT 


CGGTTACGAT 


351 


GCGCGACACG 


GGTTTGTTTT 


CAGACGGCAT 


CGGAAGCTTG 


AGGGCTTGGC 


401 


AGATGAAATT 


TAGGTCCAGT 


GGTTTCATAT 


TTGCTTTCGT 


TAATATTCGG 


451 


GCGGCGGACA 


CATCGGTAGC 


GGCTGATTTT 


TTTATCGCCT 


GTTTTGCTGT 


501 


GGTAAAACAC 


AGATTATTTT 


CCCATTCTCA 


TTCGGCATTT 


TTTCTGTACG 


551 


TATCATTTTT 


TAGACGTATT 


TTTAGCCGAT 


TTGCCTTTTC 


CCGCATACCA 


601 


CGGCGCGGGG 


TCGTCGGACT 


GTCTGTCGAT 


AAAGGCAAGG 


TTATTGCCTT 


651 


CGCCCGGCAC 


ATCGGGGACA 


TTCCCCCAAA 


AATCATAGCC 


GTCATCGGGC 


701 


AACTCGTCGG 


TTTCGATAGC 


CGTCCAACTG 


CCGAATCCGG 


GTAA 



This corresponds to the amino acid sequence <SEQ ID 254; ORF 082.ng>: 

g082 .pep 

1 MWLLKLPAVA ETASSPKRRR NTAASISFTV VLPPEPVMPN TNGFTLSRHA 

51 FANVCNAASV SSTFNAPPKA AQSSRETTTA AAPADNTPPT KSCASNRPPA 

101 NAKNTSPSRI SRLSVTMRDT GLFSDGIGSL RAWQMKFRSS GFIFAFVNIR 

151 A ADTSVAADF FIACFAW KH RLFSHSHSAF FLYVSFFRRI FSRFAFSRIP 

201 RRGWGLSVD KGKVI AFARH IGDIPPKIIA V1GQLVGFDT RPTAESA* 

The following partial DNA sequence was identified in ;V. meningitidis <SEQ ID 255>: 

m082 . seq 

1 ATGnnGTTGT TGAAGTTGCC TGCCGTCGCC AACACGGCAT CATCGCCGAA 

51 ACGGCGGCGC AATACCGCAG CCAGCATTTC CTTCACCGTC GTCTTGCCGC 

101 CCGAACCGGT AATGCCGAAC ACAAACGGAT TCACATTTTC ACGCCACGCC 

151 TTTGCCAGCG TTTGCAATGC GGCAAGCGTG TCATCGACTT TCAACGCGCC 
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201 


ATCCATTGCA 


251 


CAGCCAATAC 


301 


AATGCGAAAA 


351 


GCGCGACACG 


401 


AGATGAAATT 


451 


GCGGCGGACA 


501 


GGTAAAACAC 


551 


TATCATTTTT 


601 


CGGCGCGGGG 


651 


CGCCCTGCAC 


701 


AACTCGTCGG 



CCCATTCTCA TTCGGsATTT TTTCTGTACG 



This corresponds to the amino acid sequence <SEQ ID 25b: ORF 082>: 

m082.pep 

1 MXLLKLPAVA NTASSPKRRR NTAASISFTV VLPPEPVMPN TNGFTFSRHA 

51 FASVCNAASV SSTFNAPSIA AQSSRETTTA AAPAANTSST KSCASNRSPA 

101 NAKNTSPARM SRLSVTMRDT GLLSDGIGSL RAWQMKFRSS GFIFTFVNIR 

151 A ADTSVAADF FIACFAW KH RLFSHSHSXF FLYVSFFRRI FSRFAFSRIP 

201 RRGWGQSVD KGKVIAFALH IGNIPPKIIA VIGQLVGFDT RPTAESA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 082 shows 92.7% identity over a 247 aa overlap with a predicted ORF (ORF 082.ng) 
from N. gonorrhoeae: 

m082/g082 

10 20 30 40 50 60 

m062 .pep MXLLKLPAVANTASSPKRRRNTAAS1SFTWLPPEPVMPNTNGFTFSRHAFASVCNAASV 

i I I I I I I i I - It I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I : I 1 I I I I - I I I I I I I 
g 0 8 2 MWLLKLPAVAETASSPKRRRNTAAS I S FTWLPPE PVMPNTNGFTLSRHAFANVCNAASV 

10 20 30 40 50 60 

70 80 90 100 110 120 

m082 . pep SSTFNAPSIAAQSSRETTTAAAPAANTSSTKSCASNRSPANAKNTSPARMSRLSVTMRDT 

Illllli lllllllllllllll II llllllll llllllllhhilllllllll 
g 0 8 2 SST FNAP P KAAQS S RE TTTAAAP ADNT P PT KS CASNR ? PANAKNT SPSRIS RLS VTMRDT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m082 . pep GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAWKKRLFSH3HSXF 

lhll!llllllllllllllll!hl!IMIIII!!MIIMIIIIIIIlMIIIIII 1 

g082 GLFSDGIGSLRAWQMKFRSSGFIFAFVNIRAADTSVAADFFIACFAWKHRLFSHSHSAF 
130 140 150 160 170 180 

190 200 210 220 230 240 

m082 .pep FLYVSFFRRI FSRFAFSRIPRRGWGQSVDKGKVIAFALHIGNIPPKI I AVIGQLVGFDT 

MiniMitiummtiimi iiiiiiiiui i 

g082 FLYVSFFRRIFSRFAFSRIPRRGWGLSVDKGKVIAFARHIGDIPPKI IAVIGQLVGFDT 

190 200 210 220 230 240 



m082.pep RPTAESAX 

llllllll 
g082 RPTAESAX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 257>: 

aQ82 . seq 

1 ATGTGGTTGT TGAAGTTGCC TGCCGTCGCC AAAACGGCAT TATCGCCGAA 
51 ACGGCGGCGC AATACCGCAG CCAACATTTC CTTCACCGTC GTCTTGCCGC 
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101 CCGAGCCGGT AATACCGAAC ACAAACGGGT TCACATTCTC GCGCCACGCC 

1 SI TTCGCCAACA TTTGCAACGC GGTAAGCGTG TCATCGACTT TCAACGCGCC 

201 ATCCATTGCA ACGCAATCTT CGCGCGAAAC CACAACCGCC GCCGCACCCG 

251 CAGCCAATAC GTCTTCAACA AAATCATGCG CATCAAACCG CCCGCCCGCC 

301 AATGCGAAAA ACACATCGCC CGCGCGGATG TCGCGGCTGT CGGTTACGAT 

351 GCGCGACACG GGTTTGCTTT CAGACGGCAT CGGAAGCTTG AGGGCTTGGC 

4 01 AGATGAAATT TA JGTCCAGT GGTTTCATAT T7ACTTTCGT TAATATTCGG 

4 51 GCGGCGGACA CATCGGTAGC GGCTGATTTT TTTATCGCCT GTTTTGCTGT 

5 01 GGTAAAACAC AGATTATTTT CCCATTCTCA TTCGGCATTT TTTCTGTACG 
551 TATCATTTTT TAGACGTATT TTTAGTCGAT TTGCCTTTTC CCGCATACCA 
601 CGGCGCGGGG TCGTCGGGCA GTCCGTCGAT AAAGGCAAGG TTATTGCCTT 
651 CGCCCTGCAC ATCGGGAACA TTCCCCCAAA AATCATAGCC GTCATCGGGC 
701 AACTCGTC^G TTTCGATACC CGTCCAACTG CCGAATCCGC GTAA 



This corresponds to the amino acid sequence <SEQ ID 258; ORF 082. a>: 

a082 .pep 

1 MWLLKLPAVA KTALSPKRRR NTAANISFTV VLPPEPVIPN TNGFTFSRHA 

51 FANICNAVSV SSTFNAPSIA TQSSRETTTA AAPAANTSST KSCASNRPPA 

101 NAKNTSPARM SRLSVTMRDT GLLSDGIGSL RAWQMKFRSS GFIFTFVNIR 

151 A ADTSVAADF FIACFAVV KH RLFSHSHSAF FLYVSFFRRI FSRFAFSR1P 

201 RRGVVGQSVD KGKVIAFALH IGNIPPKIIA VIGQLVGFDT RPTAESA* 



m082/a082 95.5% identity over a 247 aa overlap 

10 20 30 40 50 60 

m082 . pep MXLLKLPAVANTASSPKRRRNTAASISFTVVLPPEPVHPNTNGFTFSRHAFASVCNAASV 
I 11 I t I t I I : I I I I I I I M i I I * I 1 I M ) I I I I t I : I I I I I I I I ) t 1 I i 1 : : I I ; : i 1 
a082 LLKLPAVAKTALS PKRRRNTAAN I S FTVVLP PE PV I PNTNGFT FSRHAFAN I CN AV SV 

10 20 30 40 50 60 



70 80 90 100 110 120 

m082 . pep SSTFNAPSIAAQSSRETTTAAAPAANT5STKSCASNRSPANAKNTSPARMSRLSVTMRD7 
Ml M I I I i I : I ! I ! I I ! I I ! I f I I I i I I I I ! M I I I I I ! I M i I ! ! I I I I I I i I ! If I 
a082 SSTFNAPSIATQSSRETTTAAAPAANTSSTK5CASNRPPANAKNTS PARMSRLSVTMRDT 

70 80 90 100 110 120 



130 140 150 160 170 180 

m082 .pep GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAVVKHRLFSHSHSXr 
I I I I I I I I I I i ! I I I I I ! I I I I I I I I I I i I I I I I i I I I I I I I I I II I I I I I I II I i If I 
a082 GLLSDGIGSLRAWQMKFRSSGFIFTFVNIRAADTSVAADFFIACFAWKHRLFSHSHSAF 

130 140 150 160 170 180 



190 200 210 220 230 240 

m082 .pep FLYVSFFRRIFSRFAFSRIPRRGVVGQSVDKGKVIAFALHIGNIPPKI IAVIGQLVGFDT 

I t I I M I I I I I i I i M I t i [ I i i M I I I I I I 1 ! II I I \ I 11 I I I I I i I 1 t i I i t I I I II i 
a082 FLYVSFFRRI FSRFAFSRIPRRGVVGQSVDKGKVIAFALHIGMIPPKI IAVIGQLVGFDT 

190 200 210 220 230 240 



m082.pep RPTAESAX 
I I I I I I I I 



a082 RPTAESAX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 259>: 

g084 . seq 

1 ATGAAacaAT CCGcccgaat aAAAAATATG GATCAGACAT TAAAAAATAc 

51 attgggcatt tGCGCGcttt tagcctTTTG TTTTggcgcG gccaTCGCAT 

101 CAGGTTATCA CTTGGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGC 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GCTTCCCGCG 

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCTT CTTATCAGAT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

3 51 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 
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4 01 TTTGGAAATA TTGTGTATCT GTGGGGGTAT TTGCTGACGT AAAAAACTAT 

451 AAACGTCGCA GCAAAATATG GCTGACCATA TTATTGACTT TGATTTTGTC 

501 CTGCGCGGTG ATGGAGAAAA TCGccggcga taaAGATTGG CGAGaacctg 

551 atgccggcct gttgttgaat ATTTTcgacc cgtattaCga cttggctttc 

601 cgcgccggca cAATATGCCG CCAAGCGCGC CCAcatttcg gaagCagcaa 

651 aaaaagcgtC AACATGGCAt acccgccaac ttgcgcccaa gTAcaa 

This corresponds to the amino acid sequence <SEQ ID 260; ORF 084. ng>: 

g084 . pep 

1 MKQSARIKNM DQTLKNTLGI CALLAFCFGA AIASG YHLEY EYGYRYSAVG 

51 ALASWFLLL LA RGFPR VSS WLLIYVGTT ALYL PVGWLY GAPSYQIVGS 

101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTV WKYCVS VGVFADVKNY 

151 KRRSK IWLTI LLTLILSCAV ME KIAGDKDW REPLAGLLLN IFDLYYDLAF 

201 RAGTICRQAR PHFGSSKKSV NMAYPPTCAQ V* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 261 >: 

m084 . seq 

1 ATGAAACAAT CCGCCcGAAf AAAa . ATATG AATCAGACAT TACTTTATAC 

51 ATTGGGCATT TGCGCGCTTT TAACCTTTnn nnnnnnnnnn nnnnnnnnnn 

101 nnnnnTATCA CCCnGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGT 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GTTTCCCGCG 

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

251 TGCCGGTCGG CTGGCTGTAT GGTGCGCCGT CTTATCAGAT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

351 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 

401 TTTGGAAATA TTGTGTATCG GGGGGGGTAT TTGCTGACGT AAAAAACTAT 

4 51 AAACGCCGCA GCAAAATATG GCTGACTATA TTATTGACTT TGATTTTGTC 
501 CTGCGCGGTG ATGGATAAAA TCGCCAGCGA TAAAGATTTG CGAGAACCTG 

5 51 ATGCCGGCCT GTTGTTGAAT ATTTTCGACC TGTATTACGA TTTGGCT . TC 
601 CGCGCCGGCA CAATATGCCG CCAAGCGCGC CCACATTTTG GAAGCAGCAA 
651 AAAAAGCGTC AACATGGCAT ATCCGTCATG TTGCGCCCAA GTATAA 

This corresponds to the amino acid sequence <SEQ ID 262; ORF 084>: 

ru0 34 . pep 

1 MKQSARIKXM NQTLLYTLGI CALLTFXXXX XXXXX YHPEY EYGYRYSAVG 

51 ALASWFLLL LA RGFPR VSS WLLIYVGTT ALYL PVGWLY GAPSYQIVGS 

101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTV WKYCVS GGVFADVKNY 

151 KRRSK IWLTI LLTLILSCAV MD KIASDKDL REPDAGLLLN IFDLYYDLAX 

201 RAGTICRQAR PHFGSSKKSV NMAYPSCCAQ V* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 084 shows 90.5% identity over a 231 aa overlap with a predicted ORF (ORF 084.ng) 
from N. gonorrhoeae: 

m084/g084 



10 20 30 40 50 

m084 .pep MKQSARI KXMNQTLLYTLG I CALLTF YHPEYEYGYRYSAVGALASWFLLL 

llllllll hill milllM II llllllllllllllllllllll 

g0 84 MKQSARI KNMDQTLKNTLGICALLAFCFGAAIASGYHLEYEYGYRYSAVGAIASWFLLL 

10 20 30 40 50 60 



60 70 80 90 100 110 

m084 .pep LARGFPRVSSWLLIYVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL 

I I I I I I I II I I i I I I I I I I I II I I I I I I I I I I I ! I I I I I ! i I I I I I I i I I I I I I I i I I I I 
gO 84 LARGFPRVSSVVLLI YVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL 

70 80 90 100 110 120 



120 130 140 150 160 170 

m084 . pep YFVQALFFIFGLTWKYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL 

IIIIMIIIIIMIIIMII IMIIIIIIIimilllllMfllMMMIhlM 

g 0 8 4 Y FVQALFF I FGLTWJ KY CV S VGV FADV KN3YKRRSKI WLT I L LTL I L S CAVME K I AGDKDW 

130 140 150 160 170 180 
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180 190 200 210 220 

m084 .pep REPDAGLLLNIrDLYYDLAXRAGTICRQARPHFGSSKKSVNMAYFSCCAOVX 

llllllllMMillllll IMIIMIIIMIMIilllliill Mill 

g084 REPDAGLLLNI FDLYYDIAFRAGTICRQARPHFGSSKKSVNMAYPPTCAQVX 

190 200 210 220 230 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 263>: 

a084 . seq 

1 ATGAAACAAT CCGCCCGAAT AAAAAATATG GATCAGACAT TAAAAAATAC 

51 ATTGGGCATT TGCGCGCTTT TAGCCTTTTG TTTTGGCGCG GCCATCGCAT 

101 CAGGTTATCA CTTGGAATAT GAATACGGCT ACCGTTATTC TGCCGTGGGT 

151 GCTTTGGCTT CGGTTGTATT TTTATTATTA TTGGCACGCG GTTTCCCGCG 

201 CGTTTCTTCA GTTGTTTTAC TGATTTACGT CGGCACAACC GCCCTATATT 

2 51 TGCCGGTCGG CTGGCTGTAT GGTGCGCCGT CTTATCAGAT AGTCGGTTCG 

301 ATATTGGAAA GCAATCCTGC CGAGGCGCGT GAATTTGTCG GCAATCTTCC 

351 CGGGTCGCTT TATTTTGTGC AGGCATTATT TTTCATTTTT GGCTTGACAG 

4 01 TTTGGAGATA TTGTGTATCG GGGGGGGTAT TTGCTGACGT AAAAAACTAT 

4 51 AAACGCCGCA GCAAAATATG GCTGACTATA TTATTGACTT TGATTTTGTC 

501 CTGCGCGGTG ATGGATAAAA TCGCCAGCGA TAAAGATTTG CGAGAACCTG 

551 ATGCCGGCCT GTTGTTGAAT ATTTTCGACC TGTATTACGA TTTGGCTTCC 

601 . GCGCCGGCA CAATATGCCG CCAAGCGCGC CCACATTTTG GAAGCAGCAA 

651 AAAAAGCGTC AACATGGCAT ATCCGTCATG TTGCGCCCAA GTATAA 

This corresponds to the amino acid sequence <SEQ ID 264; ORF 084. a>: 

a084 .pep 

1 MKQSARIKNM DQTLKNTLGI CALLAFCFGA AIASG YHLEY EYGYRYSAVG 

51 ALASWFLLL LA RGFPR VSS WLLIYVGTT ALYL PVGWLY GAPSYQIVGS 

101 ILESNPAEAR EFVGNLP GSL YFVQALFFIF GLTV WRYCVS GGVFADVKNY 

151 KRRSK IWLTI LLTLILSCAV MD KIASDKDL REPDAGLLLN IFDLYYDLAS 

201 XAGTICRQAR PHFGSSKKSV NMAYPSCCAQ V + 

mO84/a084 92.2% identity over a 23 1 aa overlap 

10 20 30 40 50 60 

m0 8 4 .pep MKQSARIKXMNQTLLYTLGICALLTFXXXXXXXXXYKPEYEYGYRYSAVGALASVVFLLL 
I i , i ) I I I : i I I 1)111111:1 11 1 I i I 1 1 1 1 1 1 M M 1 I 11 1 ! M 

a084 MKQSARIKNMDQTLKNTLGICALLAFCFGAAIASGYHLEYEYGYRYSAVGALASWFLLL 

10 20 30 40 50 60 

70 80 90 100 110 120 

m084 .pep IARGFPRVSSVVLLIYVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL 

I I! 1 1 1 1 1 it 1 1 II 1 i I 1 1 1 1 1 1 1 11 i 1 1 I ! 1 1 1 1 I M 1 M 1 1 I 1 1 M 1 11 1 1 1 I 1 t 1 f 1 
a08 4 LARGFPRVSSVVLLIYVGTTALYLPVGWLYGAPSYQIVGSILESNPAEAREFVGNLPGSL 

70 80 90 100 110 120 

130 140 150 160 170 ISO 

m084 .pep YFVQALFFIFGLTVWKYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL 
M 1 1 1 1 1 1 1 11 i I I 1 : ) i i I 1 1 11 1 1 1 1 1 I M t 1 1 II 1 1 1 1 1 1 M 1 1 1 I it i I I I 1 I t 1 1 
a08 4 YFVQALFFIFGLTVWRYCVSGGVFADVKNYKRRSKIWLTILLTLILSCAVMDKIASDKDL 

130 140 150 160 170 180 

190 200 210 220 230 

m084 .pep REPDAGLLLNI FDLYYDLAXRAGT I CRQARPHFGSSKKSVNMAYPSCCAQVX 

II 1 11 I 1 1 t 1 1 t 1 1 11 i ( 1 1 1 1 1 1 1 t 1 1 M 1 1 1 1 1 ! I 1 11 1 I 1 11 1 1 1 1 I 
a084 REPDAGLLLNI FDLYYDLASXAGTICRQARPHFGSSKKSVNMAYPSCCAQVX 

190 200 210 220 230 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 265>: 

g085 . seq 

1 ATGGGCAAAG GGCAGGACTT CACGCCCCTG CGCGACGCGT TGAAAGATAA 
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SI GGCAAAAGGC GTGTTCCTGA TCGGCGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGGCTTG AACCTGACCG ACTGCGTCAC TTTGGAAGAG 

151 GCGGTTCAGA CGGCATACGC CCAAGCCGAA GCGGGCGATA TTGTCTTGCT 

2 01 CAGCCCCGCC TGCGCGAGTT TCGATATGTT TAAAGGCTAC GCGCACCGTT 

251 CGGAAGTGTT tatCGAAGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 266; ORF 085. ng>; 

g085 .pep 

1 MGKGQDFTPL RDALKDKAKG VFLIGVDAPQ IRRDLDGCGL NLTDCVTLEE 
51 AVQTAYAOAE AGDIVLLSPA CASFDMFKGY AHRSEVFIEA FKAL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 267>: 

m085 . seq 

1 ATGGGTAAAG GGCAGGACTT CACGCCCCTG CGCGAPGCAC TGGTAGGCAA 

51 GGCAAAAGGC GTGTTCTTGA TTGGTGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGGCTTG AATATGACCG ACTGCGCCAC TTTGGGAGAA 

151 GCCGTTCAGA CGGCATATGC CCAAGCCGAA GCAGGCGATA TTGTGTTGCT 

201 CAGCCCCGCC TGCGCGAGCT TTGATATGTT CAAAGGCTAC GCGCACCGTT 

251 CGGAAGTGTT TATCGAAGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 268; ORF 085>: 

m085 .pep 

1 MGKGQDFTPL RDALVGKAKG VFLIGVDAPQ IRRDLDGCGL NMTDCATLGE 
51 AVQTAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIEA FKAL* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 085 shows 94.7% identity over a 94 aa overlap with a predicted ORF (ORF 085. ng) 
from N. gonorrhoeae: 

m085/g085 



m08 5 . pep 
g085 



m085 -pep 
g085 

70 80 9 ; 



10 20 30 40 50 60 

MGKGQD FTPLRDALVG KAKGVFL I GVDAPQ I RRDLDGCGLNMTDCATLGEAVQTAYAQAE 

IIIMIIIIIIIIi llll!llllllllllll!l!!lllhllh!l lllllllllll 

MGKGQDFTPLRDALKDKAKGVFLIGVDAPQIRRDLDGCGLNLTDCVTLEEAVQTAYAQA£ 
10 20 30 40 50 60 

70 80 90 

AGDIVLLS PACAS FDMFKG YAHRSEVFI EAFKALX 

IMIIIIItllMlllllllllimillllllM 

AGD IVLLS PACAS FDMFKG YAHRS E VF I EAFKALX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 269>: 

a085 . seq 

1 ATGGGCAAAG GGCAGGACTT CACGCCCCTG CGCGACGCGC TTGCCGGCAA 

51 GGCAAAAGGC GTGTTCCTGA TCGGTGTCGA TGCGCCGCAA ATCCGCCGCG 

101 ATTTGGACGG CTGCGATCTG AATATGACCG ACTGCGCCAC TTTGGAAGAA 

151 GCGGTTCAGA AGGCATATGC CCAAGCCGAA GCGGGCGATA TCGTGCTGCT 

201 CAGCCCCGCC TGCGCGAGTT TCGATATGTT TAAAGGCTAC GCGCACCGTT 

2 51 CGGAAGTGTT TATCGGGGCG TTTAAGGCTT TGTGA 

This corresponds to the amino acid sequence <SEQ ID 270; ORF 085. a>: 

a085 .pep 

1 MGKGQDFTPL RDALAGKAKG VFLIGVDAPQ IRRDLDGCDL NMTDCATLEE 
51 AVQKAYAQAE AGDIVLLSPA CASFDMFKGY AHRSEVFIGA FKAL* 

m085/a085 94.7% identity over a 94 aa over, p 



10 20 30 40 50 60 

m08 5 . pep MGKGQDFT PLRDALVGKAKGVFL I GVDAPQ I RRDLDGCG LNMT DCAT LGEAVQT AY AQAE 
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II I I M i I i I I M I : I I I i I 1 ' ' I t I I I I i I I I I I I I I I I i I I I H I 111! (Mill 
a085 MGKGQDFTPLRDALAGKAKGVFLIGVDAPQIRRDLDGCDLNMTDCATLEEAVQKAYAQAE 

10 20 30 AO 50 60 

70 80 90 

mO S 5 . pep AG D I VLLS PACAS FDM FKGYAHRS EV FI EAFKALX 

I I ! ! J I i ! i I i I I I I i f I I i I If ) M I I I J . I I I 
a08 5 AGDI VLLS PACAS FDMFKGYAHR3EVFIGAFKALX 

70 80 90 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 271 >: 

g086 . seq 

1 ATGGTGGTGC TGATGACGGC GTTCGGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCGAAGG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GGCAGGCGGG GTTCGTCGTT GCCGGCCTTA TAGCGAGCGG TTTTTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CTTATCCGGC CTGTTGCTGG TAGCCGTATT GATTGCCGGG CGCGAAATCA 

2 SI ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACC 

3 01 GAGCTGTTCA AGCTGGCAGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 

3 51 CCGTGAAGAA GTGTTGCGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 

401 GGCGGGGGAC GGCCAACCTG ATTATGTCCG CCACCAATCC GCAGGCACGT 

451 CGTGAAACAT TAGAAATGTA CGgcCGTTTC CGGGCGATCA TCCTGCCGAT 

501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 

551 GTTCGTTTGT CGTCATTACC GTCATTACCG TTGGAATGCT GTTTCTGGCA 

601 GGATTGCCGT GGAAATATTT TTTTGTCCTG GTAGGCAGCG TCTTGGGTGG 

6 51 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 

701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 

751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGOTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 

851 TTTTTGCGAT CATCGCTGAA GAATTCGGCT TCTTCGGGAT GTGCGTGCTG 

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 

951 GTCGCGCGAT TTGGGtttgA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGTATCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA CCAAAGGTCT GACGctgCcg tTGATGTCCT ATGGcggTTC 

1101 GTCAGTCTTT TTCATGCTGA TCAGCATGAT GCTGCTGTTG CGTATCGATT 

1151 ATGAAAACCG CCAGAAAATG CGCGGTTACC GGGTGGAGTA AA 

This corresponds to the amino acid sequence <SEQ ID 272; ORF 086.ng>: 

g086 . pep 

1 MWLMTAFGL LMI YSASV YL ASKEGGDQFF YLTRQA GFW AGLIASGFLW 

51 FLCRMRTWRR L VPWIFALSG LLLVAVLI AG REINGATRWI PLGPLNFQPT 

101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQAR 

151 RETLEMYGRF RA IILPIMLV AFGLVLIMV Q PDFGS FWIT VITVGMLFLA 

2 01 GLPWKY FFVL VGSVLGGMVL MIT AAPYRVQ RWAFLDPWK DPQGAGYQLT 
251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAI IAE EFGFFGMCVL 
301 IFCYGWLWR AFSIGKQSRB LGLTFNAYIA SGIGIWIGIQ SF FNIGVNIG 

3 51 ALPTKGLTL P LMSYG GSSVF FMLISMMLLL RI DYENRQKM RGYRVE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 273>: 

m086 . seq 

1 ATGGTGGTGC TGATGACGGC GTTCAGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCAAAAG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GACAGGCGGG GTTCGTCGTT GCCGGCTTGA TAGCGAGCGG TTTGTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CCTATCCGGC CTGTTGCTGG TAGTCGTATT GATTGCCGGG CGCGAAATCA 

2 51 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACC 
301 GAGCTGTTCA AGCtGGCGGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 

3 51 CCGTGAAGAA GTGTTGcGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 
401 GGCGGGGGAC GGCCAATCTG ATCATGTCCG CCACCAATCC GCAGrCACGT 

4 51 CGTGAaACAT TAGAAATGTA CGGCCGTwTC CGGGCGATCA TCCTGCCGAT 

5 01 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 
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551 GTTCGTTTGT CGTCATTACC GTCATTGCCG TTGGAATGCT GTTTTTGGCA 

601 GGATTGCCGT GGAAATATTT TTTCGTCCTG GTAGGCAGCG TCTTGGGCGG 

651 GATGGTGCTG ATGATTACCG CCGCTCCCTA GCGTG7GCAG CGGGTAGTGG 

701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA CCAGCTTACC 

751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 

B51 TTTTTGCCAT CATCGCCGAA GAATTCGGTT TCTTCGGTAT GTGCGTGCTG 

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 

951 GTCGCGCGAT TTGGGTTTGA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGkrTCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA mCAAAgGyCT GACGCyGCCG Tc . AtGTCCw ATGGCGGTTC 

1101 GTCAGTCTTT TTCATGCTGA TCAGCATGAT GCT jCTGTkG CGTATAGATT 

1151 ATGAAAACCG CCGGAAAATG CGCGGTTATC GGGTGGAGTA A 

This corresponds to the amino acid sequence <SEQ ID 274; ORF 086>: 

m086 . pep 

1 MWLMTAFSL LMIYSASV /L ASKE3GDQFF YLTRQA GFW AGLIASGLLW 

51 FLCRMRTWRR L VPWIFALSG LLLVWLI AG REINGATRWI PLGPLNFQPT 

101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQXR 

151 RETLEMYGRX RA IILPIMLV AFGLVLIMV Q PDFG5 FWIT VIAVGMLFLA 

201 GLPWK YFFVL VGSVLGGKVL MIT AAPYRVQ RWAFLDPWK DPQGAGYQLT 

251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFI FAIIAE EFGFFGMCVL 

301 I FCYGWLWR AFSIGKQSRD LGLTFNAYIA SGIGIWIGXQ SF FNIGVNIG 

3 51 ALPXKGLTX P XMSXG GS5VF FMLISMMLLX RI DYENRRKM RGYRVE* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 086 shows 96.7% identity over a 396 aa overlap with a predicted ORF (ORF 086.ng) 
from N. gonorrhoeae: 

m086/g0B6 



10 20 30 40 50 60 

m0 86 . pep MVVLMTAFSLLMIYSASVYLASKEGGDQFFYLTRQAGFVVAGLIASGLLWFLCRMRTWRR 

IIIIIMhMliillllillllllMllllilllilNilMIIMIIIIIIIIIII 

g086 MWLMTAFGLIiMIYSASVYLASKEGGDQFFYLTRQAGFWAGLIASGFLWFLCRMRTWRR 
10 20 30 40 50 60 



70 80 90 100 110 120 

m086 .pep LVPWIFALSGLLLVWLIAGREINGATRWI PLGPLNFQPTELFKLAVILYLASLFTRREE 

llMIIIII!ll!hlllMMIIi!ll!l!l!l!lll!lllllllllllll!>lll(ll 

g086 LVPWIFALSGL.LLVAVLIAGREINGATRWIPLGPLNFQPTELFKLAVILYIASLFTRREE 
70 80 90 100 110 120 



130 140 150 160 170 180 

m086 .pep VLRSMESLGWQSIWRGTANLIMSATNPQXRRETLEMYGRXRAIILPIMLVAFGLVLIMVQ 

mimmimimmiiiiiii iiiiiiiiii iimimiiiiimm 

g086 VLRSMESLGWQSIWRGTANLIMSATNPOARRETLEMYGRFRAIILPIMLVAFGLVLIMVQ 
130 140 150 160 170 180 



190 200 210 220 230 240 

m086 .pep PDFGSFWITVIAVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRWAFLDPWK 

IIIIIIMlilhllllllllilllMMillllMlllMMIillMIIIMMMI! 

g08 6 PDFGSFVVITVITVGMLFLAGLPWKYFFVLVGSVLGGMVLMITAAPYRVQRVVAFLDPWK 
190 200 210 220 230 240 



250 260 270 280 290 300 

m0 86 .pep DPOGAGYOLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFIFAIIAEEFGFFGMCVL 

IIIMIMIMIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIMIIIIIIMIIIIII 

g0 8 6 DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFIFAIIAEEFGFFGMCVL 
250 260 270 280 290 300 

310 320 330 340 350 360 
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m086 .DeD IFCYGWLWRAFSIGKQSRDLGLTFNAYIASGIGIWIGXQSFFNIGWIGAL.PXKGL^XP 
iliilllllilMIIIIIIIIIIIIKIIillHilli IMIKilllillhllli i 
g0 8 6 IFCYGWLWRAFSIGKQSRDLGLTFNAYIASGIGIWIGIQSF FNIGVNIGALPTKGLTLP 

310 320 330 340 350 360 



370 380 390 

rr,0 8 6 . pep XMSXGGSSVFFMLISMMLLXRIDYENRRKMRGYRVEX 

i I 111111111111111 IIIMIhlilllllll 
g 0 8 6 LMS YGGSSVFFMLI SMMLLLRI DYENRQKMRG YRVEX 

370 380 390 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 275>: 

a08 6 . seq 

1 ATGGTGGTGC TGATGACGGC GTTCAGCCTG CTGATGATTT ATTCGGCTTC 

51 TGTGTATTTG GCATCAAAAG AAGGCGGCGA TCAGTTTTTC TATTTGACCA 

101 GACAGGCGGG GTTCGTCGTT GCCGGCTTGA TAGCGAGCGG TTTGTTATGG 

151 TTTCTTTGCA GGATGAGGAC ATGGCGGCGG CTTGTGCCGT GGATTTTTGC 

201 CCTATCCGGC CTGTTGCTGG TAGTCGTATT GATTGCCGGG CGCGAAATCA 

251 ATGGCGCGAC CCGTTGGATA CCTTTGGGTC CGTTGAATTT CCAGCCGACC 

301 GAGCTGTTCA AGCTGGCGGT CATCCTTTAT TTGGCAAGCC TGTTCACGCG 

351 CCGTGAAGAA GTGTTGCGCA GCATGGAAAG TTTGGGTTGG CAGTCGATTT 

4 01 GGCGGGGGAC GGCCAATCTG ATCATGTCCG CCACCAATCC GCAGGCACGT 

4 51 CGTGAAACAT TAGAAATGTA CGGCCGTTTC CGGGCGATCA TCCTGCCGAT 

501 TATGCTGGTG GCGTTCGGTT TGGTGCTGAT AATGGTACAG CCGGATTTCG 

551 GTTCC^TTGT CGTCATTACC GTCATTGCCG TTGGAATGCT GTTTTTGGCA 

601 GGATTGCCGT GGAAATATTT TTTCGTCCTG GTAGGCAGCG TCTTGGGCGG 

651 GATGGTGCTG ATGATTACCG CCGCTCCCTA CCGTGTGCAG CGGGTAGTGG 

701 CATTTTTGGA CCCGTGGAAA GACCCGCAGG GTGCCGGCTA C2AGCTTACC 

751 CACTCTCTGA TGGCAATCGG GCGCGGAGAG TGGTTCGGTA TGGGTTTGGG 

801 TGCGAGTTTG AGCAAACGCG GCTTTCTGCC GGAAGCGCAT ACCGATTTTA 

851 TTTTTGCCAT CATCGCCGAA GAATTCGGTT TCTTCGGTAT GTGCGTGCTG 

901 ATATTCTGTT ACGGCTGGCT GGTGGTGCGG GCGTTTTCCA TCGGCAAGCA 

951 GTCGCGCGAT TTGGGTTTGA CTTTCAACGC CTATATCGCT TCGGGTATCG 

1001 GCATTTGGAT CGGTATCCAA AGTTTCTTCA ATATCGGTGT GAACATCGGT 

1051 GCTTTGCCGA CCAAAGGTCT GACGCTGCCG TTGATGTCCT ATGGCGGTTC 

1101 GTCAGTCTTT T7CATGCTGA TCAGCATGAT GCTGCTGTTG CGTATAGATT 

1151 ATGAAAACCG CCGGAAAATG CGCGGTTACC GGGTGGAGTA A 

This corresponds to the amino acid sequence <SEQ ID 276; ORF 086. a>: 

a086 .pep 

1 M WLMTAFSL LMIYSASV YL ASKEGGDQFF YLTRQA GFVV AGLI ASGLLW 

51 FLCRMRTWRR L VPWIFAL5G LLLVWLI AG REINGATRWI PLGPLNFQPT 

101 ELFKLAVILY LASLFTRREE VLRSMESLGW QSIWRGTANL IMSATNPQAR 

151 RETLEMYGRF RA IILPIMLV AFGLVLIMV Q PDFG5 FWIT VIAVGMLFLA 

201 GLPWKY FFVL VGSVLGGMVL MIT AAPYRVQ RVVAFLDPWK DPQGAGYQLT 

251 HSLMAIGRGE WFGMGLGASL SKRGFLPEAH TDFIFAIIAE EF GFFGMCVL 

301 IFCYGWLVV R AFSIGKQSRD LGLTFNAYIA SGIGIWIGIQ SFFNIGVNIG 

351 ALPTKGLTLP LMSYGGSSVF FMLISMMLLL RIDYENRRKM RGYRVE * 



m086/a086 98.0% identity over a 396 aa overlap 



10 20 30 40 50 60 

m08 6 . pep MVVLMTAFSLLMIYSASVYLASKEGGDQFFYLTRQAGFVVAGLIASGLLWFLCRMRTWRR 
I I I i I I I ! I I II I I i I I f I I I I I f I I I I II J I I I I I I I I i I i i I I | I | I | I | | | | | | | [ | 
a08 6 MVVLMTAFSLLM I YS AS VYLASKEGGDQFFYLTRQAGF^/VAGL IAS GLLW FLCRMRTWRR 

10 20 30 40 50 60 



70 80 90 100 110 120 

m08 6.pep LVPWIFALSGLLLVVVLIAGREINGATRWIPLGPLNFQPTELrKLAVILYLASLFTRREE 
I i I I I I I I I I I I I I M M I I I I I II I I I I I I I j M I I 1 ! i I I I I I I i M I I I ! I I I I I ! I 
a03 6 LVPWI FALSGLLLVVVLIAGRE INGATRWI PLGPLNFQPTELFKLAVT LYLASLFTRREE 



WO 99/57280 



PCT/US99/09346 



277 



70 80 90 100 110 120 

130 140 150 160 170 180 

m086.pep V LR SME S L G WQS I W RGTAN LI MS ATNPQX RRETLEMYGRX RA 1 1 L P I M LV A FG L V L I M V Q 

I ( I I I I I If i ( I I I H i 1 I I I I M M I I ( I ( I I 1 1 1 I I i I I I M I ! I I I I II ( I I i I I 
a 06 6 VLRSMESLGWQSIWRGTANLIMSATNPQARRETLEMYGRFRAIILPIMLVAFGLVLIMVQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

m08 6. pep PDFGSFVVITVIAVG^3L■FLAGLPWKYFFVLVGSVLGGMV T LMITAAPYRVQRVVAFLDPWK 
I I I I I I I f I I I I I I I I I I I I f ! I I ! I I M I I I I I I I I I M I I II I I I I I I 1 II I I I I 1 I I 
a086 PDFGS FVV ITVIAVGML FLAG LPWKYFFVLVG SVLGC M' 'LMITAAPY RVQRWA FLDPWK 

190 :00 210 220 230 240 

250 260 270 280 290 300 

m08 6.pep QPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAKTDFI FAI IAEEFGFFGMCVL 

I I i I I It II I I 1 ! I I I I I I I I I I II i I I I I I I f i I I I I I I I M I I I I I i M I ! i I I I I 1 I 
a08 6 DPQGAGYQLTHSLMAIGRGEWFGMGLGASLSKRGFLPEAHTDFI FAI IAEEFGFFGMCVL 

250 260 270 280 290 300 

310 320 330 340 350 360 

m08 6.pep IFCYGWLVVRAFSIGKQ^RDLGLTFNAYIASGIGIWIGXQSFFNIGVNIGALPXKGLTXP 
! I SI I M i I I I N I M I ( M I I i H M 1 I I I I if I I M I I I M 1 M I I i I ! I : I i I I i 
a08 6 IFCYGWLVVRAFSIGKQSRDLGLTFNAYIASGIGIWIGIQSFFNIGVMIGALPTKGLTLP 

310 320 330 340 350 360 

370 380 390 

m08 6.pep XMSXGGSSVFFMLISMMLLXRIDYENRRKMRGYRVEX 
II I I I i I I I I I I I I I I I i M I II I I I I I I I II I I 
a08 6 LMSYGGSSVFFMLISMMLLLRIDYENRRKMRGYRVEX 

370 380 390 

The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 277>: 

g087 . seq 

1 ATGGGCGGTA AAACCTTTAT 

51 TTTCCCAGCT CTGGCTGTGG 

101 TAATTTGGCT GGGCAGCAAG 

151 TACGGCATAC GCTTGGAAAC 

2 01 CATCAAACGC AAGCTGATGC 

2 51 AAGCGCAGCG GATTATCCGC 

301 GGCGGTTTTG TTACCTTTCC 

351 GCCGATTGTG ATTCACGAGC 

401 AccTGTCGCg ctGGGCGAAA 

4 51 AGCCACGAAG GCGGTTTGGT 
501 CCTGCCCGTG CCTGCCGAAC 

5 51 TTTTGGTGGT CGGCGGCAGT 

6 01 CCGCAGGCGT TGGCACTGCT 
6 51 CCAGTCGGGG CGTAACAAGC 
701 TGGGCGTGAA AGCGGAATGC 
751 TACCGTGATG CCGATTTGGT 
801 CGAGTTGACG GCGGCGGGGC 
851 CCGTTGATGA CCATCAAACC 
901 GCGGGGCTGC TGTTGCCGCA 
951 AATCCTCGGC AGCCTCAACC 

1001 CCCGTACGTT GGCATTGCCG 
1051 ATTGCGTGTG CGGCGTAAA 

This corresponds to the amino acid sequence <SEQ ID 278; ORP 087.ng>: 

g087 .pep 

1 MGGKTFMLMA GGTGGHIFPA LAV, -jSLRVR GHHVIWLGSK DSMEERIVPQ 

51 YGIRLETLAI KGIRGMGI KR KLMLPFTLYK TVREAQRIIR KKRVE CVIGF 

101 GG FVTFPGGL AA KLLGVPIV IHEQNAVAGL SNRKLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILWGGS LGADVLNKTV 



GCTGATGGCG GGCGGAACGG GCGGACACAT 
CGGATTCATT GCGCGTGCGC GGTCATCATG 
GATTCGATGG AAGAGCGCAT CGTGCCGCAA 
GCTGGCGATT AAAGGAATAC GCGGCAACGG 
TTCCGTTTAC TCTGTACAAA ACCGTCCGCG 
AAACACCGTG TCGAGTGCGT CATCGGCTTC 
CGGCGGTCTG GCGGCGAAAC TCTTGGGCGT 
AAAACGCCGT GGCAGGCTTG TCCAACCGCC 
CGGGTGTTGT ACGCTTTTCC GAAAGCGTTC 
CGGCAACCCC GTCCGCGCCG ATATTAGCAA 
GCTTCCAAGG GCGCGAAGGC CGTCTGAAAA 
TTGGGTGCGG ACGTTTTGAA CAAAACCGTA 
GCCTGAAGAG GTGCGCCCGC AGATGTACCA 
TGGGCAATCT TCAGGCGGAT TATGACGCGT 
GTGGAATTTA TTACCGACAT GGTGTCCGCC 
GATTTGCCGT GCCGGCGCGC TGACGATTGC 
TGGGCGCGTT GTTAGTGCCG TATCCTCACG 
GCCAACGCGC GTTTCATGGT GCAGGCAGAA 
AACCCAGTTG ACGGCGGAAA AACTCGCCGA 
GCGAAAAATG CCTCAAATGG GCGGAAAACG 
CACAGCGCGG ATGACGTTGC CGAAGCCGCG 
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1 


ATGGGCGGTA 


AAACCTTTAT 


GCTGAwkkCG 


51 


TTTCCCCGCG 


CTGGCGGTGG 


CGGATTCATT 


101 


TGATTTGGCT 


GGGCAGCAAG 


GATTCGATGG 


151 


TACGGCATAC 


GCTTGGAAAC 


GCTGGCGATT 


201 


CATCAAACGC 


AAACTGATGC 


TGCCGGTTAC 


251 


AAGCGCAGCG 


GATTATCCGC 


AAACACCGTG 


301 


GGCGGCTTCG 


TTACCTTCCC 


CGGCGGTTTG 


351 


GCCGATTGTG 


ATTCACGAGC 


AAAACGCCGT 


401 


ACCTGTCGCG 


CTGGGCGAAG 


CGGGTGTTGT 


451 


AGCCACGAAG 


GCGGCTTGGT 


CGGCAACCCC 


501 


CCTGCCCGTG 


CCTGCCGAAC 


GCTTCCAAGG 


551 


TTTTGGTGGT 


CGGCGGCAGT 


TTGGGCGCGG 


601 


CCGCATGCAT 


TGGCTTTGCT 


GCCCGACAAT 


651 


CCAATCGGGA 


CGGGGCAAGC 


TGGGCATCTT 


701 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


751 


nnnGCGGGAT 


TGGGTGCGTT 


GTTAGTGCCG 


801 


TCACCAAACC 


GCCAACGCGC 


GTTTTATGGT 


851 


TGTTGCCGCA 


AACCCAGTTG 


ACGGCGGAAA 


901 


GGCTTAAACC 


GCGAAAAATG 


CCTCAAATGG 


951 


GGCACTGCCG 


CACAGTGCGG 


ACGACGTGGC 


1001 


CGGCGTAA 







2 01 PQALALLPEE VRPQMY^ n SG RMKLGNLQAD YDALGVKAEC VEFITDMVSA 
251 YRDADLVI CR AG ALTIAELT AAGLGALLV P YPHAVDDHQT ANARFMVQAE 
301 AGLLLPQTQL TAEKLAEILG SLNREKCLKW AENARTLALP HSADDVAEAA 

3 51 IACAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 279>: 

m087 . seq 

GGCGGAACGG GCGGACATAT 
GCGCGCGCGC GGCCATCATG 
AAGAGCGTAT CGTGCCGCAA 
AAAGGCGTGC GCGGCAACGG 
TTTGTATCAA ACCGTCCGCG 
TCGAGTGCGT CATCGCCTTC 
GCGGCGAAGC TATTArGCGT 
GGCAGGTTTG TCCAACCGCC 
ACGCTTTTCC GAAAGCGTTC 
GTCCGCGCCG ATATTAGCAA 
GCGTGAAGGC CGTCTGAAAA 
ACGTTTTGAA CAAAACCGTA 
GCGCGTCCGC ATATGTACCA 
GCAGGCGnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn 
TATCCTCACG CGGTTGACGA 
GCAGGCGGAG GCGGGATTGC 
AACTCGCCGA GATTCTCGGC 
GCAGAAAACG CCCGTACGTT 
GGAAGCCGCG ATTGCGTGTG 

This corresponds to the amino acid sequence <SEQ ID 280; ORF 087>; 

m087 . pep 

1 MGGKTFMLXX GGTGGHIFPA LAVADSLRAR GHHVIWLGSK DSMEERIVPQ 

51 YGIRLETLAI KGVRGNGIKR KLMLPVTLYQ TVREAQRIIR KHRVE CVIGF 

101 GGFVTFPGGL AA KLLXVPIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILWGGS LGADVLNKTV 

201 PHALALLPDN ARPHMYHQSG RGKLGILQAX XXXXXXXXXX XXXXXXXXXX 

2 51 XAGLGALLV P YPHAVDDHQT ANARFMVQAE AGLLLPQTQL TAEKLAEILG 

301 GLNREKCLKW AENARTLALP HSADDVAEAA IACAA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 087 shows 83.9% identity over a 355 aa overlap with a predicted ORF (ORF 087.ng) 
from N. gonorrhoeae: 

m087/g087 

10 20 30 40 50 60 

m087 .pep MGGKTFMLXXGGTGGHIFPALAVADSLRARGHH^/IWLGSKDSMEERIVPQYGIRLETLAI 

1 1 1 1 i II I 1 1 1 1 1 1 1 1 ! I M 1 1 1 1 1 1 • 1 1 II 1 1 1 1 1 1 1 1 II I M 1 1 1 1 1 1 1 1 i 1 1 1 1 1 

g087 MGGKTFMLMAGGTGGHIFPALAVADSLRVRGHHVIWLGSKDSMEERIVPQYGIRLETLAI 
10 20 30 40 50 60 

70 80 90 100 110 120 

m0B7 .pep KGVRGNGIKRKL.MLPVTLYQTVREAQRI IRKHRVECVIGFGGFVTFPGGLAAKLLXVPIV 

Ihtllimiilll llhllllllilllllllllMIIIMIIMIIIIIIII III) 

g 0 8 7 KG I RGNG I KRKLMLPFTLYKTVREAQRI I RKHRVECVI GFGGFVTF PGGLAAKLLGVP I V 

70 80 90 100 110 120 

130 140 150 160 170 180 

m087 . peo IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGHPVRADISKLPVPAERFQGREG 

IIMlllillinilllllllllllllllllllllllllilMMlllllllllllllll 

g0 8 7 IKEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADISNLPVPAERFQGREG 
130 140 150 160 170 180 
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190 200 210 220 229 
m0 87 . pep RLKILWGGSLGADVLNKTVPHALALLPDNARPKMYKOSGRGKLGILQA 

illlllliliillMIIIIII:|lllih-ll:fll!ilhill II! 

g03 7 RLKILWGGSLGADVLNKTVPQALALLPEEVRPQMYKOSGRNKIiGNLQADYDALGVKAEC 
190 200 210 220 230 240 

230 240 250 

m08 7 . pep AGLGALLVPYPHAVDDHQTANARFMVQAE 

llllillMIIIIMIIIilllllMill 
g08 7 VEFITD1WSAYRDADLVICRAGALTIAELTAAGLGALLVPYPKAVDDHQTANARFMVQAE 
250 260 270 280 290 300 



260 270 280 290 300 310 

m0 8 7 . peD AGLLLPQTQLTAEKLAEILGGLNREKCLKMAENARTIJUjPHSADDVAEAAIACAAX 

lllll)l!llllll!llllhlllllllM!li!tlllllll(llllllllll!ll 
gOS7 aglllpqtqltaekl;.zilgsl:irekclkwaenartlalphsaddvaeaaiacaax 

310 320 330 340 350 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 28 1>: 

a087 . seq 

1 ATGGGCGGTA AAACCTTTAT GCTGATGGCG GGCGGAACGG GCGGACATAT 

51 TTTCCCCGCG CTGGCGGTGG CGGATTCATT GCGCGCGCGC GGCCATCATG 

101 TAATTTGGCT GGGCAGCAAG GATTCGATGG AAGAGCGCAT CGTGCCGCAA 

151 TACGACATCC TGCTCGAAAC GCTGGCGATT AAAGGCGTGC GCGGCAACGG 

201 CATCAAACGC AAGCTGATGC TGCCGTTTAC TTTGTATCAA ACTGTCCGCG 

2 51 AAGCGCAGCA GATTATCCGC AAACACCGTG TCGAGTGCGT CATCGGCTTC 

301 GGCGGCTTCG TTACCTTTCC CGGCGGTTTG GCGGCGAAGT TATTAGGCGT 

351 GCCGATTGTG ATTCACGAGC AAAACGCCGT GGCAGGTTTG TCCAACCGCC 

4 01 ACCTGTCGCG CTGGGCGAAG CGGGTGTTGT ACGCTTTTCC GAAAGCGTTC 

4oi AGCCACGAAG GCGGCTTGGT CGGCAACCCC GTCCGCGCCG ATATTAGCAA 

501 CCTGCCCGTG CCTGCCGAAC GCTTCCAAGG GCGTGAAGGC CGTCTGAAAA 

551 TTTTGGTGGT CGGCGGCAGT TTGGGCGCGG ACGTTTTGAA CAAAACCGTA 

601 CCGCAGGCAT TGGCTTTGCT GCCCGACAAT GCGCGTCCGC AGATGTACCA 

651 CCAATCGGGA CGGGGCAAGC TGGGCAGCTT GCAGGCGGAT TACGACGCGC 

7 01 TGGGCGTGCA AGCGGAATGC GTGGAATTTA TTACCGATAT GGTGTCCGCC 

7 51 TACCGCGATG CCGATTTGGT GATTTGCCGT GCCGGCGCGC TGACGATTGC 
SOI CGAGTTGACG GCGGCGGGAT TGGGTGCGTT GTTAGTGCCG TATCCTCACG 

8 51 CCGTTGATGA CCATCAAACC GCCAACGCGC GTTTTATGGT GCAGGCGGAG 
901 GCGGGATTGC TGTTGCCGCA AACCCAGTTG ACGGCGGAAA AACTCGCCGA 
951 GATTCTCGGC GGCTTAAACC GCGAAAAATG CCTCAAATGG GCAGAAAACG 

1001 CCCGTACGTT GGCACTGCCG CACAGTGCGG ACGACGTTGC CGAAGCCGCG 

1051 ATTGCGTGTG CGGCGTAA 

This corresponds to the amino acid sequence <SEQ ID 282; ORF 087. a>: 

a087 .pep 

1 MGGKTFMLMA GGTGGHIFPA LAVADSLRAR GHHVIWLGSK DSMEERIVPQ 

51 YD1LLETLAI KGVRGNGIKR KLMLPFTLYQ TVREAQQIIR KHRVECVIGF 

101 GGFVTFPGGL AA KLLGVPIV IHEQNAVAGL SNRHLSRWAK RVLYAFPKAF 

151 SHEGGLVGNP VRADISNLPV PAERFQGREG RLKILVVGGS LGADVLNKTV 

201 PQALALLPDN ARPQMYHQSG RGKLGSLQAD YDALGVQAEC VEFITDMVSA 

2 51 YRDADLVICR AG ALTIAELT AAGLGALLV P YPHAVDDHQT ANARFMVQAE 

301 AGLLLPQTQL TAEKLAEILG GLNREKCLKW AENARTLALP HSADDVAEAA 

351 IACAA+ 



m087/a087 85.4% identity over a 355 aa overlap 

10 20 30 40 50 60 

m037 . pep MGGKTFMLXXGGTGGHIFPALAVADSLRARGHHVIWLGSKDSMEERIVPQYGIRLETLAI 
I I i I I I i I i i i I I I U I I I I t I I I I i i I I I I M ( ! I i I I I I ! I I t I 11 I I MUM 
a087 MGGKTFMLMAGGTGGHIFPALAVADSLRARGHHVIWLGSKDSMEERIVPQYDILLETLAI 

10 20 30 40 50 60 
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70 80 90 100 110 120 

m08 7 . pep PCGVRGNGIKRKLMLPVTLYQTVREAQRIIRKHRVECVIGFGGrvTFPGGLAAKLLXVPIV 
I I I I I I I i I I ! ! I II I I I I I I I i I I : I f i I I I I I i I I 1 I I I I I I ! I I I I I I I I I I I I i 
a 0 8 7 KGVRGU G I KRK LMLP FTLYQT VREAQQ IIRKH RVE CV IGFGG FVT FPGG LAAKLLG VP I V 

70 80 90 100 110 120 

130 140 150 160 1~C 180 

m03 7 . pep IHEQNAVAGLSNRKLSRWAKRVLYAFPKAr SHEGGLVGNPVRADI3NLPVPAERFQGREG 

I I I I I I I I I I I I M I I I I I I I M M I I II I I I I I t I I I I I I I I I I i I I I I I i I I I I i I I 1 
a OS 7 IHEQNAVAGLSNRHLSRWAKRVLYAFPKAFSHEGGLVGNPVRADISNLPVPAERFQGREG 

130 140 150 150 170 180 

190 200 210 220 230 240 

m08 7 . pep RLKILVVGGSLGADVLNKTVPHALALLPDNARPHMYHQSGRGKLGILQA>;XXXXXXXXXX 

I I I I I I I I I I i f I I I ! I I I I I : I I I I II I M I 1 : I M I I I I i I I I 111 
aOSl RLKILVVGGSLGADVLNKTVPQALALLPDNARPQMYHQSGRGKLGSLQADYDALGVQAEC 
190 200 210 220 230 240 

250 260 270 280 

m087 . pep XX XXXXXXXXXAGLGALLVPYPHAVDDHQTANARFMVQAE 

: : I I I I I I M I I I I I I I M I ! M I I I M I M 
a 087 VEFITDMVSAYRDADLVICRAGALTIAELTAAGLGALLVPYPHAVDDHQTANARFMVQAE 
250 260 270 280 290 300 

290 300 310 320 330 

n\087 . pep AGLLLPQTQLTAEKLAEILGGLNREKCLKWAENARTLALPHSADDVAEAAIACAAX 
' 1 M I 1 I I I ! ( M I! I ( I ! II I I i I I it I I I ! I I I M I I 11 1 I I 1 ! ! i I I M II I I 
a087 AGLLLPQTQLTAEKLAEILGGLNREKCLKWAENARTLALPHSADDVAEAAIACAAX 

310 320 330 340 350 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 283>: 

g088 . seq 

1 ATGTTTTTAT GGCTCGCACA TTTCAGCAAC TGGTTAACCG GTCTGAATAT 

51 TTTTCAATAC ACCACATTCC GCGCCGTTAT GGCGGCGTTG ACCGCCTTGG 

101 CGTTTTCCCT GATGTTCGGC CCGTGGACGA TACGCAGGCT GACCGCGCTC 

151 AAATGCGGGC AGGCAGTGCG TACCGACGGC CCGCAAACCC ACCTCGTCAA 

2 01 ■ AAACGGCACG CCGACGATGG GCGGTTCGCT GATTCTGACC GCCATTACCG 

2 51 TGTCCACCCT GTTGTGGGGC AACTGGGCGA ACCCGTATAT CTGGATTCTC 
301 TTGGGCGTAC TGCTTGCCAC CGGTGCGCTC GGTTTTTACG ACGACTGGCG 

3 51 CAAAGTCGTT TATAAAGACC CCAACGGCGT GTCCGCCAAA TTCAAAATGG 
401 TGTGGCAGTC AAGCGTTGCC GTTatcgcCG GTttggcaTT GTTTTACctt: 

4 51 gCcgcCAATT CCGCCAACAA TATTTTGATT GTCCCGt ttT TCAAACAAAT 

5 01 CGCCCTGCCG CTGGGCGTGG TCGGCTTttt gGtgttgTCT TACCTGACCA 
5 51 TCGTCGGCAC ATCCAACGCC GTCAACCTCA CcgaCGGCTT GGACGGCCTT 
601 GCCGCcttcc cgttcgtcc-c cgttgccgcC GGGCTCGCCA ttttcgccTA 
651 CGTCAGCGGA CACTACCAAT TTTCCCAATA CCTCCAGCTT CCCTATGTCG 
701 CCGGCGCGAA CGAAGTCGCT ATATTCTGCA CCGCCATGTG CGGCGCGTGC 
751 CTCGGATTTT TGTGGTTCAA CGCCTATCCC GCGCAAGTCT TTATGGGCGA 
801 TGTCGGCGCG CTGGCATTGG GTGCCGCGCT CGGTaccGtt gCCGTcaTcg 
651 tCCGCCAAGA ATTTGTcctc gtcattaTGG GCGGTCTGTT cgtcgtagaa 
901 gccgtgTCCG TTATGCTTCa tgtcggCTGG TACAAGAAAA Ccaaaaaacg 
951 CATCTTcCTg acgGcaccga ttcatcacca ttaCCaactt cgatgCTGGa 

1001 aagaaacgca agtcgtcgtc CGTTtCTGGA TTAtTAccat cgtcgtggtt 

1051 tTgataggtt tGagtacccT caAAattcgc ggaaactatg ccgTCCGAAC 

1101 ACCTTTCAGA CGGCATTTGA ACGCGCAATA A 

This corresponds to the amino acid sequence <SEQ ID 284; ORF 088. ng>: 

g088 -pep 

1 MFLWLAHFSN WLTGLNIFQY TTFRAVMAAL TALAFSLMF G PWTIRRLTAL 

51 KCGQAVRTDG PQTHLVKNGT P TMGGSLILT AITVSTLL WG NWANP YIWIL 

101 LGVLLATGAL GF YDDWRKW YKDPNGVSAK FKMVWQSS VA VIAGLALFYL 

151 AANSANK1LI VPFFKQIALP LGWGFLVLS YLTIVGTSNA VNLTDGLDGL 
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201 AAFPFVLVAA GLAIFA YVSG HYQFSQYLQL PYVAGANE VA I FCTAMCGAC 

251 LGFLW FNAYP AQVFMGD VGA LALGAALGTV AVI V RQE FVL VIMGGLFWE 

3 01 AVSVMLHVGW YKKTKKRIFL TAPIHHHYQL RCWKETQ VW RFWIITIVW 

351 LIGL STLKIR GNYAVRTPFR RHLNAQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 285>: 

m088 . seq 



1 


ATGTTTTTAT 


GGCTCGCACA 


TTTCAGCAnC 


TGGTTAACCG 


GTCTGAATnn 


51 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


101 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


151 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


201 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnr.nr.r. 


nnnnnnnnnn 


251 


nnnnnnnnnn 


nnnnnnnnnr. 


nnnnnnnnnn 


nnnnnnnn/.n 


nnnnnnnnnn 


301 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


351 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


401 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


451 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


501 


nnnnnnnnnn 


nnnGGCGTGG 


TCGGCTTTTT 


GGTGTTGTCT 


TACCTGACCA 


551 


TCGTCGGCAC 


ATCCAATGCC 


GTCAACCTCA 


CCGACGGCTT 


GGACGGCCTT 


601 


GCGACCTTCC 


CCGTCGTCCT 


CGTTGCCGCC 


GGCCTCGCCA 


TCTTCGCCTA 


651 


TGCCAGCGGC 


CACTCACAAT 


TTGCCCAATA 


CCTGCAATTA 


CCTTACGTTG 


701 


CCGGCGCAAA 


CGAAGTGGTG 


ATTTTCTGTA 


CCGCCATGTG 


CGGCGCGTGC 


751 


CTCGGTTTCT 


TGTGGTTTAA 


CGCCTATCCC 


GCGCAAGTCT 


TTATGGGCGA 


801 


TGTCGGTGCA 


TTGGCATTGG 


GTGCCGCGCT 


CGGTACCGTC 


GCCGTTATCG 


851 


TCCGCCAAGA 


GTTTGTCCTC 


GTCATTATGG 


GCGGATTATT 


TGTCGTAGAA 


901 


GCCGTATCCG 


TTATGCTTCA 


GGTTGGCTGG 


TATAAGAAAA 


CCAAAAAACG 


951 


CATCTTCCTG 


ATGGCGCCCA 


TCCATCACCA 


CTACGAACAA 


AAAGGCTGGA 


1001 


AAGAAACCCA 


AGTCGTCGTC 


CGCTTTTGGA 


TTATTACCAT 


CGTCTTGGTG 


1051 


TTGATCGGTT 


TGAGTACCCT 


CAAAATCCGC 


TGAACCTATG 


CCGTCTGAAC 


1101 


ATCTTTCAGA 


CGGCATTTGA 


ACGCGCAATA 


A 





1 MFLWLAHFSN WLTGLNIFQY TTFRAVMAAL TALAFSLMF G PWTIRRLTAL 

51 KCGQAVRTDG PQTHLVKNGT P TMGGSLILT AITVSTLL WG NWANP YIWIL 

101 LGVLLATGAL GF YDDWRKW YKDPNGVSAK FKMVWQSS VA VIAGLALFYL 

151 AAKSANNILI VPFFKQIA LP LGWGFLVLS YLTIV GTSNA VNLTDGLDGL 

201 AAFPFVLVAA GLAIFA YVSG HYQFSQYLQL PYVAGANE VA I FCTAMCGAC 

251 LGFLW FNAYP AQVFMGD VGA LALGAALGTV AVI V RQE FVL VIMGGLFWE 

3 01 AVSV MLHVGW YKKTKKRIFL TAPIHHHYQL RCWKETQ VW RFWIITIVW 

3 51 LIGLSTLKIR GNYAVRTPFR RHLNAQ * 



This corresponds to the amino acid sequence <SEQ ID 286; ORP 088>: 


m0 88 .pep 
1 


MFLWLAHFSX 


WLTGLNXXXX 


XXXXXXXXXX 


XXXXXXXXXX 


XXXXXXXXXX 


51 


XXXXXXXXXX 


XXXXXXXXXX 


XXXXXXXXXX 


XXXXXXXXXX 


XXXXXXXXXX 


101 


xxxxxxxxxx 


XXXXXXXXXX 


XXXXXXXXXX 


XXXXXXXXXX 


XXXXXXXXXX 


151 


XXXXXXXXXX 


XXXXXXXXXX 


XGWGFLVLS 


YLTIVGTSNA 


VNLTDGLDGL 


201 


ATFPWLVAA 


GLAIFAYASG 


HSQFAQYLQL 


PYVAGANEW 


I FCTAMCGAC 


251 


LGFLWFNAYP 


AQVFMGDVGA 


LALGAALGTV 


AVI VRQEFVL 


VIMGGLFWE 


301 


AVSVMLQVGV? 


YKKTKKRIFL 


MAPIHHHYEQ 


KGWKETQVW 


RFWIITIVLV 


351 


LIGLSTLKIR 


XTYAVXTSFR 


RHLNAQ* 







Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 088 shows 91 .7% identity over a 205 aa overlap with a predicted ORF (ORF 088.ng) 
from N. gonorrhoeae: 

m088/g088 



10 20 30 

mO 8 8 . pep GWGFLVLSYLTI VGTSNAVNLTDGLDGLA 

IIIIIIIIIMIIIIIIIIIIMIIIMII 

go 8 8 I AGLALFYLAANSANNI LI VPFFKQIALPLGWGFLVLS YLTIVGTSNA VNLTDGLDGL A 

150 160 170 180 ISO 200 
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40 50 60 70 80 90 

m08 3 . pep TFPWLVAAGLAI FAYASGHSQFAQYLQLPYVAGANEWI FCTAMCGACLGFLWFNAYPA 

= 11 mitiiliiihli! ihiililillflillhllilMilllllNMillN 

g0 S 3 AFPFVLVAAGLAI FAYVSGHYQFSQYLQLPYVAGANEVAI FCTAMCGACLGFLWFNAYPA 

21T 220 230 240 250 260 



100 110 120 130 140 150 

mO 88 . Dep QVFMGDVGALALGAALGTVAVIVROEFVLVIMGGLFWEAVSVMLOVGWYKKTKKRI FLM 

I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I i 1 I I = I I I I I I I I I II I i 
g08 8 OVFMGDVGALAIX3AALGTVAVIVROEFVLVIMGGLFWEAVSVMIiHVGWYKKTKKRIFLT 
270 280 290 300 310 320 



160 170 180 190 200 

m08 8 .pep APIHHHYEQKGWKETQVWRFWIITIVLVLIGLSTLKIRXTYAVXTSFRRHLNAQX 

1111111= ; IIIIIIIIIUIIMMIIilllilll Mil I IMIIilll 

g 0 8 8 API HHHYOLRCWKETQVWRFW 1 1 T I WVL IGLSTLKI RGNYAVRTPFRRHLNAQX 

330 340 350 360 370 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 287>: 

a068 . seq 

1 ATGTTTTTAT GGCTCGCACA TTTCAGCAAC TGGTTAACCG GTCTGAATAT 

51 TTTTCAATAC ACCACATTCC GCGCCGTCAT GGCGGCGTTG ACCGCCTTGG 

101 CGTTTTCCCT GATGTTCGGC CCGTGGACGA TACGCAGGCT GACCGCGCTC 

151 AAATGCGGGC AGGCAGTGCG TACCGACGGT CCGCAAACCC ACCTCGTCAA 

201 AAACGGCACG CCGACGATGG GCGGTTCGCT GATTCTGACC GCCATTACCG 

2 51 TGTCCACCCT GTTGTGGGGC AACTGGGCAA ACCCGTATAT CTGGATTCTC 
301 TTGGGCGTAT TGCTCGCCAC GGGCGCACTC GGTTTTTACG ACGACTGGCG 

3 51 CAAAGTCGTC TATAAAGACC CCAACGGCGT GTCCGCCAAA TTCAAAATGG 

4 01 TGTGGCAGTC AAGCGTTGCC ATTATCGCCG GTTTGGCATT GTTTTACCTT 
4 51 GCCGCCAATT CCGCCAACAA TATTTTGATT GTCCCGTTCT TCAAACAAAT 
501 CGCCCTGCCG CTGGGCGTGG TCGGCTTTTT GGTGTTGTCT TACCTGACCA 
551 TCGTCGGCAC ATCCAATGCC GTCAACCTCA CCGACGGCTT GGACGGCCTT 
601 GCGACCTTCC CCGTCGTCCT CGTTGCCGCC GGCCTCGCCA TCTTCGCCTA 
651 TGCCAGCGGC CACTCACAAT TTGCCCAATA CCTGCAATTA CCTTACGTTG 
7 01 CCGGCGCAAA CGAAGTGGTG ATTTTCTGTA CCGCCATGTG CGGCGCGTGC 
7 51 CTCGGTTTCT TGTGGTTTAA CGCCTATCCC GCGCAAGTCT TTATGGGCGA 
801 TGTCGGTGCA TTGGCATTGG GTGCCGCGCT CGGTACCGTC GCCGTCATCG 
851 TCCGCCAAGA GTTTGTCCTC GTCATTATGG GCGGATTATT TGTCGTAGAA 
901 GCCGTATCCG TTATGCTTCA GGTCGGCTGG TATAAGAAAA CCAAAAAACG 
951 CATCTTCCTG ATGGCGCCCA TCCATCACCA CTACGAACAA AAAGGCTGGA 

1001 AAGAAACCCA AGTCGTCGTC CGCTTTTGGA TTATTACCAT CGTCTTGGTG 

1051 TTGATCGGTT TGAGTACCCT CAAAATCCGC TGAACCTATG CCGTCTGAAC 

1101 ACCTTTCAGA CGGCATTTGA ACGCGCAATA A 



This corresponds to the amino acid sequence <SEQ ID 288; ORF 088. a>: 

a088 -pep 

1 MFLWLAHFSN WLTGLNIFQY TT FRAVMAAL TALAFSLMF G PWTIRRLTAL 

51 KCGQAVRTDG PQTHLVKNGT P TMGGSLILT AITVSTLL WG NWAMPYIWTL 

101 LGVLLATGAL GF YDDWRKVV YKDPNGVSAK FKMVWQSSVA I IAGLALFYL 

151 AANSANNILI VPFFKQIA LP LGWGFLVLS YLTIV GTSNA VNLTDGLDGL 

201 ATFPVVLVAA GLAIFA YASG H5QFAQYLQL PYVAGANE VV IFCTAMCGAC 

251 LGFLW FMAYP AQVFMGD VGA LALGAALGTV AVIV RQE FVL VIHGGLFV\ 7 E 

301 AVSVMLQVGW YKKTKKRIFL MAPIHHHYEQ KGWKETQ VVV RFWIITIVLV 

351 LIGL STLKIR *TYAV*TPFR RHLNAQ^ 



m088/a088 99.5% identity over a 205 aa overlap 



150 160 170 180 190 200 

m088 . pep XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGWGFLVLSYLTIVGTSNAVNLTDGLDGLA 

I i I I I I I I I I I I I I I I i ! I I I f I f I I I I I I 
a088 IAGLALFYLAANSANNILIVPFFKQIALPLGVVGFLVLSYLTIVGTSNAVNLTDGLDGLA 
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150 160 170 180 190 200 

210 220 230 240 250 260 

m08 8 . pep TFPVVLVAAGI^IFAYASGHSQFAQYLQLFYVAGANEVVIFCTAMCGACLGFLWFNAYPA 
M i i I II I I I I II M ! ! M II I I I I i I i I I I M U I I I I ! I I I i i I II I 1 ! I I i I i i i ! I 
a 0 8 6 T FP VVLVAAG LAI FA YASGH SQFAQY LQLPY VAGANE VV I FCTAMCGACLG FLW FN' AY PA 

210 220 230 240 250 260 

270 280 290 300 310 320 

m088 . pep QVFMGDVGALALGAALGTVAVIVRQEFVLVIMGGLFVVEAVSVMLQVGWYKKTKKRIFLM 
II II ! I I i I i 1 I I I I i I 1 I I I I i I I ! S I I i 1 I I i I I I ' : I I 1 I I I I I II I I I ! I M I i II 
a08 8 QVFMGDVGALALGAALGTVAVIVRQEFVLVIMGGLFV' "IAVSVMLQVGWYKKTKKRI FLM 

270 280 290 300 310 320 

330 340 350 360 370 

m08 8 . pep APIHHHYEQKGWKETQVVVRFWIITIVLVLIGLSTLKIRXTYAVXTSFRRHLNAQX 
I I I i II I I M M i I I i I I I i i I I i I i I i I I I I M I I i I I II I I i I I I I i I II I I I 
aO 88 APIHHHYEQKGWKETQVVVRFWI ITIVLVLIGLSTLKIRXTYAVXTPFRRHLNAQX 

330 340 350 360 370 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 289>: 

g089 . seq 

1 ATGCCGCCCA AAATCACGAA GAGCGGGTTT TGCAAACCGG CAATCGCGGC 

51 GGCGGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATG AATACCACGC 

101 CGTTTTTCTC GCCGATTTTT TCCACACGGT GCGGCAAGCC TTGGAAGGTT 

151 TTGACGTGTT CCAGCAATGC TTCGCGCGGC AAACCGACGG CCTCGCACAA 

201 AGCCACGGCA GCCATAACGT TGGCGGCGTT GTGCAAACCT TGCAGCGGGA 

2 51 TGTCTTGCGT AGAAATCAAA TCTTCATTGC CTTGTTTTAA ACAGCCCGTC 
301 CCGCGTTCCA ACCAAAAATC GGCTTCGTGT TCCAAGGAAA ACCGTTTCAC 

3 51 TTCACGCCCT GCCCGTTTCA TGGCGCGGCA GAACACGTCG TCCGCATTCA 
401 AAACCTGCAC TCCATCGCCA CGGAAAATCT CGGCTTTGGT ATGCGCGTAG 

This corresponds to the amino acid sequence <SEQ ID 290; ORF 089.ng>: 

g089 .pep 

1 MPPKITKSGF CKP AIAAAVA PTFVPLLSSM NTTPFFSPIF STRCGKPWKV 

51 LTCSSNASRG KPTASHKATA AITLAALCKP CSGMSCVEIK SSLPCFKQPV 

101 PRSNQKSASC SKENRFTSRP ARFMARQNTS SAFKTCTPSP RKISALVCA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 29 1>: 

m089 . seq 

1 ATGCCGCCCA AAATCACkAw GAGCGGATTT TGCAAACCGG CAATCGCGGC 

51 GGCAGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATA AACACCACGC 

101 CGTTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC TTGGAAGGTT 

151 TTGACGTGTT CGAGCAATGC TTCGCGCGAC AAACCGATGG CCTCACACAA 

2 01 AGCCACkGCA GCCATGACGT TAGCGGCGTT GTGCAkACCT TGCAACGGwA 

251 TGTCTTGCGT GACAATCAAA TCTTCATTGC CTTGTTTCAG GCGGCCTGTC 

301 TCGCGTTCCA ACCAGAAATC AGCTTCGTGT TCCAACGAAA ACCATTTTAC 

351 CTCGCGCCCG GCACGCTTCA TCGCGCGGCA GAACGCATCG TCCGCATTCA 

401 AAACCTGCAC GCCGTCGCCA CGGAAAATCT TGGCTTTGGT ATGCGCATAG 

This corresponds to the amino acid sequence <SEQ ID 292; ORF 089>: 

m089 -pep 

1 MPPKITXSGF CKP AIAAAVA PTFVPLLSSI NTTPFFSPIF STRCGRPWKV 

51 LTCSSNASRD KPMASHKATA AMTLAALCXP CNGMSCVTIK SSLPCFRRPV 

101 SRSNQKSASC SNENHFTSRP ARFIARQNAS SAFKTCTPSP RKILALVCA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 089 shows 88.6% identity over a 149 aa overlap with a predicted ORF (ORF 089.ng) 
from K gonorrhoeae: 

m089/g089 
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10 20 30 40 50 60 

m089 .pep MPPKITXSGFCKPAIAAAVAPTFVPLLSSINTTPFFSPIFSTRCGRPWKVLTCSSNASRD 

MINI 1 1 1 1 i 1 1 1 i M ! I i i I i 1 1 1 i I 1 1 1 1 ! 1 1 1 1 1 i I i t ! 1 1 1 1 1 1 1 1 1 1 i 1 1 

g039 MPPKITKSGFCKPAIAAAVAPTFVPLLSSMNTTPFFSPIFSTRCGKPWKVLTCSSNASRG 

10 20 30 40 BO 60 



70 80 90 100 110 120 

m08 9 . pep KPMASHKATAAMTLAALCXPCNGMSCVTIKSSLPCFRRPVSRSNQKSASCSNENHFTSRP 

i i iiiinihiiiiii iMiiii miim-M nmiiiihihiim 

g08 9 KPTASHKATAAITLAALCKPCSGMSCVEIKSSLPCFKQPVPRSNOKSASCSKENRFTSRP 
70 80 90 100 110 120 



130 140 150 

m08 9 .pep ARFIARQNASSAFKTCTPSPRKILALVCAX 

IIMMMIIMIIIIIIMI MINI 

g089 AR FMARONTSSAFKTCTPS PRKI SALVCAX 

130 140 150 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 293>: 

a089. seq 

1 ATGCCGCCTA AAATCACGAA GAGCGGATTT TGCAAACCGG CAATCGCGGC 

51 GGCGGTCGCA CCGACGTTCG TGCCTTTGCT GTCGTCGATG AACACCACGC 

101 CATTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC TTGAAAGGTT 

151 TTGACGTGTT CGAGCAATGC TTCGCGCGGC AAACCGACGG CTTCGCACAA 

201 GGCAACGGCA GCCATCACGT TAGTGGCGTT GTGCAAGCCT TGCAGCGGAA 

2 51 TATCTTGCGT GGCAATCAAA TCTTCATTGC CTTGTTTCAG GCGACCTGTC 

301 TCACGTTCCA ACCAAAAATC GGCTTCGTAT TCCAACGAAA ACCATTTCAC 

351 CTCGCGCCCG GCGCGCTTCA TCGCACGACA GAACGCATCG TCCGCATTCA 

4 01 AAACCTGCAC ACCGTCGCCA CGGAAAATCT TGGCTTTGGT ATGCGCGTAG 

This corresponds to the amino acid sequence <SEQ ID 294; ORF 089. a>: 

a089 .pep 

1 MPPKITKSGF CKP AIAAAVA PTFVPLLSSM NTTPFFSPIF STRCGRP*KV 

51 LTCSSNASRG KPTASHKATA AITLVALCKP CSGISCVAIK SSLPCFRRPV 

101 SRSNQKSASY SNENHFTSRP ARFIARQNAS SAFKTCTPSP RK1LALVCA* 



m089/a089 91 .9% identity over a 149 aa overlap 

10 20 30 40 50 60 

m08 9,pep MPPKITXSGFCKPAIAAAVAPTFVPLLSSINTTPFFSPIFSTRCGRPWKVLTCSSNASRD 

I I I ! II I I I I I I I I I I i t I I I M I I i I I : II I I I I I If I I I I I I \ \ I ! I I I I I I I l I 
a08 9 MPPKITKSGFCKPAIAAAVAPTFVPLLSSMNTTPFFSPIFSTRCGRPXKVLTCSSNASRG 

10 20 30 40 50 60 



70 80 90 100 110 120 

m08 9 . pep KPMASHKATAAMTLAALCXPCNGMSCVTIKSSLPCFRRPVSRSNQKSASCSNENHFTSRP 
II I I I I I I I j : I I : I I I t I : I : I I I : t II li I II I II I I I I I I I I I I I I I I I I I I I f 
a08 9 KPTASHKATAAITLVALCKPCSGISCVAIKSSLPCFRRPVSRSNQKSA3YSNENHFTSRP 

70 80 90 100 110 120 



130 140 150 

mO 8 9 . pep ARFI ARQNASSAFKTCT PS PRKI LALVCAX 

! I II I I I I I It I I i I I I I I I I I i II I ! I t I 
a08 9 ARFIARQNAS SAFKTCTPSPRKILALVCAX 

130 140 150 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 295>: 

g090 . seq 

1 ATGCGCGTAG TCGAGCAAAT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 
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51 TGTTCATCAC CGTCGCCGCA GTCGGGCGCA GGCTTTCGGT GTTTTCCAGT 

101 TGGAAGCTGG AAAGCTCcca CACCCACACG TCCGCCTTTT TGCCTTCgcg 

151 ctgCAATtct gcctccaaga cgggcgtacc gat ATTGCCC GCAATGAcgg 

201 tatccagccc gcactcgatg CAGAGatagc ggaccaggct ggttaccgTG 

251 GTTt cgccgt tgctgCcggt aatcgCaatc accctgccgC CGCGGCGG tt 

3 01 cAcaaTGTCc gccaGCAATc ggATGTCGCC TAgCACGCGC . ccgccgTTT 

351 TGCttga 

This corresponds to the amino acid sequence <SEQ ID 296; ORF 090.ng>: 

g090 . pep 

1 MRWEQIWA VEMVFGNVHH RRRSRAQAFG VFQLEAGKLP HPHVRLFAFA 

51 LQFCLODGRT DIARNDGIQP ALDAEIADQA CYF.GFAVAAG NRNHLVAAAV 

101 HNVRQQLDVA XHA/RRFA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 297>: 

m090 . seq 

1 ATGCGCATAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 

51 TGTTCAGCAC CGTCGCCGCA GTCGGACGCA GGCTTTCGGT GTTTTCCAGT 

101 TGGAAGCTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

151 CTGCCATTCC GCCTCCAAAA CCGGCGTGCC GATATTGCCC GCGATAACGG 

201 TATCCAGCCC GCACTTGATA CAGAGATAGC CGACCAGGCT CGTTACCGTG 

251 GTTTTGCCGT TGCTGCCGGT AATCGCAATT ACCTTGTCGT CCCGGCGGTT 

3 01 CACAATGTCC GCCAGCAATT CGATGTCGCC CAACACGCGT . CCGCCGTTT 

3 51 TGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 298; ORF 090>: 

m090 . pep 

1 MRIVEQWVA VEMVFGNVQH RRRSRTQAFG VFQLEAGKLQ HPHVRLFAFA 

51 LPFRLQNRRA DIARDNGIQP ALDTEIADQA RYRGFAVAAG NRNYLWPAV 

101 HNVRQQFDVA QHAXRRFA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 090 shows 83.9% identity over a 1 1 8 aa overlap with a predicted ORF (ORF 090,ng) 
from N. gonorrhoeae: 

m090/g090 



m090 .pep 
g090 



m090 . pep 
g090 

70 80 90 100 110 



10 20 30 40 50 60 

MRIVEQVWAVEMVFGNVQHRRRSRTQAFGVFQLEAGKLQHPHVRLFAFALPFRLQNRRA 

l|:|!|:|||IIM!llhlll!!|:||||ll!ll!lll I I 1 I I I I I I I I I lh h 
MRWEQIWAVEMVFGNVHHRRRSRAQAFGVFQLEAGKLPHPHVRLFAFALQFCLQDGRT 
10 20 30 40 50 60 

70 80 90 100 110 119 

DIARDNGIQPALDTEIADQARYRGFAVAAGNRNYLWPAVHNVRQQFDVAOHAXRRFAX 

I ! I I : : I I I I I I I : 1 ! I ! ! ! I I I I II i I I i I \ i h lllllllhlll MIIIIM 
D I ARNDG I OPALDAE I ADQAG YRGFAVAAGNRNHLVAAAVHNVRQQLD VAXHAXRRFAX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 299>: 

a090 . seq 

1 ATGCGCGTAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 

51 TGTTCAGCAC TGTCGCCGCA GTCGGGCGCA GGCTTTCGGT GTTTTCCAGT 

101 TGGAAACTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

151 CTGCAATTCC GCCTCCAAAA CCGGCGCGCC GATATTGCCC GCGATAACGG 

201 TATCCAGCCC ACACTTGATG CAGAGATAGC CGACCAGGCT CGTTACCGTG 

251 GTTTTGCCGT TGCTGCCGGT AATCGCAATC ACCTTGTCGC CGCGGCGGTT 

301 CACAATGTCC GCCAGCAATT CGATGTC<"C CAACACGCGT CCGCCGTTT 

351 CGCTTAA 



This corresponds to the amino acid sequence <SEQ ID 300; ORF 090.a>: 
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a090.pep 

1 MRVVEQVVVA VEMVFGNVQH CRRSRAQAFG VFQLETGKLQ HPHVRLFAFA 

51 LQFRLQNRRA DIARDNGIQP TLDAEI ADQA RYRGFAVAAG NRNHLVAAAV 

101 HNVRQQFDVA QHAXRRFA* 

m09/a090 91.5% identity over a 1 1 7 aa overlap 

10 20 30 40 50 60 

m090 . pep MRI VEQVVVAVEMVFGNVQHRRRSRTQAFGVFQLEAGKLQKPHVRLFAFALPFRLQNRRA 

I I : I I I I I I I I I I I I I I I I I I f I I : I I I I I I I I f : I I I f | I | | | | | | M j | I \ I | | | |" 
a0 90 MRVVEQVVVAVEMVFGNVQHCRRSRAQAFGVFQLETGKLCHPKVRLFAFALQFRLQNRRA 

10 20 30 40 50 60 

70 80 90 ICO 110 119 

m0 90. pep DIARDNGIQPALDTEIADQARYRGFAVAAGNRNYLWPAVHNVRQQFDVAQHAXRRFAX 
1 I II If ! I I I : I I : I I I I ! I I I ! f I I I I I I I I I : I j : f | | | | [ I I I I I I I I ! j I | | | \ 
a090 DIARDNGIQPTLDAEIADQARYRGFAVAAGNRNHLVAAAVHNVRQOFDVAQHAXRRFAX 
70 80 90 100 110 

The following partial DNA sequence was identified in N. gonorrhoeae 

g090-1.3eq This sequence contains multiple stop codons (not shown) 

This corresponds to the amino acid sequence < ORF 090-1 .ng>: 

g090-l.pep (not shown) 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 2>: 

m090-l .seq 

1 ATGACGGCGT TTGCATTTCA GACGGCATCA CAAAGCCTTA AACGCTTCGA 

51 TAAACACTTC CGAACGGTGC GCGTAGCCTT TGAACATATC AAAGCTCGCG 

101 CAGGCGGGGC TGAGCAACAC AATATCGCCT GCTTCGGCTT GGGCAT-TGC 

151 CGTCTGAACG GCTTCTCCCA AAGTGGCGCA GTCGGTCATA TTCAAGCCGC 

201 AGCCGTCCAA ATCGCGGCGG ATTTGCGGCG CATCGACACC AATCAAGAAC 

251 ACGCCTTTTG CCTTGCCTAC CAGTGCATCG CGCAGGGGCG TGAAGTCCTG 

301 CCCTTTACCC ATGCCGCCCA AAATCACGAA GAGCGGATTT TGCAAACCGG 

351 CAATCGCGGC GGCAGTCGCG CCGACATTCG TGCCTTTGCT GTCGTCGATA 

4 01 AACACCACGC CGTTTTTCTC GCCGATTTTT TCCACGCGGT GCGGCAGGCC 

4 51 TTGGAAGGTT TTGACGTGTT CGAGCAATGC TTCGCGCGAC AAACCGATGG 

501 CCTCACACAA AGCCACGGCA GCCATGACGT TAGCGGCGTT GTGCAGACCT 

551 TGCAACGGAA TGTCTTGCGT GACAATCAAA TCTTCATTGC CTTGTTTCAG 

601 GCGGCCTGTC TCGCGTTCCA ACCAGAAATC AGCTTCGTGT TCCAACGAAA 

651 ACCATTTTAC CTCGCGCCCG GCACGCTTCA TCGCGCGGCA GAACGCATCG 

701 TCCGCATTCA AAACCTGCAC GCCGTCGCCA CGGAAAATCT TGGCTTTGGT 

7 51 ATGCGCATAG TCGAGCAAGT CGTCGTAGCG GTCGAGATGG TCTTCGGAAA 

801 TGTTCAGCAC CGTCGCCGCA GTCGGACGCA GGCTTTCGGT GTTTTCCAGT 

851 TGGAAGCTGG AAAGCTCCAA CACCCACACG TCCGCCTTTT TGCCTTCGCG 

901 CTGCCATTCC GCCTCCAAAA CCGGCGTGCC GATATTGCCC GCGATAACGG 

951 TATCCAGCCC GCACTTGATA CAGAGATAGC CGACCAGGCT CGTTACCGTG 

1001 GTTTTGCCGT TGCTGCCGGT AATCGCAATT ACCTTGTCGT CCCGGCGGTT 

1051 CACAATGTCC GCCAGCAATT CGATGTCGCC CAACACGCGT CCGCCGTTTT 

1101 GCTTGAACGC CTCAATATCC GGCTGCCGCT CGCTGATGCC GGGACTGAGA 

1151 GCCAGAATAT CGAAACCGTT GTCCAGCGCA TCTTTCAGAC GGCCCGTGTA 

1201 AAACACCAAC CCGTCAAACA TCTTACCGAT TTGCGACACG CGTTCCGGCT 

1251 TCAGCTCCGC ATCATACGCA GCAACCTCCG CGCCGTTTTT GCGCAGGTAG 

1301 GCAATCATGG AAATACCCGT ACCGCCGAGT CCGGCGACGA GGATTTTTTT 

1351 GTTTTGAAAA GTCATTTTGG TTTGTCCTAA 

This corresponds to the amino acid sequence <SEQ ID 3; ORF 090- 1>: 

m090-l.pep 

1 MTAFAFQTAS QSLKRFDKHF RTVRVAFEHI KARAGGAEQH NIACFGLGIC 

51 RLNGFSQSGA VGHIQAAAVQ IAADLRRIDT NQEHAFCLAY QCIAQGREVL 

101 PFTHAAQNHE ERILQTGNRG GSRADIRAFA VVDKHHAVFL ADFFHAVRQA 

151 LEGFDVFEQC FARQTDGLTQ SHGSHDVSGV VQTLQRNVLR DNQ IFIALFQ 

201 AACLAFQPEI SFVFQRKPFY LAPGTLHRAA ERIVRIQNLH AVATENLGFG 

251 MRIVEQVVVA VEMVFGWVQH RRRSRTQAFG VFQLEAGKLQ HPHVRLFAFA 

301 LPFRLQNRRA DIARDNGIQP ALDTEIADQA RYRGFAVAAG NRNYLWPAV 

351 HNVRQQFDVA QHASAVLLER LNIRLPLADA GTESQNIETV VQRI FQ7ARV 

4Q1 KHQPVKKLTD LRHAFRLQLR 1 1 RSNLRAVF AQVGMKGNTR TAESGDEDFF 

451 VLKSHFGL3 * 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 303>: 

g091 . seq 

1 ATGGAAATAC CCGTGCCCCC AAGTCCGGCG AGGAGGATTT TTTTGTTTGA 

51 AAGTCATTTT GGTTTTGTCC TAAAACAAAT CATATTGGGC AGGAGACGTC 

101 CGCCCTTGCC CAAGCCGCTT TCAGACGGCA TCGCGAGCCG ATTAATAACC 

151 CGCCTTCAGG CGTTGGTCAT TGTCGCAGCT GTTTTGGTCT CCGTTTTGAC 

201 AAGCCTTGCC AAGCCATTGT TGAGCGAGCG CAAGGTCTTG GCGCACGCCG 

251 CGTCCATCGT AATACATCAA GCCCAAATTG TATTGGGCTT GGGCATCCCC 

3 01 TTGTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 304; ORF 091.ng>: 

g091 . pep 

1 MEIPVPPSPA TRIFLFESHF GFVLKQIILG RRRPPLPKPL SDGIASRLIT 
51 RLQ ALVIVAA VLVSVLTSLA KPLLSERKVL AHAA5 IVIHQ AQIVLGLGIP 
101 LF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 305>: 

m091 . seq 

1 ATGGAAATAC CCGTACCGCC GAGTCCGGCG ACGAGGATTT TTTTGTTTGA 

51 AAAGTCATTT TGGTTTGTCC TAAAACAAAT CATATTGAGC AGGAGATGTC 

101 CGCCCCTGCC CAAGCCGCTT TCAGACGGCA TCGCGAGCTG TTCAATAACC 

151 CGCCTTCAGG CGTTGGTCAT TGTCGCAGCC GTCTTGGTCT CCGTTTTGAC 

201 AAGCCTTGCC AAACCATTCT TGTGCAAGGG CGCGGTCTTG GCGCACGCCG 

251 CGTCTTTCGG CATACATCAC GCCCAAATTG TTTTGGGCTT GGGCTACCCC 

301 CTGCGC. . . 

This corresponds to the amino acid sequence <SEQ ID 306; ORF 09 1>: 

m0 9 1 . pep 

1 MEIPVPPSPA TRIFLFEKSF WFVLKQIILS RRCPPLPKPL SDGIASCSIT 
51 RLQ ALVIVAA VLVSVLTSLA KPFLCKGAVL AHAASFG IHH AQIVLGLGYP 
lCl LR. 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 091 shows 84.2% identity over a 101 aa overlap with a predicted ORF (ORF 091. ng) 
from N. gonorrhoeae: 

m09l/g091 



10 20 30 40 50 60 

m091.pep MEIPVPPSPATRIFLFEKSFWFVLKQIILSRRCPPLPKPLSDGIASCSITRLQALVIVAA 

IMIIilllMMillh I I II II II hi I UN II I II II II III I III UN! 

g091 MEIPVPPSPATRIFLFESHFGFVLKQI ILGRRRPPLPKPLSDGIASRLITRLQALVIVAA 

10 20 30 40 50 60 



70 SO 90 100 

m091 . pep VLVS VLTS LAKP FL CKGAVLAHAAS FG I HHAQ I VLGLG YP LR 

■ : llllllh Ihllllllll II 

g0 91 VLVSVLTSLAKPLLSERKVLAHAASIVIHQAQIVLGLGIPLFX 
70 80 90 100 



The following partial DNA sequence was identified in N. meningiditis <SEQ ID 307>: 

a09l . seq 

1 ATGGAAATAC CCGTGCCGCC AAGTCCGGCG ACGAGGATTT TTTTGTTTTG 

51 GAAATCATTT TGGTTTGTCC TAAAACAAAT CATATTGAGC AGGGGATGTC 

101 TGATCCTGCT CAAGCCGCTT TCAGACGGCA TCGCGAGCTG TTCAATAACC 

151 CGCTTTCAGG CGTTGGTCAT TGTCGCAGCT GTCTTGGTAT CCGTTTTGAC 

201 AAGCCTTGCC AAGCCATTCT TGTGCAAGGG CGCGGTCTTG GCGCACGCCG 

2 51 CGTCTTTCGG CATACATCAC GCCCAAATTG TTTTGGGC 
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This corresponds to the amino acid sequence <SEQ ID 30S; ORF 091.a>: 

a091 .pep 

1 MEIPVPPSPA TRI FLFWKSF WFVLKQIILS RGCLI LLKPL SDGIASCSIT 
51 RFQ ALVIVAA VLVSVLTSLA KPFLCKGAVL AHAASFGIHH AQIVLG 

m09i/a09i 93.8% identity over a 96 aa overlap 

10 20 30 40 50 60 

m0 91 .pep MEI PVPPSPATRI FLFEKSFWFVLKQI ILSRRCPPLPKPLSDGI ASCSITRLQALVIVAA 

M ! I ( I f I I i I I I I I I I f I I I i 1 I! I II I I i I i I I ! i I I I M I I ! ! : J I I \ M I I 
a 0 9 1 MEIPVPPS PAT R I FL FWKS FW FVLKQ 1 1 L SRGC L I LLK PL S DG I A 3 CS I T R FQALV T V AA 

10 20 30 40 50 60 

70 80 90 100 

m091 .pep VLVSVLTSLAKPFLCKGAVLAiiAASFGIHHAQIVLGLGYPLR 

t t i 1 t i I t t I 1 i I I I M t I I II I I I! I M ! I II I II 
a 0 9 1 VLVSVLTSLAKPFLCKGAVLAHAAS FGIHHAQI VLG 

70 80 90 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 309>: 

g092 . seq 

1 ATGTTTTTTA TTTCAATCCG CTATATATTT GTCAGAAAAC TATGGTGCGC 

51 AAACGGTCAG ACCTTTAAAA TAACGCCTTT ACGCACTAAA AACCAACCGG 

101 AACGCAACAT TATGATGAAA AATCGAGTAA GCAACATCCA TTTTGTCGGT 

15 n ATCGGCGGCG TCGGCATGAG CGGTATCGCC GAAGTCTTGC ACAATTTGGG 

2 01 CTTTAAAGTT TCCGGTTCGG ATCAGGCGCG AAATGCCGCT ACCGAGCATT 
251 TGAGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC AGAACACGTT 

3 01 AACGGTgcgg ATGTCGTCGT TGCCTCTACC GCCGTCAAGA AAGAAaatcC 
351 CGAAGTtgtc gcTGCGTTGG AGCGGCAAAT TCCCGTTATT CCGCGCGCCT 

4 01 TGATGCTGGC AGAGCTGATG CGCTTCCGTG ACGgcatcgc cattgccggT 

4 51 ACGCACGGCA AAACCACGAC CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 
501 GGCAGGACTC GACCCCACTT TCGTTATCGG CGGCAAACTC AACGCCGCAG 

5 51 GCACCAACGC CCGCTTGGGC AAAGGCGAAT ACATCGTTGC CGAAGCCGAC 

6 01 GAATCCGATG CCTCTTTCCT ACATCTGACC CCGATTATGT CCGTCGTTAC 
651 CAATATCGAC GAAGACCATA TGGATACCTA CGGGCACAGC GTCGAAAAAC 
701 TGCATCAGGC GTTTATCGAT TTCATCCACC GTATGCCCTT CTACGGCAAA 
751 GCCTTTTTGT GTGTTGACAG CGAACACGTC CGCGCGATTT TGCCCAAAGT 
801 GAGCAAACCT TATGCTACTT ACGGTTTGGA CGATACCGCC GACATCTACG 
851 CCACCGACAT CGAAAACGTC GGCGCGCAAA TGAAATTCAC CGTCCATGTT 
901 CAAATGAAAG GACATGAGCA GGGGTCGTTT GAAGTCGTGC TGAATATGCC 
951 CGGCAGACAC AACGTGCTGA ACGCATTGGC AGCCATCGGc gtggcGCTgg 

1001 aagtcGgCGC ATcggtcgAA GCGAtcCAAA AaggCTTGCT CGGCTTTGAA 

1051 GGCGTCGGCC GCCGCTTCCA AAAATAcggc gacatCAagt tgccaaacgg 

1101 cggGaccgCT TTgctGGTGG ACGATTAcgg ACACCACCCC GTCGAAATGG 

1151 CGGcaaccct tgccgcTGCA CGCGGCGCGT ATCCGGAAAA acgtTTGGTG 

1201 CtcgCCTTCC AGCCGCACCG CTATACCCGC ACGCGCGATT TGTTTGAAGA 

1251 CTTTACCAAA GTACTCAATA CCGTTGatgC GCTGGTACTG ACCGAAGTTT 

13 01 AtgccgccgG CGAAGAGCCG GTTGCCGCCG CCGacccCCG CGCCTTGGCG 

13 51 CGTGCTATCC GCGTATTGGG CAAACTTGAG CCGATTTACT GCGAAAatgt 

14 01 cgccgACCTG CCGCAAATGC TGATGAATGT TTTACAGGAT Ggcgatgttg 
1451 tgttgAATAT GggTgcggga agcatcaacc gcgttccttc cgcgctgttg 
1501 gaattgtcga AACAGActtg A 

This corresponds to the amino acid sequence <SEQ ID 310, ORF 092. ng>; 

g092 .pep 

1 MFFISIRYIF VRKLWCANGQ TFKITPLRTK NQPERNIMMK NRVSNIHFVG 

51 IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TSHLSSLGIQ VYPGHTAEHV 

101 NGADVWAST AVKKENPEW AALERQIPVI PRALMLAELM RFRDGIAIAG 

151 THGKTTTTSL TASILGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 

201 ESDASFLKLT PIMSWTNID EDKMDTYGHS VEKLHQAFID FIHRMPFYGK 
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251 AFLCVDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 

301 OMKGHEQGSF EWLNMPGRH NVLNA LAAIG VALEVGASVE AI QKGLLGFE 

3 51 GVGRRFQKYG DIKLPNGGTA LLVDDYGHHP VEMAATLAAA RGAYPEKRLV 
401 LAFQPKRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP VAAADSRALA 

4 51 RAIRVLGKLE PIYCENVADL PQMLMNVLQD GDWLNMGAG SINRVPSALL 
501 ELSKQI * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 31 ]>: 

m092 . seq 

1 ATGTTTTTTA TTTCAATCCG CTATATATTT GTCAGAAAAC TATGGCGCGC 

51 AAA CGGTC AG CCCTTTAAAA TAACGCCTTT ACGCATCGAA AATCCACCGG 

101 AACGCAACAT TATGATGAAA AATCGAGTTA C:\ACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCG^CATGAG CGGCKTCGCC GAAGTCTTGC ACAATTTGGG 

201 CTTTAAAGTT TCCG-oTTCGG ATCAgGCGCG AAAVGCCGCT ACCGAGCATT 

251 TGGGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC CGAACACGTT 

3 01 AACGGTGCGG ATGTCGTCGT TACCTCTACC GCCGTCAAAA AAGAAAATCC 
351 CGAAGTTGTC GCTGCGTTGG AGCAGCAAAT TCCCGTTATT CCGCGCGCCC 

4 01 TGATGTTGGC GGAGTTGATG CGCTTCCGTG ACGGCATCGC CATTGCCGGC 
4 51 ACGCACGGCA AAACCACGAC CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 
501 GGCAGGACTT GACCCGACTT TCGTTATCGG CGGCAAACTC AACGCCGCAG 
551 GCACTAACGC CCGCTTGGGC AAAGGCGAAT ACATCGTTGC CGAAGCCGAC 
6 01 GAGTCGGATG CATCCTTTCT GCACCTGACA CCGATTATGT CCGTCGTTAC 
651 CAATATCGAC GAAGACCATA TGGATACCTA CGGGCACAGC GTCGAAAAAC 
701 TGCATCAGGC GTTTATCGAT TTCATCCACC GTATGCCCTT CTACGGCAAA 
751 GCCTTTTTGT GTATTGACAG CGAACACGTC CGCGCGATTT TGCCCAAAGT 
801 GAGCAAACCT TATGCTACTT ACGGTTTGGA CGATACCGCC GACATCTACG 
851 CCACCGACAT CGAAAACGTC GGCGCGCAAA TGAAATTCAC CGTCCATGTT 
901 CAAATGAAAG GACATGAGCA GGGGTCGTTT GAAGTCGTGC TGAATATGCC 
951 CGGCAGACAC AACGTGCTGA ACGCATTGGC AGCCATCGGC GTGGCGCTGG 

1001 AAGTCGGCGC ATCGGTTGAA GCGATCCAAA AAGGCTTGCT CGGCTTTGAA 

1051 GGCGTCGGCC GCCGCTTCCA AAAATACGGC GACATCAAGT TGCCAAACGG 

1101 CGGGACCGCG CTCTTGGTGG ACGACTACGG ACACCACCCC GTCGAAATGG 

1151 CGGCGhCCCT TGCCGCCGCA CGCGGCGCGT ATCTGGAAAA ACGTTTGGTA 

1201 CTCGCCTTCC AGCCGCACCG CTATACCCGC ACGCGCGATT TGTTTGAAGA 

12 51 CTTTACCAAA GTCCTCAATA CCGTTGACGC GCTGGTGCTG ACCGAAGTTT 

13 01 ATGCCGCCGG TGAAGAGCCG ATTGCCGCCG CCGATTCCCG CGCTCTTGCC 

13 51 CGCGCCATCC GCGTGTTGGG CAAACTCGAG CCGATTTACT GCGAAAACGT 

14 01 TGCCGATCTG CCCGAAATGC TGTTGAACGT TTTGCAGGAC GGCGACATCG 
14 51 TGTTGAATAT GGGCGCGGGA AGCATCAACC GCGTCCCCGC CGCGCTGCTG 
1501 GCATTGTCGA AACAGATTTG A 

This corresponds to the amino acid sequence <SEQ ID 3 12; ORF 092>; 

m092 .pep 

1 MFFISIRYIF VRKLWRANGQ PFKITPLRIE NPPERNIMMK MRVTNIHFVG 

51 IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLGSLGIQ VYPGHTAEHV 

101 NGADVWTST AVKKENPEW AALEQQIPVI PRALMLAELM RFRDGIAIAG 

151 THGKTTTTSL TAS I LGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 

201 ESDASFLHLT PIMSWTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK 

251 AFLCIDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 

3 01 OMKGHEQGSF EWLNMPGRH NVLNALAAIG VALEVGASVE AI QKGLLGFE 
351 GVGRRFQKYG DIKLPNGGTA LLVDDYGHHP VEMAATLAAA RGAYLEKRLV 

4 01 LAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP IAAADSRALA 
4 51 RAIRVLGKLE PIYCENVADL PEMLLNVLQD GDIVLNMGAG SINRVPAALL 
501 ALSKQI* 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from N. gonorrhoeae 

ORF 092 shows 96.6% identity over a 506 aa overlap with a predicted ORF (ORF 092.ng) 
from N. gonorrhoeae: 

m092/g092 

10 20 30 40 50 60 

m092 .pep MFFISIRYIFVRKLWRANGQPFKITPLRIENPPERNIMMKNRVTNIKFVGIGGVGMSGIA 
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NIMIIIIIII" 1 Nil llllill = 1 IliimiiiMimillllllllll 

g092 MFFISIRYIFVRKLWCANGQTFKITPLRTKNQPERNIKMKNRVSNIHFVGIGGVGMSGIA 
10 20 30 40 50 60 



70 80 90 100 110 120 

mO 9 2 . pep EV\HNLGFKVSGSDQARNAATEHLGSLGIQV YPGHTA£HVNGADVVVTSTAVKKENPEVV 

! i i I ! I 1 1 ! I i I ( 1 1 1 1 1 i I i 1 1 1 : 1 1 1 i M 1 1 I I I II i i i i i i 1 1 ! = I ! 1 1 I 1 1 1 1 1 i I 

g 0 9 2 EVLHNLG FKVS GSDQ ARNAATEHLS SLGI QVY PGHTAEHVNGADVW A3 T AVKKENP EW 

70 80 90 100 110 120 



130 140 150 160 170 180 

m0 9 2 .pep AALEQQIPV1PRALMLAELMRFRDGIAIAGTHGKTTTTSLTAS ILGAAGLDPTFVIGGKL 

I I I I ■■ I I I I I I I I I I ( I I i I I I I I I I I I I I I I I I II I I I I i I I I I i I I I I I I I I I I I I I I 
g 0 9 2 AALERQ I PV I PRALMLAELMRFRDG I AI AGTHG KTTTTS LTAS I LGAAGLD PTFVIGGKL 

130 140 150 160 170 180 



190 200 210 220 230 240 

m0 92 . pep NAAGTNARLGKGEYIVAEADESDASFLHLTPIMSWTNIDEDHMDTYGHSVEKLHQAFID 

IIIIIIIIIIIIIIIUIIIIIIIMIIIIIIIIIillllllllllllllllllllllll 

gO 9 2 NAAGTNARLGKGEYIVAEADESDASFLHLTPIMS WTNIDEDHMDTYGHSVEKLHQAFID 

190 200 210 220 230 240 



250 260 270 280 290 300 

m092.pep FI HRM P FYG KAFLC IDS EHVRAI LPKVS KPYATYGLDDTAD I YATDI ENVGAQMKFTVHV 

i(lllllllllll|:!llll!ill!llill!lltlllll!tlll!l!!llllllll!ll! 

g 0 9 2 FI HRMPFYG KAFLCVDS EHVRAI LPKVS KPYATYGLDDTAD I YATDI ENVGAQMKFTVHV 

250 260 270 280 290 300 



310 320 330 340 350 360 

m092 -pep QMKGHEQGSFEVVLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 

ll!lllll!lll(illM>!!llllllII!llll£llilillllllillllllillllll 

g 0 9 2 QMKGHEQGS FEWLNMPGRHNVLNALAAIGVALEVGAS VEAI QKGLLGFEGVGRRFQKYG 

310 320 330 340 350 360 



370 380 390 400 410 420 

mO 9 2 . pep DI KLPNGGTALLVDDYGHHPVEMAATLAAARGAYLEKRLVLAFQPHRYTRTRDLFEDFTK 

MIIIIMMIIMIKIIIilllMNMUII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

gO 9 2 DIKLPNGGTALLVDDYGHHPVEMAATLAAARGAYPEKRLVIiAFQPHRYTRTRDLFEDFTK 

370 380 390 400 410 420 



430 440 450 460 470 480 

m092 . pep VLNTVDALVLTEVYAAGEEPIAAADSRAIARAIRVLGKLEPIYCENVADLPEMLI.WLQD 

IIIIIIIIIIMIMIIIIhlllllllllllllMllilllMIIIMIhlhlllll 

g 0 9 2 VLNTVDALVLTEV YAAGEE P VAAADSRALARA I RVLGKLE P I YCENVADLP QMLMNVLQD 

430 440 450 460 470 480 



490 500 
m0 92 . pep GDIVLNMGAGSINRVPAALLALSKQIX 

ihiiiiiiiiiiiihiii nun 

g0 92 GDWLNMGAGS I NR VP S ALLELS KQI X 

490 500 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 313>: 

a092 .seq 

1 ATGTTTTTTA TTTCAATCCG CTATATATTT GTCAGAAAAC TATGGCGCGC 

51 AAACGGTCAG CCCTTTAAAA TAACGCCTTT ACGCATCGAA AATCCACCGG 

101 AACGCAACAT TAT GAT G AAA AATCGAGTGA CCAACATCCA TTTTGTCGGT 

151 ATCGGCGGCG TCGGCATGAG CGGTATCGCC GAAGTCTTGC ACAATTTGGG 

201 TTTTAAAGTT TCCGGTTCGG ATCAGGCGCG AAATGCCGCT ACCGAGCATT 

2 51 TGGGCAGCCT GGGCATTCAA GTTTATCCCG GCCATACCGC AGAACACGTT 
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MjM o O 1 UV>Uu 


ATnTCGTrnT 


TCI 






■A A 1 




ao l i on i vj 






A A APT ACG AP 


DUi 




P,APPPGA(~TT 


0 ^- I 


cr APP a apc^t 




601 




CATCCTTTCT 


DDI 


pa aTaTrr;zir 

^ /-vrt 1 rt J. ^ o ^ 




7 01 




GTTTATCGAT 


/ J 1 


O^w^ 1 I I I I O 1 


GTATTGACAG 




pappaaappt 




851 


rc a ppp, at at 


CGAAAACGTC 


901 


CAAATGAAAG 


GACATGAGCA 


951 


CGGCAGACAC 


AACGTGCTGA 


1001 


AAGTCGGCGC 


ATCGGTTGAA 


1051 


GGTGTCGGCC 


GCCGCTTCCA 


1101 


TGGAACCGCG 


CTCTTGGTGG 


1151 


CGGCGACCCT 


TTCCGCCGCA 


1201 


CTCGCCTTCC 


AGCCGCACCG 


1251 


CTTTACCAAA 


GTCCTCAATA 


1301 


ATGCCGCCGG 


TGAAGAGCCG 


1351 


CGCGCCATCC 


GCGTGTTGGG 


14 0 1 


TGCCGATCTG 


CCCGAAATGC 


1451 


TGTTGAATAT 


GGGTGCGGGA 


1501 


GAATTGTCGA 


AACAGATTTG 



TACCTCTACC GCCGTCAAAA AAGAAAATCC 
AGCAGCAAAT TCCCGTTATT CCGCGCGCCC 
CGCTTCCGTG ACGGCATCGC CATTGCCGGC 
CACCAGCCTG ACCGCCTCCA TCCTCGGCGC 
TCGTTATCGG CGGCAAACTC AACGCCGCAG 
AAAGGCGAAT ACATCGTTGC CGAAGCCGAC 
GCACCTGACA CCGATTATGT CCGTCGTTAC 
TGGATACCTA CGGGCACAGT GTTGAGAAGC 
TTCATCCACC GTATGCCCTT CTACGGCAAA 
CGAACACGTC CGCGCGATTT TG CCCAAAGT 
ACGGTTTGGA CGATACCGCC GACATCTACG 
GGCGCGCAAA TGAAATTCAC CGTCCATGTT 
GGGGTCGTTT GAAGTCGTGC TGAATATGCC 
ACGCATTGGC AGCCATCGGC GTGGCGCTGG 
GCGATCCAAA AAGGCTTGCT CGGCTTTGAA 
AAAATACGGC GACATCAAGT TGCCAAACGG 
ACGAC1 ACGG ACACCACCCC GTCGAAATGG 
CGCGGCGCGT ATCCGGAAAA ACGTTTGGTA 
CTATACCCGC ACGCGCGATT TGTTTGAAGA 
CCGTTGACGC GCTGGTGCTG ACCGAAGTTT 
ATTGCCGCCG CTGATTCCCG CGCTCTTGCC 
CAAACTCGAG CCGATTTACT GCGAAAACGT 
TGTTGAACGT TTTGCAGGAC GGCGACATCG 
AGCATCAACC GCGTCCCCGC CGCGCTGCTG 
A 



This corresponds to the amino acid sequence <SEQ ID 3 14; ORF 092. a>: 

a092 .pep 

1 MFFISIRYIF VRKLWRANGQ PFKITPLRIE NPPERNIMMK NRVTNIHFVG 

51 IGGVGMSGIA EVLHNLGFKV SGSDQARNAA TEHLGSLGIQ VYPGHTAEHV 

101 NGADVVVTST AVKKENPEVV AALEQQIPVI PRALMLAELM RFRDGIAIAG 

151 THGKTTTTSL TAS ILGAAGL DPTFVIGGKL NAAGTNARLG KGEYIVAEAD 

2G± ESDASFLKLT PIMSWTNID EDHMDTYGHS VEKLHQAFID FIHRMPFYGK 

251 AFLCIDSEHV RAILPKVSKP YATYGLDDTA DIYATDIENV GAQMKFTVHV 

301 QMKGHEQGSF EWLNMPGRH NVLNA LAAIG VALEVGASVE AI QKGLLGFE 

351 GVGRRFQKYG DIKLPNGGTA LLVDDYGHHP VEMAATLSAA RGAYPEKRLV 

4 01 LAFQPHRYTR TRDLFEDFTK VLNTVDALVL TEVYAAGEEP IAAADSRALA 

4 51 RAIRVLGKLE PIYCENVADL PEMLLNVLQD GDIVLNMGAG SINRV PAALL 

501 ELSKQI * 

m092/a092 99.4% identity over a 506 aa overlap 



10 ' 20 30 40 50 60 

m092 .pep MFFISIRYIFVRKLWRANGQPFKITPLRIENPPERNINMKNRVTNIHFVGIGGVGMSGIA 
I i I 1 M i 1 i I I II M i I I I I I II I 1 I i I I I I I i I i I I I I I ! I I f 11 1 ! I it M 1 i I i It t 
a092 MFFISIRYIFVRKLWRA13GQPFKITPLRI EN PPERNIMMKNRVTN I KFVG IGGVGMSGIA 

10 20 30 40 50 60 



70 80 90 100 110 120 

m0 92 .oep EVLHNLGFKVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADWVTST AVKKENPEVV 

t I I I I H I I ! I I I I I I I I I I I I I i I I I II I I I I I I I I I I M I I ! I I I I ! I I I I i I I I II I 
a 092 EVLHNLGFKVSGSDQARNAATEHLGSLGIQVYPGHTAEHVNGADWVTSTAVKKENPEVV 

70 30 90 100 110 120 



130 140 150 160 170 180 

m0 92 .pep AALEQQIPVI PRALMLAELMRFRDGIAIAGTHGKTTTTSLTASILGAAGLDPTFVIGGKL 

I i I I I I M I t I I I I t I f I i I I I i I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I M I I I II 
a 0 9 2 AALEQQIPVI PRALMLAE LMRFRDG I AI AGTHGKTTTT S LTAS I LGAAGLD PTFVI GGKL 

130 140 150 160 170 180 



190 200 210 220 230 240 

m092 . pep NAAGTNARLGKGEYI VAEADESDASFLHLTPIMSVVTNIDEDHMDTYGHSVEKLHQAFID 

M I t I I ( I I I I M I I I ( I I I I i ! t 1 I I i M I ! I i I I i t I I i i I I I i i | I | ! | I I | | I t | ! 
a0 92 NAAGTNARLGKGEYIVAEADESDASFLHLTPIMSVVTNIDEDHMDTYGHSVEKLHQAFI D 
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190 200 210 220 230 240 

250 260 270 280 290 300 

m092 . pep FIHRMPFYG KA FLCIDSEHV RA I L P K V 3 K P Y A T Y G L D P T A D I Y A T D I E N V G A Q M K FT V H V 

t I I I I I ! I I I t I ! I I I I I II i f I I I I I I 1 ( I I I 1 I I J I f I I I i ! II M I I I M I I I I (I I 
a 0 92 FIHRMPFYGKAFLCI DSEHVRAILPKVSKPYATYGLDDTADI YATDIENVGAQMKF TVHV 

250 260 270 280 290 300 

310 320 330 340 3S0 360 

m092 . pep QMKGHEQGSFEVVLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 
I I I I I I II I I t I t I 1 II ! I I t 1 t ) I 1 i 1 II 11 I 1 i i 1 1 ) 1 i 1 I 1 I M i i 1 1 1 I I ! I I ! I I 
a 092 QMKGHEQGSFEVVLNMPGRHNVLNALAAIGVALEVGASVEAIQKGLLGFEGVGRRFQKYG 

310 320 330 340 350 360 

370 380 390 400 410 420 

m092 . pep DIKLPNGGTALLVDDYGHHPVEMAATLAAARGAYLEKRLVLAFQPHRYTRTRDLFEDFTK 
I 1 i 1 I I II I I i I M I I 1 i i I i I i I I M : I I I II I I I II i I ! II I I II I I 1 1 M I I I I ! I 
a0 92 DIKLPNGGTALLVDDYGHHPVEMAATLSAARGAYPEKRLVLAFQPHRYTRTRDLFEDFTK 

370 380 390 400 410 420 

430 440 450 4 60 470 480 

ra092 .pep VLNTVDALVLTEVYAAGEEPIAAADSRALARAIRVLGKLEPIYCENVADLPEMLLNVLQD 
I ! I I I I II I I I i ! I 1 I ! II M I I I I I M i I M I M I I II II I I I I M I I I I 1 I I I I ! M I 
a 0 92 VLNTVDALVLTEVYAAGEEPIAAADSRALARAIRVLGKLEPI YCENVADLPEMLLNVLQD 

430 440 450 4 60 470 430 

4 90 500 
m0 92 .pep GDIVLNMGAGSINRVPAALLALSKQIX 
I I I I I II I I II I I I I I I I 1 I I I I I I I 
3092 GDIVLNMGAGSINRVPAALLELSKQIX 

4 90 500 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 315>: 

g093 . seq 

1 aTGCAGAATt ttgGCAAAGT ggccgtATTG ATGGGtggtT TTTCCAGCGA 

51 ACGAGAaatC tcgcTGGACA GCgGTACCGC CATTTTGAAC GCCTTAAAAA 

101 GCAAAGGCAT AGACGCATAC GCCTTCGACC CTAAGGAAAC GCCGTTATCC 

151 GAACTGAAGG AGCGGGGCTT TCAGACGGCA TTCAACATCC TTCACGGTAC 

201 TTACGGCGAA GACGGGGCTG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

251 CCTATACCGG CAGCGGTGTC GCCGCCTCCG CCATCGGCAT GGACAAATAC 

3 01 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTACCCGTTC CCGAGTTCGC 

351 CGTACTGTAC GATGATACCG ATTTCGATGC CGTCGAAGAA AAATTGGGTC 

401 TGCCGATGTT TGTGAAGCCG GCGGCCGAAG GCAGCAGCg r cggcgtggta 

451 aAAGTCAAAG AAAaaggccg TCTGAAAAGC GTTtacgaag aatTGAaaCA 

501 CCTTcagggg cgaAAtcatc gccgAacgTT TTATCGGCGG CGGCGAATAT 

551 TCCTGCCCCG TCCTGAACGG CAAAGGGCTG CCCGGCATAC ACATCATCCC 

601 CGCAACCGAG TTTTACGAct acgaagccaa GtacaaCCGA GACGAcacca 

651 tttaTCAATG TCCTTCGGAA GATTTGACCG AAGCCGAAGA AAGCCTGATG 

701 CGCGAACTGG CGGTTCGCGG CGCACAGGCA ATCGGTGCGG AAGGCTGCGT 

751 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 

801 TCAACACCCT GCCCGGTATG ACCGGCCATA G 

This corresponds to the amino acid sequence <SEQ ID 3 16; ORF 093. ng>: 

g0 93 . pep 

1 MQNFGKVAVL MGGFSSEREI SLDSGTAILN AIKSKGIDAY AFDPKETPLS 

51 ELKERGFQTA FNILHGTYGE DGAVQGALEL LGIPYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLY DDTDFDAVEE KLGLPMFVKP AAEGSSVGW 

151 KVKEKGRLKS VYEELKHLQG RKHCRTFYRR RRIFLPRPSR QRAARHTHHP 

201 RNRVLRLRSQ VQPRRHKLSM SFGRFDRSRR KPDARTGGSR RTGNRCGRLR 

251 ARRFPQRYRR QTLSVGNQKP ARYDRP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 317>: 

m093 . seq 
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1 ATGCAGAATT TTGGCAAAGT GGCCGTATTG ATGGGCGGTT TTTCCAGCGA 

51 ACGAGAAATC TCGCTGGACA GCGGCACCGC CATTTTGAAT GCTTTAAAAA 

101 GCAAAGGCAT AGACGCATAC GCCTTCGATC CTAAAGAAAC CCCATTGTCT 

151 GAATTGAAGG CACAAGGTTT TCAGACGGCA TTCAACATCC TTCACGGTAC 

201 TTACGGCrAA GACGGGGCGG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

2S1 CCTATACCGG CAGCGGTGTC GCCGCATCCG CCATCGGCAT GGACAAATAC 

3 01 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTGCCCGTTC CCGAGTTCGC 
351 CGTCCTGCAC GACGACACTG ATTTCGATGC CGTCGAAGAA AAATTGGGCC 

4 01 TGCCGATGTT TGTGAAACCG GCGGCCGAAG GCAGCAGCGT AGGCGTGGTA 
451 AAAGTCAAAG GAAAAGGCCG TCTGAAAAGC GTTTACGAAG AATTGAAACA 
501 CCTTCAGGG. CGAAATCATT GCCGAACGT- 1 ^TATCGGCGG CGGCGAATAT 
551 TCCTGCCCCG TTTGAACGG CAAAGGGCTG CCCGGCATAC ACATCATTCC 
601 CGCAACCGAG TTTTACGACT ACGAAGCCAA GTACAACCGC GACGACACCA 
651 TTTATCAATG TCCTTCGGAA GATTTGACGG AAGCCGAAGA AAGCCTGATG 
701 CGCGAACTGG CGGTTCGCGG CGCGCAGGCA ATCGGTGCGG AAGGCTGCGT 
7 51 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 
801 TCAACACCCT GCCCGGTATG ACGAGCCATA G 

This corresponds to the amino acid sequence <SEQ ID 31 8; ORF 093>: 

m09 3 . pep 

1 MQNFGKVAVL MGGFSSEREI SLDSGTAILN ALKSKGIDAY AFDPKETPLS 

51 ELKAQGFQTA FNILHGTYGX DGAVQGALEL LGI PYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLH DDTDFDAVEE KLGLPMFVKP AAEGSSVGW 

151 KVKGKGRLKS VYEELKHLQX RNHCRTFYRR RRI FLPRPER QRAARHTHHS 

2 01 RNRVLRLRSQ VQPRRKHLSM SFGRFDRSRR KPDARTGGSR RAGNRCGRLR 

251 ARRFPQRYRR QTLSVGNQHP ARYDEP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 093 shows 96.7% identity over a 276 aa overlap with a predicted ORF (ORF 093. ng) 
from N. gonorrhoeae: 

m093/g093 

10 20 30 40 50 60 

m093 .pep 
g093 



m093 . pep 
g093 



m093 . pep 
g093 



m093 . pep 
g093 



m093 . pep 
g093 

250 260 270 



MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKAQGFQTA 

llllilllNlllllllllllllllllillllllllllllllllllllllMI Olllt 
MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKERGFQTA 
10 20 30 40 50 60 

70 80 90 100 110 120 

FNILHGTYGXDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLH 

Illllllll IIMMIIMIIIIIIIIIIIIIIIIMMIIMIIIIIMIIIIIIII: 

FNILHGTYGEDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLY 
70 80 90 100 110 12Q 

130 140 150 160 170 180 

DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKGKGRLKSVYEELKHLQXRNHCRTFYRR 

IIIIIIIIIIIIIIIIIIIIIIIIIIIIIMII IMIIIIIMIIIII llllllllll 
DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKEKGRLKSVYEELKHLQGRNHCRTFYRR 
130 140 150 160 170 160 

190 200 210 220 230 240 

RRI FLPRPERQRAARHTHHSRNRVLRLRSQVQPRRHHLSMS FGRFDRSRRKPDARTGGSR 

lllllllllllllllllll lll!llllllllllil!MIIIIIIIIIII!l!IIIIIM 

RRIFLPRPERQRAARHTHHPRNRVLRLRSQVQPRRKHLSMSFGRFDRSRRKPDARTGGSR 
190 200 210 220 230 240 

250 260 270 

RAGNRCGRLRARRFPQRYRRQCLSVGNQHPARYDEPX 
I : I I I I M I I I i I I I I i II I I I I I I I I I I I I I I | : I I 
RTGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDRPX 
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The following partial DNA sequence was identified in /V meningitidis <SEQ ID 319>: 

a0 93 . seq 

1 ATGCAGAATT TTGGCAAAGT GGCCGTATTG ATGGGCGGTT 77TCCAGCGA 

51 ACGAGAAATC TCGCTGGACA GCGGCACCGC CATTTTGAAT GCTTTAAAAA 

101 GCAAAGGCAT AGACGCATAC GCCTTCGATC CCAAGGAAAC CCCATTGTCT 

151 GAATTGAAGG CACAAGGTTT TCAGACGGCA TTCAACATCC TTCACGGTAC 

201 TTACGGCGAA GACGGGGCTG TTCAGGGTGC ATTGGAACTG TTGGGCATTC 

251 CCTATACCGG CAGCGGTGTC GCCGCATCCG CCATCGGCAT GGACAAATAC 

301 CGCTGCAAAC TGATTTGGCA GGCATTGGGA TTGCCCGTTC CCGAGTTCGC 

351 CGTCCTGCAC GACGACACTG ATTTCGATGC CGTCGAAGAA AAATTGGGCC 

4 01 TGCCGATGTT TGTGAAACCG GCGGCCGAAG GCAGCAGC3T AGGCGTGGTA 

4 51 AAAGTCAAAG GAAAAGGCCG TCTGAAAAGC GTTTACGAAG AATTGAAACA 

501 CTTTCAGGG. CGAAATCATT GCCGAACGGT TTATCGGCGG CGGCGAATAT 

551 TCCTGCCCTG TGTTGAACGG CAAAGGCCTG CCCGGCATAC ACATCATCCC 

601 CGCGACCGAG TTTTATGACT ACGAAGCCAA GTACAACCGC AACGACACCA 

651 TTTATCAATG TCCTTCGGAA GATCTGACCG AAGCCGAAGA AAGCCTGATG 

7 01 CGCGAACTGG CGGTTCGCGG CGCGCAGGCA ATCGGTGCGG AAGGCTGCGT 

7 51 GCGCGTCGAT TTCCTCAAAG ATACCGACGG CAAACTCTAT CTGTTGGAAA 

801 TCAACACCCT GCCCGGTATG ACCGGCCATA G 

This corresponds to the ammo acid sequence <SEQ ID 320; ORF 093. a>: 

a093 .pep 

1 MQN FGKVAVL MGGFSSEREI SLDSGTAILN ALKSKGIDAY AFDPKETPLS 

51 ELKAQGFQTA FNILHGTYGE DGAVQGALEL LGIPYTGSGV AASAIGMDKY 

101 RCKLIWQALG LPVPEFAVLH DDTDFDAVEE KLGLPMFVKP AAEGSSVGVV 

151 KVKGKGRLKS VYEELKHFQX RNHCRTVYRR RRI FLPCVER QRPARKTHH? 

2 01 RDRVL* LRSQ VQPQRHHLSM SFGRSDRSRR KPDARTGGSR RAGNRCGRLR 

2 51 ARRFPQRYRR QTLSVGNQHP ARYDRP* 



m093/a093 95.7% identity over a 276 aa overlap 

10 20 30 40 50 60 

m0 93. pep MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKAOGFQTA 
I [ I ! M I I I I ! I i I I I M I I I I i I I I I I M I M I I I I I I I I f I I I I I i i ! I I I I i I 1 i I I 
a0 93 MQNFGKVAVLMGGFSSEREISLDSGTAILNALKSKGIDAYAFDPKETPLSELKAQGFQTA 

10 20 30 40 50 60 



70 80 90 100 110 120 

m0 93 .pep FNILHGTYGXDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLH 
I I I it 1 I I I I I I II II I I I I If I I I \ i I ! I I I I I I ! I I I I I I I I I I I f i I I I I f I I j I I 
a093 FNILHGTYGEDGAVQGALELLGIPYTGSGVAASAIGMDKYRCKLIWQALGLPVPEFAVLH 

70 80 90 100 110 120 



130 140 150 160 170 180 

mO 93 . pep DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKGKGRLKSVYEELKHLQXRNHCRTFYRR 
i i M i I I 1 I M I I I i ! M M M i I II I I I i I i II M 1 II II i W W I : 1 1 \ I i \ \ I Ml 
a 0 93 DDTDFDAVEEKLGLPMFVKPAAEGSSVGWKVKGKGRLKSVYEELKHFQXRNHCRTVYRR 

130 140 150 160 170 180 



190 200 210 220 230 240 

m093 . pep RRIFLPRPERQRAARHTHHSRNRVLRLRSQVQPRRHHLSMSFGRFDRSRRKPDARTGGSR 
I I I I I I III! I I M I I I : I I I I I I I I I I : M I I ! I I I I I I I I I I I I I M I | | I | 
a0 93 RRIFLPCVERQRPARHTHHPRDRVLXLRSQVQPQRHHLSMSFGRSDRSRRKPDARTGGSR 
190 200 210 220 230 240 



250 260 270 

m0 93 . pep RAGNRCGRLRARRFPQRYRRQTLSVGNQHPARYDEPX 
I I I I II I I I I I I I ! I I I I I I I I I I I I I I I I I I I i : I I 
a 0 9 3 RAGNRCGRLRARRFPQR YRRQT LSVGNQH PAR YDR PX 

250 260 270 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 32 1>: 

g094 . sec 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTG GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTGG GGTCGAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTAccggc aatggcgatg 

151 cCGTCACGGA AGCGCATCAG CTCTGCCAGC ATCAAGGCGC GCGGAATAAC 

201 GGGAATTTGC CGCTCCAACG CAgcgacaAC TTCGGgattT TCTTTCTTGA 

251 CGGCGGTAGA GGCAACGACG ACATccgcAC CGTTAACGTG TTCTGCGGTA 

3 01 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 322; ORF 094. ng>: 

g094 . pep 

1 MYSPLFKRAL VPAAL5LPPI TKVGSSPAAP RMEAVRLVW VLPCVPAMA M 
51 PSRKRISSAS IKARGITGIC RSNAATTSGF SFLTAVEATT TSAPLTCSAV 
101 WPG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 323>: 

m094 . seq 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTA GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTCG GGTCAAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTGCCGGC AATGGCGATG 

151 CCGTCACGGA AGCGCATCAA CTCCGCCAAC ATCAGGGCGC GCGGAATAAC 

201 GGGAATTTGC TGCTCCAACG CAGCGACAAC TTCGGGATTT TCTTTTTTGA 

251 CGGCGGTAGA GGTAACGACG ACATCCGCAC CGTTAACGTG TTCGGCGGTA 

3 01 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 324; ORF 094>: 

m0 94 .pep 

1 MYSPLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVW VLPCVPAMA M 
51 PSRKRINSAN IRARGITGIC CSNAATTSGF SFLTAVEVTT TSAPLTCSAV 
101 WPG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 094 shows 95.1% identity over a 103 aa overlap with a predicted ORF (ORF 094.ng) 
from N. gonorrhoeae; 

m094/g094 



10 20 30 40 50 60 

m0 94 . pep MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLVVVVLPCVPAMAMPSRKRINSAN 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II i 1 1 1 i 1 1 1 1 1 1 - 1 1 ■ 

g094 MYSPLPKRALVPAALSLPPITKVGSSPAAPRMEAVRLVVVVLPCVPAMAMPSRKRISSAS 
10 20 30 40 50 60 



70 80 90 100 

m094 .pep IRARGITGICCSNAATTSGFSFLTAVEVTTTSAPLTCSAVVfPGX 

I - I I I I I I I I I I I I I I I I I I I I I i I i : I 1 I I I I I I I I I I I 1 I I 
g 0 9 4 I KARGITG I CRSNAATTSGFSFLTAVEATTTS APLTCS AWPGX 

70 80 90 100 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 325>: 

a094 . seq 

1 ATGTATTCGC CTTTGCCCAA GCGGGCGTTG GTGCCTGCGG CGTTGAGTTT 

51 GCCGCCGATA ACGAAAGTCG GGTCAAGTCC TGCCGCGCCG AGGATGGAGG 

101 CGGTCAGGCT GGTGGTCGTG GTTTTGCCGT GCGTGCCGGC AATGGCGATG 

151 CCGTCACGGA AGCGCATCAA CTCCGCCAAC ATCAGGGCGC GCGGAATAAC 

201 GGGAATTTGC TGCTCCAACG CAGCGACAAC TTCGGGATTT TCTTTTTTGA 

251 CGGCGGTAGA GGTAACGACG ACATCCGCAC CGTTAACGTG TTCTGCGGTA 

301 TGGCCGGGAT AA 

This corresponds to the amino acid sequence <SEQ ID 326; ORF 094.a>: 
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a0 94 . pep 

1 MYS PLPKRAL VPAALSLPPI TKVGSSPAAP RMEAVRLVVV VLPCVPAMA M 

51 PSRKRINSAN I RARGITGI C CSNAATTSGF SFLTAVEVTT TSAPLTCSAV 

101 WPG* 

m094/a094 1 00.0% identity over a 103 aa overlap 

10 20 30 40 50 60 

m094 .pep M Y S P L P KRA L V P .AA LSLPPITKVGSSP AA P RM E A V R L VWV L P C V P AMAM F S R K R I N S Aft 

I I M I I I t ! I II I I i I I I i I I I i I I I I I I I M I f I 1 f I I I I I I , I ! I I I I I I I i I I f i I I 
a 0 9 4 MYSPLPKRALV PAA L S L P ? I T K VG S S P AA PRMEAVRLVV 7 V L PC V P AMAM PSRKRI N S AN 

10 20 30 40 50 60 

70 80 90 100 

raO 94 . pep IRARGITGICCSNAATTSGFSFLTAVEVTTTSAPLTCSAVWPGX 
I I I I I I I i i I ! ! i I I ! I ! I i I I I i I I \ I I I I I I i I I I I I I i M i 
a094 IRARGITGICCSNAATTSGFSFLTAVEVTTTSAPLTCSAVWPGX 

70 80 90 100 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 327>: 

g095 . seq 

1 ATGTCCTTTC ATTTGAACAT GGACGGTGAA TTTCATTTGC GCGCCGACGT 

51 TTTCGATGTC GGTGGCGTAG ATGTCGGCGG TATCGTCCAA ACCGTAAGTA 

101 GCATAAGGTT TGCTCACTTT GGGCAAAATC GCGCGGACGT GTTCGCTGTC 

151 AACACACAAA AAGGCTTTGC CGTAGAAGGG CATACGGTGG ATGAAATCGA 

201 TAAACGCCTG ATGCAGTTTT TCGACGCTGT GCCCGTAGGT ATCCATATGG 

251 TCTTCGTCGA TATTGGTAAC GACGGACATA ATCGGGGTCA GTGTAGGAAA 

3 01 GAGGCATCGG ATCGT CGGCT TCGGCAACGA TGTATTCGCC TTTGCCCAAG 

3 51 CGGGCGTTGG TGCCTGCGGC GTTGA 

This corresponds to the amino acid sequence <SEQ ID 328; ORF 095. ng>: 

g095 .pep 

1 MSFHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSS1RFAHF GQNRADVFAV 

51 NTQKGFAVEG HTVDEIDKR L MQFFDAVPVG IHMVFV PIGN DGHNRGQCRK 

101 EASDRRLRQR CIRLCPSGRW CLRR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 329>: 



m095 . seq 












l 


ATGTCCTTTC 


ATTTGAACAT 


GGACGGTGAA 


TTTCATTTGC 


GCGCCGACGT 


51 


TTTCGATGTC 


GGTGGCGTAG 


ATGTCGGCGG 


TATCGTCCAA 


ACCGTAAGTA 


101 


GCATAAGGTT 


TGCTCACTTT 


GGGCAAAATC 


GCGCGGACGT 


GTTCGCTGTC 


151 


AATACACAAA 


AAGGCTTTGC 


CGTAGAAGGG 


CATACGGTGG 


ATGAAATCGA 


201 


TAAACGCCTG 


ATGCAGTTTT 


TCGACGCTGT 


GCCCGTAGGT 


ATCCATATGG 


251 


TCTTCGTCGA 


TATTGGTAAC 


GACGGACATA 


ATCGGTGTCA 


GTGCAGAAAG 


301 


GATGCATCCG 


ACCGTCGGCT 


TCGGCAACGA 


TGTATTCGCC 


TTTGCCCAAG 


351 


CGGGCGTTAG 


TGCCTGCGGC 


GTTGA 







This corresponds to the amino acid sequence <SEQ ID 330; ORF 095>; 

m0 95 .pep 

1 MSFHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAKF GQNRADVFAV 

51 NTQKGFAVEG HTVDEIDKR L MQFFDAVPVG IHMVFV PIGN DGHNRCQCRK 

101 DASDRRLRQR CIRLCPSGRX CLRR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 095 shows 97.6% identity over a 124 aa overlap with a predicted ORF (ORF 095.ng) 
from N. gonorrhoeae: 

m095/g095 

10 20 30 40 50 60 

mO 95 . pep MSFHLNMDGEFHLRADVFDVGGVDVGGIVOTVSSIRFAHFGONRADVFAVNTQKGFAVEG 

lllililllllllllllllMIIIIIIKIIIIIIIUiilllllillllllllillMI 
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g0 95 MSFHLNMDGEFKLRADVFDVGGVDVGGI VQTV3S I RFAKFGQNRADVFAVNTQKGFAVEG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m0 95 pep htvdeidkrlmqffdavpvgihmvfvdigndgknrcqcrkdasdrrlrorcirlcpsgrx 
. ■ : ; I :l : II:- 1 

g0 9 5 HTVDEIDKRLMQFFDAVPVGIHMVFVDIGNDGKNRGQCRKEASDRRLRORCIRLCPSGRW 
70 80 SO 100 110 120 



m0 9 5.pep CLRRX 

Mill 

g095 CLRRX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 33 1>: 

a095 . seq 

1 ATGTCCTTTC ATTTGAACAT GGACGGTGAA TTTCATTTGC GCGCCGACGT 

51 TTTCGATGTC GGTGGCGTAG ATGTCGGCGG TATCGTCCAA ACCGTAAGTA 

101 GCATAAGGTT TGCTCACTTT GGGCAAAATC GCGCGGACGT GTTCGCTGTC 

151 AATACACAAA AhGGCTTTGC CGTAGAAGGG CATACGGTGG ATGAAATCGA 

201 TAAACGCCTG ATGCAGCTTC TCAACACTGT GCCCGTAGGT ATCCATATGG 

2 51 TCTTCGTCGA TATTGGTAAC GACGGACATA ATCGGTGTCA GTGCAGAAAG 

301 GATGCATCCG ACCGTCGGCT TCGGCAACGA TGTATTCGCC TTTGCCCAAG 

351 CGGGCGTTGG TGCCTGCGGC GTTGA 

This corresponds to the amino acid sequence <SEQ ID 332; ORF 095. a>: 

a095.pep 

1 MSFHLNMDGE FHLRADVFDV GGVDVGGIVQ TVSSIRFAHF GQNRADVFAV 

51 KTQKGFAVEG HTVDEIDKRL MQLLNTVPVG IHMVFVDIGN DGHNRCQCRK 

101 DASDRRLRQR CIRLCPSGRW CLRR* 

m095/a095 96.0% identity in 124 aa overlap 

10 20 30 40 50 60 

m0 95 pep MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG 
I | | ! ! | | i I | I I I I I I I I I I I I I! I I I I I t I t I i I M I t I I t I I I t ! ! I I I I M i I I II I 
a0 95 MSFHLNMDGEFHLRADVFDVGGVDVGGIVQTVSSIRFAHFGQNRADVFAVNTQKGFAVEG 

10 20 30 40 50 60 

70 30 90 100 110 120 

m095 . pep HTVDEIDKRLMQFFDAVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRX 

[ j | I I I I I I I I I :::: I I I I I I ! I I 1 I II I I I I I 1 I I I I I t I 1 I I i I I I I I I I I ! I I t I 
a0 95 HTVDEIDKRLMQLLNTVPVGIHMVFVDIGNDGHNRCQCRKDASDRRLRQRCIRLCPSGRW 
70 80 90 100 110 120 



m0 95.pep CLRRX 
I 1 I I I 

a095 CLRRX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 

333>: 

g096 . seq 

1 ATGGCCGGTC ATACCGGGCA GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 

51 CGTCGGTATC TTTGAGGAAA TCGACGCGCA CGCAGCCTTC CGCACCGATT 

101 GCCTGTGCGC CGCGAACCGC CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 

151 GGTCAAATCT TCCGAAGGAC ATTGAtaaat ggtgTCGTCT CGGttgtaCt 
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201 tggcttcgta gTC~7AAAAC TCGGTTGCGG GGATGATGTG TATGCCGGGC 

251 AGCCCTTTGC CGTTCAGGAC GGGGCAGGAA TATTCGCCGC CGCCGATAAA 

301 AcgtTcggca atgaTTtcgc ccctgAAGGT GttTCAattc ttcgtaAACG 

351 CTTTTCAGAc ggccttTTTC TTTGA 



This corresponds to the amino acid sequence <SEQ ID 334; ORF 096. ng>: 

g096 .pep 

1 MAGHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLCAANR QFAHQAFFGF 

51 GQI FRRT LIN GVVSVVLGFV VVKL GCGDDV YAGQPFAVQD GAGIFAAADK 

101 TFGNDFAPEG VSILRKRFSD GLFL* 

The following partial DNA sequence was identified in A. meningitidis <SEQ ID 335>: 



m096. 



seq 

1 
51 
101 
151 
201 
2 51 
301 
351 



ATGGCTCGTC 
CGTCGGTATC 
GCCTGCGCGC 
GGTCAAATCT 
TGGCTTCGTA 
AGCCCTTTGC 
ACGTTCGGCA 
CTTTTCAGAC 



ATACCGGGCA 
TTTGAGGAAA 
CGCGAACCGC 
TCCGAAGGAC 
GTCGTAAAAC 
CGTTCAGGAC 
ATGATTTCGC 
GGCCTTTTCC 



GGGTGTTGAT 
TCGACGCGCA 
CAGTTCGCGC 
ATTGATAAAT 
TCGGTTGCGG 
GGGGCAGGAA 
CC . TGAAGGT 
TTTGA 



TTCCAACAGA 
CGCAGCCTTC 
ATCAGGCTTT 
GGTGTCGTCG 
GAATGATGTG 
TATTCGCCGC 
GTTTCAATTC 



TAGAGTTTGC 
CGCACCGATT 
CTTCGGCTTC 
CGGTTGTACT 
TATGCCGGGC 
CGCCGATAAA 
TTCGTAAACG 



This corresponds to the amino acid sequence <SEQ ID 336; ORF 096>: 

m096 .pep 

1 MARHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLRAANR QFAHQAFFGF 

51 GQI FRRT LIN GVVAVVLGFV VVKL GCGNDV YAGQPFAVQD GAGIFAAADK 

101 TFGNDFAXEG VSILRKRFSD GLFL* 



m096/g096 96.0% identity 



124 aa overlap 



10 20 30 40 50 60 

mO 9 6 . pep MARHTGQGVDFQQI E FAVG I FEE I DAHAAFRT DCLRAANRQFAHQAFFG FGQI FRRTL I N 

II ! I I I ! I I f I M II I II I I M I t I I I I I M I I I I I I I f I I 1 ! II I I M I I I I ! I I M 
g096 MAGHTGQGVDFQQIE FAVG I FEEI DAHAAFRT DCLCAANRQFAHQAFFG FGQI FRRTL IN 

10 20 30 40 50 60 



70 80 90 100 110 120 

mO 9 6 . pep GVVAVVLG FVWKLGCGN DVYAGQPFAVQDGAG I FAAADKT FGNDFAXEGVS I LRKRFSD 

I M : I i I I I ! 1 I I I I I I : I M I I i I t ! I! I I I I I 1 ! IS M i I I 1 I I I M I I I I I II I I I 
g096 GWSWLG FVWKLGCGDDVYAGQPFAVQDGAG I FAAADKT FGNDFAPEGVS I LRKRFSD 

70 80 90 100 110 120 



m096.pep GLFLX 
Mill 

g096 GLFLX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 337>: 
a096.seq 

1 ATGGCCGGTC ATACCGGGCA GGGTGTTGAT TTCCAACAGA TAGAGTTTGC 

51 CGTCGGTATC TTTGAGGAAA TCGACGCGCA CGCAGCCTTC CGCACCGATT 

101 GCCTGCGCGC CGCGAACCGC CAGTTCGCGC ATCAGGCTTT CTTCGGCTTC 

151 GGTCAGATCT TCCGAAGGAC ATTGATAAAT GGTGTCGTTG CGGTTGTACT 

201 TGGCTTCGTA GTCATAAAAC TCGGTCGCGG GGATGATGTG TATGCCGGGC 

2 51 AGGCCTTTGC CGTTCAACAC AGGGCAGGAA TATTCGCCGC CGCCGATAAA 

301 CCGTTCGGCA ATGATTTCGC CCT . GAAAGT GTTTCAATTC TTCGTAAACG 

351 CTTTTCAGAC GGCCTTTTCC TTTGA 

This corresponds to the amino acid sequence <SEQ ID 338; ORF 096. ng>: 
a0 96 . pep 

1 MAGHTGQGVD FQQIEFAVGI FEEIDAHAAF RTDCLRAANR QFAHQAFFGF 
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51 GQ I FRRT LIN GVVAWLGFV VIKL GRGDDV YAGQAFAVQH RAGI FAAADK 
101 PFGNDFAXES VSILRKRFSD GLFL * 



m096/a096 92.7° 6 identity in 124 aa overlap 

10 20 30 40 50 60 

mO 9 6 . pep MARHTGQG V D FQQIE F AVG I FEE I DAHAA FRT DC LRAANRQFAHQA F FGFGQI FRRT LIN 

I | I | I | | M 1 1 I I M i ! 1 i I ! t I I I I 1 ! i 1 I i II I i 1 ) i t i i i i I 1 I 1 I i I I I i I I 1 ! I 
a096 MAGHTGQGVDFQQIEFAVGIFEEIDAHAAFRTDCLRAANRQFAHQAFFGFGQI FRRTLIN 

10 20 30 40 50 60 



70 80 3. 100 110 120 

mO 9 6 . pep GVVAWLG FVVVKLGCGNDV YAGQP FAVQDGAG I FAAADKT FGN DFAXEGVS I LRKRFS D 

M I I I I I I I I I : I I i f : I I I I I I I I I I I I I I I I I I I I I I I ! I I I : I I I I I I I I I I 
a0 96 GVVAWLG FVVIKLGRGDDVYAGQAFAVQHRAG I FAAADKP FGN D FAXES VS I LRKRFS D 

70 80 90 100 110 120 



m096.pep GLFLX 
Mill 



a096 GLFLX 



The following partial DNA sequence was identified in A : . gonorrhoeae <SEQ ID 339>: 

g097 . seq 

1 ATGGATATTT CAAAACAAAC ATTGCTGGAT AGGG77TTTA ACCTGAAGGC 

51 AAACGGTACG ACGGTACGTA CCGAGTTGAT GGCGGGTTTG ACGACCTTTT 

101 TGACGATGTG CTACATCGTT ATCGTCAA'IC CCCTGATTTT GGGCGAGACC 

151 GGAATGGATA TGGGGGCGGT ATTCGTCGCT ACCTGTATCG CATCCGCCAT 

201 CGGCTGTTTT GTCATGGGTT TTATCGGCAA CTATCCGATT GCGCTTGCCC 

251 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG 

3 01 GGCGTGCCTT GGCAGGTGGC GTTGGGTGCG GTGTTCATTT CCGGTCTGAT 

3 51 TTTCATCCTG TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC 

4 01 TGCCTATGGG TTTGAAAATG TCGATTGCCG CCGGTATCGG TTTGTTTTTG 
4 51 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC 
501 CTTGGTCGGC TTGGGCGATA TTCATCAGCC CAGCGCACTG TTGGCATTGT 
551 TCGGTTTTGT CATGGTGGTC GTATTGGGGT ATTTCCGCGT TCAAGGCGCA 
601 ATCATCATCA CCATTCTGAC GATTACCGTC ATTGCCAGCC TGATGGGTTT 
651 GAACGAGTTT CACGGCGTGG TCGGCGAAGT ACCGGGCATT GCGCCGACCT 
701 TTATGCAGAT GGATTTTAAA GGTCTGTTTA CCGTCAGCAT GGTCAGCGTG 
751 ATTTTCGTCT TCTTCTTGGT CGATTTGTTC GACAGTACCG GAACGCTGGT 
801 CGGCGTATCC CACCGTGCCG GACTGCTGGT GGACGGTAAG CTGCCCCGCC 
851 TGAAACGCGC ACTGCTTGCA GACTCTACCG CCATTGTGGC AGGTGCGGCT 
901 TTGGGTACTT CTTCAACCAC GCCTTATGTG GAAAGCGCGG CGGGCGTATC 
951 GGCAGGCGGA CGGACCGGCC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1001 TGGCGTGTCT GATGTTCTCC CCATTGGCGA AAAGTGTTCC GGTATTTGCC 

1051 ACCGCGCCCG CACTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 

1101 GGACATTGAT TGGGACGATA TGACTGAAGC CGCGCCCGCG TTCCTGACCA 

1151 TTGTCTTCAT GCCGTTTACC TATTCGATTG CAGACGGCAT CGCCTTCGGC 

12 01 TTCATCAGCT ATGCCGTGGT CAAACTTTTG TGTCGCCGGA CTGGGGACGT 
1251 GCCGCCTATG GTATGGGTTG TTGCCGTATT GTGGGCATTG AAATTCTGGT 

13 01 ATTTGGGCTG A 

This corresponds to the amino acid sequence <SEQ ID 340; ORF 097.ng>: 

g097 . pep 

1 MDISKQTLLD RVFNLKANGT TVRTE LMAGL TTFLTMCYIV IV NPLILGET 

51 GMDMG AVFVA TCIASAIGCF VM GFIGNYPI ALAPGMGLNA YFTFAWKGM 

101 GVPWQ VALGA VFISGLIFIL FS FFKVREML VNALPMGL-KM SIAAGIGLFL 

151 ALISLKGAGI IV ANPATLVG LGDIHQPS AL LALFGFVMW VLGYF RVQGA 

201 IIITILTITV IASLMGL NEF HGWGEVPGI APTFMQMDFK GLFTVSMVSV 

251 IFVFFLV PLF DSTGTLVGVS HRAGLLVDGK LPRLKRALLA DSTAIVAGAA 

301 LGTSSTTPYV ESAAGVSAGG RT GLTAVTVG VLMIACLMF S PLAKSVPVFA 

351 TAPALLYVGT QML.^SARDID WDDMTEAAPA FLTI VFMPFT YSIADGIAFG 
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4 01 F1SYAWKLL CRRTGDVPPM VWWAVLWAL KFWYLG * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 34 1>: 

m097 . seq 

1 ATGGACACTT CAAAACAAAC ACTGTTGGAC GGGATTTTTA AGCTGAAGGC 

51 AAACGGTACk ACGGTGCGTA CCGAGTTGAT GGCGGGTTTG ACAACTTTTT 

101 TGACGATGTG CTACATCGTT ATCGTCAACC CTCyGATTTT GGGCGAGACC 

151 GGCATGGATA TGGGGGCGGT ATTCGTCGCT ACCTGTATCG CGTCTGCCAT 

201 CGGCTGTTTT GTTATGGGTT TTGTCGGCAA CTATCCGATT GCACTCGCAC 

251 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG 

301 GGCGTGCCTT GGCAGGTTGC GTTGGGTGCG GTGTTCATCT CCGGTCTGAT 

3 51 TTTTATCCTG TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC 

4 01 TGCCTATGGG TTTGAAAATG TCGATTGCTG CCGGTATCGG TTTGTTTTTG 

4 51 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC 

5 01 CTTGGTCGGT TTGGGCGATA TTCATCAGCC GTCCGCGTTG TTGGCATTGT 
551 TCGGTTTTGC TATGGTGGTC GTATTGGGAC ATTTCCGCGT TCAAGGCGCA 
601 ATCATCATCA CCATCTTGAC CATTACCGTC ATTGCCAGCC TGATGGGTTT 
651 GAATGAATTT CACGGCATCA TCGGCGAAGT ACCGAGCATT GCGCCGACTT 
701 TTATGCAGAT GGATTTTGAA GGCCTGTTTA CCGTCAGCAT GGTCAGTGTG 
751 ATTTTCGTCT TCTTCTTGGT CGATCTATTT GACAGTACCG GAACGCTGGT 
801 CGGCATATCC CACCGTGCCG GGCTGCTGGT GGACGGTAAG CTGCCCCGCC 
851 TGAAACGCGC ACTGCTTGCA GACTCTACCG CCATTGTGGC AGGTGCGGCT 
901 TTGGGTACTT CTTCCACCAC GCCTTATGTG GAAAGCGCGG CGGGCGTATC 
951 GGCAGGCGGA CGGACCGGCC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1001 TCGCCTGCCT GATGTTTTCA CCTTTGGCGA AAAGTGTTCC CGCTTTTGCC 

1051 ACCGCGCCCG CCCTGCTTTA TGTCGGCACG CAGATGCTCC GCAGTGCGAG 

1101 GGATATTGAT TGGGACGATA TGACGGAAGC CGCACCTGCG TTCCTGACCA 

1151 TTGTTTTCAT GCCGTTTACT TATTCGATTG CAGACGGCAT CGCTTTCGGC 

1201 TTCATCAGTT ATGCCGTGGT TAAACTTTTA TGCCGCCGCA CCAAAGACGT 

12 51 TCCGCCTATG GTATGGATTG TTGCCGTATT GTGGGCACTG AAATTCTGGT 

13 01 ATTTGGGCTG A 

This corresponds to the amino acid sequence <SEQ ID 342; ORF 097>: 

m0 97 .pep 

1 MDTSKQTLLD GI FKLKANGT TVRTS LMAGL TTFLTMCYIV IV KPXILGET 

51 GMDMG AVFVA TCIASAIGCF VM GFVGNYPI ALAPGMGLNA YFTFAWKGM 

101 GVPWQ VALGA VFISGLIFIL FS FFKVREML VNALPMGLKM SIAAG IGLFL 

151 ALISLKGAGI IV AMPATLVG LGDIHQPS AL LALFGFAMW VLGKF RVQGA 

2 01 IIITILTITV IASLMGL NEF HGIIGEVPSI APTFMOMDFE GLFTVSMVSV 
251 IFVFFLV DLF DSTGTLVGIS HRAGLLVDGK LPRLKRALLA DSTAIVAGAA 

3 01 LGTSSTTPYV ESAAGVSAGG RT GLTAVTVG VLMLACLMF S PLAKSVPAFA 
3 51 TAPALLYVGT QMLRSARDID VfDDMTEAAPA FLTIVFMPFT YSIADGIAFG 
401 FISYAWKLL CRRTKDVPPM VWIVAVLWAL KFWYLG * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 097 shows 96.3% identity over a 436 aa overlap with a predicted ORF (ORF 097.ng) 
from N. gonorrhoeae: 

m097/g097 

10 20 30 40 50 60 

m097 .pep MDTSKQTLLDGIFKLKANGTTVRTELMAGLTTFLTMCYIVIVNPXILGETGNIDMGAVFVA 

1 1 IMIIII :hMIIMIIIMIimil!IMIIIIIIII IIIIIIIIIIMMI 

g0 9 7 MDISKQTLLDRVFNLKANGTTVRTELMAGLTTFLTMCYIVIVNPLILGETGMDMGAVFVA 
10 20 30 40 50 60 

70 80 90 100 110 120 

m0 97 . uep TC I ASAI GC FVMG FVGN Y P I ALAPGMGLNAY FTF A WKGMG VP WQV ALGAVF I SGL I F I L 

inilllllllllhllMIIIIIIIIIIMIIIIIIMIlMIMIIIMIllllllll 

g097 TCIASAIGC FVMG FIGNYPI ALAPGMG LN AY FT F AWKGMGV P WQV ALGAV FISGLIFIL 

70 80 90 100 110 120 

130 140 150 150 170 180 
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m0 97 . pep FS F FKVREMLVNALPMGLKMS I AAG I GLFLAL I 3LKGAG 1 1 VANPATLVGLGD I HQPS AL 

IIIIMIIIIIIIIIIIIIIIIMMIIIIIIIMIIIIIIIIIIIIMIIIMIIIIII 

g0 97 FSF FKVREMLVNALPMGLKMS I AAG I GLFLAL I 3 LKGAG I I VANPATLVGLGD I HQ PS AL 

130 140 150 160 170 ISO 



190 200 210 220 230 240 

mO 9 7 . pep LAL FG FAMWVLGH FRVQGAI 1 1 T I LT I TV I AS LMGLNE FHG 1 1 GEVPS I APTFMQMDFE 

MMI|:illlllM!IIIIMIIIIIII!IIIMIM!ll|::||lhlEIIIIII!|: 

g0 9 7 LAL FGFVMVWLGY FRVQGAI I ITILTITVIASLMGLNEFHGWGEVPGIAPTFMQMDFK 

190 200 210 220 230 240 



250 260 2';0 280 290 300 

mO 9 7 . pep GLFTVSMVSVI FVFFLVDLFDSTGTLVGI SKrAGLLVDGKLPRLKRALLADSTAIVAGAA 

MMIIIMIIIIIIIIMIIIMIIIhlllllllllllllllllllMIIIIIIIIII 

gO 9 7 GLFTVSMVSVI FVFFLVDLFDSTGTLVGVSHRAGLLVDGKLPRLKRALLADSTA I VAGAA 

250 260 270 280 290 300 



310 320 330 340 350 360 

mO 97 . pep LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT 

MIIIIIIIIIIIMIIIMIIMIilllMlllllllllllllllhlfllllllllll 

gO 9 7 LGTSSTTPrvESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPVFATAPALLYVGT 

310 320 330 340 350 360 



370 380 390 400 410 420 

m0 9 7 . pep QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSIADGIAFGFISYAWKLLCRRTKDVPPM 

IIIIMIIIIIIIIIIIIIIMIMIIIMillMIIMMllillllllllll 

qO 9 7 QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYSI ADGI AFGFISYAWKLLCRRTGDVPPM 

370 380 390 400 410 420 



430 

m097.pep VW I VAVLWAL KFW Y LGX 

I I : I I I I I I i I I I I I I I 
g097 VWWAVLWALKFWYLGX 
430 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 343> 

a0 97 . seq 

1 ATGGACACTT CAAAACAAAC ACTGTTGGAC GGGATTTTTA AGCTGAAGGC 

51 AAACGGTACG ACGGTGCGTA CCGAGTTGAT GGCGGGTTTG ACAACTTTTT 

101 TGACGATGTG CTACATCGTT ATCGTCAACC CTCTGATTTT GGGCGAGACC 

151 GGCATGGATA TGGGGGCGGT ATTCGTCGCT ACCTGTATCG CGTCTGCCAT 

201 CGGCTGTTTT GTTATGGGTT TTGTCGGCAA CTATCCGATT GCACTCGCAC 

2 51 CGGGGATGGG GCTGAATGCC TATTTCACCT TTGCCGTCGT TAAGGGTATG 
301 GGCGTGCCTT GGCAGGTTGC GTTGGGTGCG GTGTTCATCT CCGGTCTGAT 

3 51 TTTCATCCTG TTCAGCTTTT TTAAAGTCAG GGAAATGCTG GTCAACGCAC 

4 01 TGCCTATGGG TTTGAAAATG TCGATTGCTG CCGGTATCGG TTTGTTTTTG 
4 51 GCACTGATTT CCCTGAAAGG CGCAGGCATT ATCGTTGCCA ATCCGGCAAC 
501 CTTGGTCGGC TTGGGCGATA TTCATCAGCC GTCCGCGTTG TTGGCACTGT 
551 TCGGTTTTGC CATGGTGGTC GTATTGGGAC ATTTCCGCGT TCAAGGCGCA 
601 ATCATCATCA CCATTTTGAC GATTACCGTC ATTGCCAGCC TGATGGGTTT 
651 GAACGAATTT CACGGCATCA TCGGCGAAGT GCCGAGCATT GCGCCGACTT 
701 TTATGCAGAT GGATTTTAAA GGGTTGTTTA CCGTCAGCAT GGTCAGCGTG 

7 51 ATTTTCGTCT TTTTCCTAGT CGATCTGTTC GACAGTACCG GAACACTGGT 
801 CGGTGTATCG CATCGTGCCG GACTGCTGGT GGACGGTAAG CTGCCCCGCC 

8 51 TGAAACGCGC ACTGCTTGCA GACTCTACCG CTATTGTGGC AGGTGCGGCT 
901 TTGGGTACTT CTTCAACCAC GCCTTATGTG GAAAGTGCGG CGGGCGTATC 
951 GGCAGGCGGG CGGACAGGTC TGACGGCGGT TACCGTCGGC GTATTGATGC 

1001 TCGCCTGCCT GATGTTTTCA CCTTTGGCGA AAAGTGTTCC CGCTTTTGCC 

1051 ACCGCGCCCG CCCTGCTTTA TCGGCACG CAGATGCTCC GCAGTGCGAG 

1101 GGACATCGAT TGGGACGATA TGACGGAAGC CGCACCCGCA TTCCTGACCA 

1151 TTGTCTTCAT GCCGTTTACC TATTCGATTG CAGACGGCAT CGCTTTCGGC 

1201 TTCATCAGTT ATGCCGTGGT TAAACTTTTA TGCCGCCGCA CCAAAGACGT 

12 51 TCCGCCTATG GTATGGATTG TTGCCGTATT GTGGGCACTG AAATTCTGGT 
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1301 ATTTGGGCTG A 

This corresponds to the amino acid sequence <SEQ ID 344; ORF 097. a>: 

a0 97 .pep 

mdtskqtlld gifklkangt tvrte lmagl ttfltmcyiv 
gmdmg avfva tciasaigcp vm gfvgnypi 
gvf::qvalga vfisglifil fsffkvreml 



1 

51 
101 
151 
20] 
251 
301 
351 
401 



ALISLKGAGI IV ANPATLVG LGDIHQPSA 



ALAPGMGLNA 
VNALPMGLKM 
LALFGFAMVV 



IVNPLILGET 
YFTFAVVKGM 
SI AAGIGLFL 



VLGHF RVQGA 



IIITILTITV IASLMGLNEF 



TFVFFLVDLF 
LGTSSTTPYV 
TAPALLYVGT 
FISYAVVKLL 



DSTGTLVGVS 
ESAAGVSAGG 
QMLRSARDID 
CRRTKDVPPM 



HGIIGEVPSI 
HRAGLLVDGK 
RTGLTAVTVG 



APTFMQMDFK 
LPRLKRA1LA 
VLMLACLMFS 



WDDMTEAAPA 
VWIVAVLWAL 



FLTI VFMPFT 
KFWYLG * 



GLFTVSMVSV 
DSTAIVAGAA 
PLAKSVPAFA 
YSIADGIAFG 



m097/a097 



99.3% identity in 436 aa overlap 



10 20 30 40 50 60 

m0 97 . pep MDTSKQTLLDGI FK LKANGTTVRTELMAGLTTFLTMCY I VI VNPXILGETGMDMGAVFV A 

I 1 I I I I I i I I I M M 1 1 I I M I 1 I I I I I I I M i i I I M I I I I i I I i I I ! I I I I ] ! I 1 I | 
a097 MDTSKQTLLDGI FKLKANGTTVRTELMAG LTTFLTMCYIVIVNPLILGETGMDMGAVFVA 

10 20 30 40 50 60 

70 80 90 100 110 120 

m097 . pep TCIASAIGCFVMGFVGNYPIALAPGMGLNAYFT FAWKGMGVPWQVALGAVFISGLIFIL 

I I I I I I i I i t I I I I M I I I I II I M I I i I I I I I I I 1 I I ! ! I t M M M I I ! I I i I M I I I 
a 097 TCI AS AIGCFVMGFVGNYP I ALAPGMGLNAY FT FAWKGMGVPWQVALGAVFISGLIFIL 

70 80 90 100 110 120 

130 140 150 160 170 ISO 

m097 . pep FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGI IVAN PATLVGLGDIHQPSAL 

I I I ! I I I I I I i I I I I I I t I I I I f I I II I I I I I ! i I 1 M I I I I I ! I I I I M | | If | | | || | 
a0 97 FSFFKVREMLVNALPMGLKMSIAAGIGLFLALISLKGAGI IVAN PATLVGLGDIHQPSAL 

130 140 150 160 170 180 

190 200 210 220 230 240 

m097 .pep LALFGFAMWVLGHFRVQGAI I ITILTITVI ASLMGLNEFHGI IGEVPSIAPTFMQMDFE 

I I I I I I I 1 I I I I I I I I I I I I I I I 1 I I I i I I I I I I II II i ! I I I I ! I i II M M I I i I ! I : 
a0 97 LALFGFAMWVLGHFRVQGAI I ITILTITVI ASLMGLNEFHGI I GEV PS I APTFMQMDFK 

190 200 210 220 230 240 

250 260 270 280 290 300 

m097 . pep GLFTVSMVSVIFVFFLVDLFDSTGTLVGI SHRAGLLVDGKLPRLKRALLADSTAI VAGAA 

M I M I t I I I f I I I I 1 I I I I It I : i I I I I ! || 1 | | | | | | | | | | | | M | | | | i | | | 

a0 97 GLFTVSMVSV I FVFFLVDLFDSTGTLVGVSHRAGLLVDGKLPRLKRALLADSTAI VAGAA 

250 260 270 280 290 300 

310 320 330 340 350 360 

m097 . pep LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT 
I i I I I I M I I 1 I I M i I I I i I I I I I I I I I I ! I I I | I | | || | | j M ! M I I I I I I I I | | | | 
a097 LGTSSTTPYVESAAGVSAGGRTGLTAVTVGVLMLACLMFSPLAKSVPAFATAPALLYVGT 

310 320 330 340 350 360 

370 380 390 400 410 420 

m097 . pep QMLRSARDIDWDDMTEAAPAFLTIVFMPFTYS IADGIAFGFISYAWKLLCRRTKDVPPM 

I M I I M I I I I I I I I I I I I M I I I II I ! 1 I I I I I I I I I I I I I I M i | I I M I I I I I I I i | 
a0 97 QMLRSARD I DWDDMTEAAPAFLT I VFMPFT YS IADGIAFGFISYAWKLLCRRTKDVPPM 

370 380 390 400 410 " 420 



m097 . pep 
a097 



430 

VWIVAVLWALKFWYLGX 
I I I I I I I I I I I I I M I I 
VWIVAVLWALKFWYLGX 
430 
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The following partial DNA sequence was identified in /V. gonorrhoeae <SEQ ID 345>: 

c098 . seq 

1 ATGACCGCCG ACGGTCTCTT CGTCGCTTTC AACTTCAATA CGTTTGCCGT 

51 TGTGCGAATA TTGATACCAG TACAGCAGGA TGCTGCCCAG GCTGGCGATC 

101 AGTTTGTCGG CGATGTCGCG CGCTTCGCTG TCGGGATGGC TTTCGCGTTC 

151 GGGATGAACG CAGCCGAGCA TGGACACGCC GGTACGCATC ACGTCCATCG 

201 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

251 AGGCCGCGCA TGGATTTGAG CTTGGTTTTA TAAGCGGCCA GCTCGAATTT 

301 GTTGGGCAGA TGGCCGTGAA TCAGCAAG7G TGCGACTTCT TCAAACTCGC 

3 51 ATTTTTGTGC CAAATTAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 346; ORF 09S.ng>: 

g098 . pep 

1 MTADGLFVAF NFN T FAWRI LIPVQQDAAQ AGDQFVGDVA RFAVGMAFAF 

51 GMNAAEHGKA GTHHVHRMGM CRQAFQNFNK TDRQAAKGFE LGFISGQLEF 

101 VGQMAVNQQV CDFFKLAFLC 01 RMS* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 347>: 



m098 . seq 












1 


ATGACCGCCG 


ATGGTCTCTT 


CGTCGCTTTC 


AACCTCAATG 


CGTTTGCCGT 


51 


TGTGCGAATA 


TTGATACCAG 


TACAAGAGGA 


TGCTGCCGAG 


GCTGGCGATC 


101 


AGTTTGTCGG 


CGATGTCGCG 


CGCTTCACTT 


TCCGGATGGC 


TTTCACGTTC 


151 


AGGATGAACG 


CAGCCCAGCA 


TGGATACGCC 


GGTACGCATT 


ACGTCCATCG 


201 


GATGGGTATG 


TGCAGGCAGG 


CTTTCCAAAA 


CTTTAATCAC 


ACGGATAGGC 


251 


AGGCCGCGCA 


TGGATTTGAG 


CTTGGTTTTA 


TAAGCGGCCA 


GCTCGAATTT 


301 


GTTGGGCAGA 


TGGCCGTGAA 


TCAGCAGGTG 


GGCGACTTCT 


TCAAACTCGC 


351 


ATTTTTGTGC 


CAAATCAGAA 


TGTCGTAA 







This corresponds to the amino acid sequence <SEQ ID 348; ORF 098>: 

m098 -pep 

1 FTA^GLFVAF NLNAFAWRI LIPVQEDAAE AGDQFVGDVA RFTFRMAFTF 

51 RMNAAQHGYA GTHYVHRMGM CRQAFQNFNH TDRQAAKGFE LGFISGQLEF 

101 VGQMAVNQQV GDFFKLAFLC QIRMS* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 098 shows 89.6% identity over a 125 aa overlap with a predicted ORF (ORF 098.ng) 
from N. gonorrhoeae: 

m098/g098 

10 20 30 40 50 60 

m098 . pep 
g098 



m098 .pep 
g098 



m098 .pep 
g098 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 349>: 
a098 . seq 

1 ATGACCGCCG ATGGTCTCTT CGTCGCTTTC AACCTCAATG CGTTTGCCGT 
51 TGTGCGAATA TTGATACCAG TACAAGAGGA TGCTGCCGAG GCTGGCGATC 



MTADGLFVAFNLNAFAVVRI LI PVQEDAAEAGDQFVGDVARFTFRMAFTFRMNAAQHGYA 
niliilMlhiMlimilitMlhiltlliiiilli: 

MTADGLFVAFNFNTFAWRI LI PVQQDAAQAGDQFVGDVARFAVGMAFAFGMNAAEHGHA 
10 20 30 40 50 60 

70 80 90 100 110 120 

GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFISGQLEFVGQMAVNQQVGDFFKLAFLC 

ilhllliiliMlitliMlimiiHiltlllllillllllliilll lltlliill 
GTHHVHRMGMCRQAFQNFNHTDRQAAHG FE LG F I SGQLE FVGQMAVNQQVCDF FKLAFLC 
70 80 90 100 110 120 



QIRMSX 

llllll 
QIRMSX 
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101 AGTTTGTCGG CGATGTCGCG CGCTTCACTT TCCGGATGGC TTTCACGTTC 

151 AGGATGAACG CAGCCCAGCA TGGATACGCC GGTACGCATT ACGTCCATCG 

201 GATGGGTATG TGCAGGCAGG CTTTCCAAAA CTTTAATCAC ACGGATAGGC 

2 51 AGGCCGCGCA TGGATTTGAG CTTGGTTT ; A TAAGCGGCCA GCTCGAATTT 

301 GTTGGGCAGA TGGCCGTGAA TCAGCAGGTG GGCGACTTCT TCAAACTCGC 

351 ATTTTTGTGC CAAATCAGAA TGTCGTAA 

This corresponds to the amino acid sequence <SEQ ID 350; ORF 098. a>: 
a098 .pep 

1 MTADGLFVAF NLNAFA VVR1 LIPVQEDAAE AGDQFVGDVA RFTFRMAFTF 

51 RMNAAQHGYA GTHYVHRMGM CRQAFQNFNH TDRQAAHGFE LGFISGQLEt 

101 VGOMAVNQQV GDFFKLAFLC QIRMS* 



m098/a098 100.0% identity in 125 aa overlap 

10 20 30 40 50 60 

m098 . pep MTADGLFVAFNLNAFAVVRILIPVQEDAAEAGDQFVGDVARFTFRMAFTFRMNAAQHGYA 
I I I 1 ! I i i i t i I I M S I I I I I I II I I i I H i I I I I I I I I i \ \ \ li I I M I I I I i I i I I M 
a 0 9 8 MTADG L FVAFN LNAFAVVR I L I PVQE DAAEAG DQFVG DVAR FT FRMA FT FRMN AAQHG Y A 

10 20 30 40 50 60 

70 80 90 100 110 120 

m0 98 . pep GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFI SGQLEFYGQMAVNQQVGDFFKLAFLC 

I 1 I I I 1 I I I I I I I I I I I I I 1 I I It ! I I I I I I M I I I! M I I I I I M ! I I I I M ! I I I I I I 
a 0 98 GTHYVHRMGMCRQAFQNFNHTDRQAAHGFELGFI3GQLEP7GQMWJQQVGDFFKLAFLC 

70 80 90 100 110 120 



m098.pep QIRMSX 
! I t I i ! 

a098 QIRMSX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 35 1>: 

g099 . seq 

1 ATGCTGGGAC GCGCGTCCAT GATGCGCCTG CCCGATATTG TCGGCGTGGA 

51 GCTGACGGGC AAACGGCAGG CGGGCATTAC TGCCACAGAC ATCGTGTTGG 

101 CACTGACCGA ATTCTTGCGT AAAGAGCGCG TGGTCGGGGC GTTTGTCGAA 

151 TTTTTCGGCG AGGGCGCGAG AAGCCTGTCT ATCGGCGACC GCGCGACCAT 

201 TTCCAACATG ACGCCGGAGT TCGGCGCGAC TGCCGCCATG TTCGCCATCG 

2 51 ACGCGCAAAC TATTGATTAT TTGAAACTGA CCGGACGTGA CGACGCGCAG 

3 01 GTGAAATTGG TGGAAACCTA CGCCAAAACC GCAGGCTTAT GGGCAGGTGG 

3 51 CTTGAAAACC GCCGTTTATC CGCGCGTTTT GAAATTTGAT TTGAGCAGCG 

4 01 TAACGCGCAA TATGGCAGGC CCGAGCAACC CGCACGCGCG TTTTGCCACC 
4 51 GCCGATTTGG CGGCGAAAGG GCTGGCGAAG CCTTACGAAG AGCCTTCAGA 
501 CGGCCAAATG CCTGACGGTG CAGTGATTAT TGCCGCGATT ACTTCGTGTA 
551 CCAATACTTC CAACCCGCGC AACGTTGTCG CCGCCGCACT GTTGGCACGC 
6 01 AATGCCAACC GCCTCGGCTT GAAACGCAAA CCTTGGGTGA AATCTTCGTT 

6 51 TGCCCCGGGT TCAAAAGTAG CCGGAATCTA TTTGAAAGAA GCAGGCTTGT 

7 01 TGCCCGAAAT GGAAAAACTC GGCTTCGGTA TCGTCGCCTT CGCATGTACC 
751 ACCTGTAACG GCATGAgcgG CGCGCTcgaC CCGAAAATCC AACAAGAAAT 
801 CATCGACCGC GAtttgtacg cCACCGCCGT ATTGTCAGGC AACCGCAACT 

8 51 TCGACGGCCG TATCCATCCG TATGCGAAAC AGGCTTTCCT CGCTTCGCCT 
901 CCTTTGGTCG TTGCCTACGC ATTGGCAGGT AGCATCCGTT TCGATATTGA 
951 AAACGACGTA CTCGGCGTTG CAGACGGCCG CGAAATCCGC CTGAAAGATA 

10 01 TCTGGCCGAC AGACGAAGAA ATCGATGCCA TCGTTGCCGA ATATGTGAAA 

10 51 CCGCAACAAT TCCGCGACAT TTATATCC CG ATGTCCGACA CCGGCACAGC 

1101 GCAAAAAGCA CCAAGCCCGC TGTACGACTG GCGACCGATG TCCACCTACA 

1151 TCCGCCGTCC GCCCTATTGG GAAGGCGCAC TGGCAGGGGA ACGTACATTA 

12 01 AGAGGTATGC GTCCGCCGGC GATTTTGCCC GACAACATCA CCACCGACCA 

12 51 CATCTCgcca tCCAATGCGA TTTTGGCCGG cagTGCcgca ggtgaATATT 
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g099 . pep 






1 


MLGRASMMRL 


PDIVGVELTG 


51 


FFGEGARSLS 


IGDRATISNM 


101 


VKLVETYAKT 


AGLWAGGLKT 


151 


ADLAAKGLAK 


PYEEPSDGQM 


201 


NANRLGLKRK 


PWVKSSFAPG 


251 


TCNGMSGALD 


PKIQQEIIDR 


301 


PLWAYALAG 


SIRFDIENDV 


351 


PQQFRDIYIP 


MSDTGTAQKA 


401 


RGMRPPAILP 


DNITTDHISP 


451 


RGDHLTAQRA 


TFANPKLFNE 


501 


ETYMNRKQPL 


IIIAGADYGQ 


551 


NLIGMGVLPL 


QFKPGTNRHT 


601 


ETVEVPVTCR 


PDTAEEALVY 



13 01 TGGCGAAAAT GGGTTTGCCT GAAGAagaCT TCAACTCTTA CGCAACCCAC 

13 51 CGCGGCGACC ACTTGACCGC CCAACGCGCA ACCTTCGCCA ATCCGAAACT 

14 01 GTTTAACGAA ATGGTGAGAA ACGAAGACGG CAGCGTACGC CAAGGTtcgt 

14 51 tggcacgcgT tgaacCAGAA GGCCAAACCA TGCGCATGTG GGAAGCCATC 
1501 GAAACCTATA TGAACCGCAA ACAGCCGCTT ATCATCATTG CCGGTGCGGA 

15 51 CTATGGTCAA GGCTCAAGCC GCGACTGGGC GGCGAAGGGC GTGCGGCTGG 
1601 CGGGTGTGGA AGCCATCGCC GCCGAAGGTT TCGAGCGCAT CCACCGCACC 

16 51 AACCTCATCG GCATGGGCGT CTTGCCGCTG CAATTCAAAC CCGGCACCAA 
1701 CCGCCATACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTTGTCGGCG 
1751 AACGCACACC GCGCTGCGGC CTGACCCTCG TGATTCACCG TAAAAACGGA 
1801 GAAACCGTCG AAGTTCCGGT TACCTGCC3C CCCGATACCG CAGAAGAAGC 
1851 ATTGGTATAT GAAGCCGGCG GCGTATTGCA ACGGTTTGCA CAGGACTTTT 
1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 352; ORF 099. ng>: 

KRQAGITATD IVLALTEFLR KERWGAFVE 

TPEFGATAAM FAIDAQTIDY LKLTGRDDA0 

AVYPRVLKFD LSSVTRNMAG PSNPHARFAT 

PDGAVIIAAI TSCTNTSNPR NWAAALLAR 

SKVAGI YLKE AGLLPEMEKL GFGIVAFACT 

DLYATAVLSG NRNFDGRIHP YAKQA FLASP 

LGVADGRE I R LKDIWPTDEE IDAIVAEYVK 

PSPLYDWRPM STYIRRPPYW EGALAGERTL 

SNAILAGSAA GEYLAKMGLP EEDFNSYATH 

MVRNEDGSVR QGSLARVEPE GQTMRMWEA1 

GSSRDWAAKG VRLAGVEAIA AEGFERIHRT 

LQLDGTETYD WGERTPRCG LTLVIHRKNG 
EAGGVLQRFA QDFLEGNAA* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 353>: 

m099 . seq 

1 ATGCTGGGAC GCGCGTCCAT GATGCGCCTG CCCGATATTG TCGGCGTTGA 

51 GCTGAACGGC AAACGGCAGG CGGGCATTAC GGCGACGGAT ATTGTGTTGG 

101 CACTGACCGA GTTTCTGCGC AAAGAACGCG TGGTCGGGGC GTTTGTCGAA 

151 TTCTTCGGCG AGGGCGCGAG AAGCCTGTCT ATCGGCGACC GCGCGACCAT 

2 01 TTCCAACATG ACGCCGGAGT TCGGCGCGAC TGCCGCGATG TTCGCTATTG 
251 ATGAGCAAAC CATTGATTAT TTGAAACTGA CCGGACGCGA CGACGCGCAG 

3 01 GTGAAATTGG TGGAAACCTA CGCCAAAACC GCAGGCTTGT GGGCAGATGC 

3 51 CTTGAAAACC GCCGTTTATC CTCGCGTTTT GAAATTTGAT TTGAGCAGCG 

4 01 TAACGCGCAA TATGGCAGGC CCAAGTAACC CGCATGCCCG TTTTGCGACC 
4 51 GCCGATTTGG CGGCGAAAGG GCTGGCGAAG CCTTACGAAG AGCCTTCGGA 
501 CGGCCAAATG CCCGACGGCT CGGTCATCAT CGCCGCGATT ACCAGTTGCA 
551 CCAACACTTC CAACCCGCGC AACGTTGTTG CCGCCGCGCT CTTGGCACGC 
6 01 AATGCCAACC GTCTCGGCTT GAAACGCAAA CCTTGGGTGA AATCTTCGTT 
6 51 TGCCCCGGGT TCAAAAGTAG CCGAAATCTA TTTGAAAGAA GCGGGCCTGT 
701 TGCCCGAAAT GGAAAAACTC GGCTTCGGTA TCGTCGCCTT CGCCTGCACC 
751 ACCTGCAACG GCATGAGTGG CGCGCTGGAT CCGAAAATCC AGAAAGAAAT 
8 01 CATCGACCGC GATTTGTACG CCACCGCCGT ATTATCAGGC AACCGCAACT 
8 51 TCGACGGCCG TATCCACCCG TATGCGAAAC AGGCTTTCCT CGCTTCGCCT 
901 CCGTTGGTCG TTGCCTACGC GCTGGCAGGC AGTATCCGTT TCGATATTGA 
951 AAACGACGTA CTCGGCGTTG CAGACGGCAA GGAAATCCGC CTGAAAGACA 

1001 TTTGGCCTGC CGATGAAGAA ATCGATGCCG TCGTTGCCGA ATATGTGAAA 

1051 CCGCAGCAGT TCCGCGATGT GTATGTACCG ATGTTCGACA CCGGCACAGC 

1101 GCAAAAAGCA CCCAGTCCGC TGTACGATTG GCGTCCGATG TCCACCTACA 

1151 TCCGCCGTCC GCCTTACTGG GAAGGCGCGC TGGCAGGGGA ACGCACATTA 

12 01 AGAGGTATGC GTCCGCTGGC GATTTTGCCC GACAACATCA CCACCGACCA 

12 51 CCTCTCGCCG TCCAATGCGA TTTTGGCCGT CAGTGCCGCA GG C GAG TAT T 
1301 TGGCGAAAAT GGGTTTGCC" GAAGAAGACT TCAACTCTTA CGCAACCCAC 

13 51 CGCGGCGACC ACTTGACCGC CCAACGCGCT ACCTTCGCCA ATCCGAAACT 

14 01 GTTTAACGAA ATGGTGAAAA ACGAAGACGG CAGCGTGCGC CAAGGCTCGT 

14 51 TCGCCCGCGT CGAACCCGAA GGCGAAACCA TGCGCATGTG GGAAGCCATC 

15 01 GAAACCTATA TGAACCGCAA ACAGCCGCTC ATCATCATTG CCGGTGCGGA 
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1551 CTATGGTCAA G^TAAGCC GCGACTGGGC TGCAAAAGGC GTACGCCTCG 

16 01 CCGGCGTAGA AGCGATTGTT GCCGAAGGCT TCGAGCGTAT CCACCGCACC 

16 51 AACCTTATCG GCATGGGCGT GTTGCCGCTG CAGTTCAAAC CCGACACCAA 

1701 CCGCCATACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTGGTCGGCG 

1751 AACGCACACC GCGCTGCGAC CTGACCCTCG TGATT CACCG TAAAAACGGC 

13C1 GA\ACCGTTG AAGTTCCCGT TACCTGCTGC CTCGATACTG CAGAAGAAGT 

1S51 ATTGGTATAT GAAGGCGGCG GCGTGTTGCA ACGGTTTGCA CAGGATTTTT 

1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 354; ORF 099>: 
m099 . pep 

1 MLGRASMMRL PDIVGVELNG KRQAGITATD IVLALTEFLR KERWGAFVE 

51 FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDEQTIDY LKLTGRDDAQ 

101 VKLVETYAKT AGLWADALKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT 

151 ADLAAKGLAK PYEEPSDGQM PDGSVIIAAI TSCTNTSNPR NWAAALLAR 

2 01 NANRLGLKRK PWVKSSFAPG SKVAEIYLKE AGLLPEMEKL GFGIVAFACT 

2 51 TCNGMSGALD PKIQKEI IDR DLYATAVLSG NRNFDGRIHP YAKQAFLASP 

3 01 PLWAYAItAG SI RFDIENDV LGVADGKEIR LKDIWPADEE IDAWAEYVK 
351 PQQFRDVYVP MFDTGTAQKA PSPLYDWRPM STYIRRPPYW EGALAGERTL 

4 01 RGMRPLAILP DNITTDHLSP SNA1LAVSAA GEYLAKMGLP EEDFNSYATH 
4 51 RGDHLTAQRA TFANPKLFNE MVKNEDGSVR QGSFARVEPE GETMRMWEAI 
501 ETYMNRKQPL I I IAGADYGQ GSSRDWAAKG VRLAGVEATV AEGFERIHRT 
551 NLIGMGVLPL QFKPDTNRHT LQLDGTETYD WGERTPRCD LTLVIHRKNG 
601 ETVEVPVTCC LDTAEEVLVY EAGGVLQRFA QDFLEGNAA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N, gonorrhoeae 

ORF 099 shows 96.2% identity over a 639 aa overlap with a predicted ORF (ORF 099.ng) 
from N. gonorrhoeae: 

m099/g099 

10 20 30 40 50 60 

m0 99 . pep MLGRASMMRLPD I VGVELNGKRQAG I TATD I VLALTEFLRKERWGAFVE FFGEGARSLS 

IMIIIIIIIIIIIIIIhllllllMMIIIIIIIIIIIilllllllllllllllllll 

g099 MLGRASMMRLPDIVGVELTGKRQAGITATDIVLALTEFLRKERVVGAFVEFFGEGARSLS 
10 20 30 40 50 60 

70 80 90 100 110 120 

m099 . pep IGDRATISNMTPEFGATAAMFAIDEOTIDYLKLTGRDDA0VKLVETYAKTAGLWADALKT 

I I : . !; I, IIIIIMIIIIIIIIilllllllllMlll 

g0 9 9 IGDRATISNMTPEFGATAAMFAIDAOTIDYLKLTGRDDAQVKLVETYAKTAGLWAGGLKT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m09 9 . pep AVYPRVLKFDLSSVTRNMAGPSNPHARFATADLAAKGLAKPYEEPSDGQMPDGSVIIAAI 

lllllillllllllllllllllllilMllllllilllMlllllinilllhlllMI 
g09 9 AVYPRVLKFDLSSVTRNMAGPSKPHARFATADLAAKGLAKPYEEPSDGQMPDGAVIIAAI 
130 140 150 160 170 180 

190 200 210 220 230 240 

m09 9 .pep TSCTNTSNPRNWAAALLARNANRLGLKRKPWVKSSFAPGSKVAEI YLKEAGLLPEMEKL 

MIIIIIIIMIIMIIIIIIIIIIIIIIMIIIIIIIIIIIII IIIMIIIIIMM! 

g 0 9 9 TS CTNTSN PRNW AAALLARNANRLGLKRKPVA/KS S FAPGS KVAG I Y LKEAGLLPEME KL 

190 200 210 220 230 240 

250 260 270 280 290 300 

m09 9 . pep GFGIVAFACTTCNGMSGALDPKIQKEI I DRDLYATAVLSGNRNFDGRI HP YAKQAFLASP 

1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 M I 1 1 1 1 1 1 1 1 il I II 1 1 1 1 1 1 1 1 1 i I ! M 1 1 1 1 1 ! 1 1 

g099 G FG I V A F ACTT CNGM S G AL D PKIQQEIIDRDL Y AT A V LS GNRN FDGR I H P YAKQ A FLAS P 

250 260 270 280 290 300 

310 320 330 340 350 360 
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m099 pep plwayalagsirfdiendvlgvadgkeirlkdiwpadeeidawaeyvkpqofrdvyvp 
MMNNMNMMMMMMNMINMMNMIMNIMMMMNNNI 

gO 9 9 PLWAYALAGSIRFDIENDVLGVADGREIRLKDIWPTDEEIDAIVAEYVKPOQFRDIYIP 
310 320 330 340 350 360 



370 380 390 400 410 420 

mO 3 9 pep MFDTGTAQKAPSPLYDWRPMSTY I RRPPYWEGALAGERTLRGMRPLAI LPDN I TTDHLS P 

I IIIIIMIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIII llllllllllhll 

g 0 9 9 MSDTGTAQKAPS PLYDWRPM3TY I RRPPYWEGALAGERTLRGMRF ?AI LPDN I TTDH ISP 

370 380 390 400 410 420 



430 440 450 460 470 480 

m0 99 pep SNAILAVSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRATFANPKLFNEMVKNEDGSVR 

IMIII IIIIIIIIIIIIIIIIMIIIIIIIIIIMIIIIIIIIMIMIhllMMI 

g 0 9 9 SNAI LAGS AAGEYLAKMGLPEEDFNSY ATHRGDHLTAQRATFANPKLFNEMVRNEDGSVR 

430 i40 450 460 470 480 



490 500 510 520 530 540 

m099 pep QG S F AR V E P E GETMRMW E A I E T YMN RKQ P L 1 1 1 AG AD Y GQGS S RD W AAKG V RL AG V E A I V 

I | I : | M | I I I : I II I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I : 
g 0 9 9 QGSLARVEPEGQTMRMWEAIETYMNRKOPL1 1 I AGADYGQGSSRDWAAKGVRLAGVEAI A 

490 500 510 520 530 540 



550 560 570 580 590 600 

m0 99 . pep AEGFER I HRTNLI GMGVLPLQFKPDTNRHTLQLDGTET YDWGERT PRCDLTLV I HRKNG 

IIMIIIIMMIMIIMMIII MIMIIMIIIIIIMMIIIM MINIUM 

gO 9 9 AEGFERIHRTNLIGMGVLPLQFKPGTNRHTLQLDGTETYDWGERTPRCGLTLVIHRKNG 
550 560 570 580 590 600 



610 620 630 640 

nC 9 9 . pep ETVEVPVTCCLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 

MINIMI IIMNMMMIIIMIMIIIIIIM! 

gO 99 ETVEVPVTCRPDTAEEALVYEAGGVLQRFAQDFLEGNAAX 
610 620 630 640 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 355>: 

a099 . seq 

1 ATGCTGGGAC GCGCGTCCAT GATGCGCCTG CCCGATATTG TCGGCGTTGA 

51 GCTGAACGGC AAACGGAAGG CGGGCATTAC GGCGACGGAT ATTGTGTTGG 

101 CACTGACCGA GTTTCTGCGC AAAGAACGCG TGGTCGGGGC GTTTGTCGAA 

151 TTCTTCGGCG AGGGCGCGAG AAGCCTGTCT ATCGGCGACC GCGCGACCAT 

201 TTCCAACATG ACGCCGGAGT TCGGCGCGAC TGCCGCGATG TTCGCTATTG 

2 51 ATGAGCAAAC CATTGATTAT TTGAAACTGA CCGGACGCGA CGACGCGCAG 

301 GTGAAATTGG TGGAAACCTA CGCCAAAACC GCAGGCTTGT GGGCAGATGC 

351 CTTGAAAACC GCCGTTTATC CGCGCGTTTT GAAATTTGAT TTGAGCAGCG 

4 01 TAACGCGCAA TATGGCAGGC CCGAGCAACC CGCACGCGCG TTTTGCGACC 

451 GCCGATTTGG CCGGCAAAGG CTTGGCTAAA CCTTACGAAG AGCCTTCAGA 

501 CGGCCAAATG CCTGACGGTG C AG T GAT TAT TGCCGCGATT ACTTCCTGTA 

551 CCAATACTTC CAATCCGCGC AACGTTGTCG CCGCCGCGCT GTTGGCACGC 

601 AATGCCAACC GCCTCGGCTT GCAACGCAAA CCTTGGGTGA 7AATCTTCGTT 

651 TGCCCCGGGT TCAAAAGTAG CCGAAATCTA TTTGAAAGAA GCAGATCTGC 

701 TGCCCGAAAT GGAAAAACTC GGCTTCGGTA TCGTTGCCTT CGCATGTACC 

7 51 ACCTGTAACG GCATGAGCGG CGCGCTGGAT CCGAAAATCC AGAAAGAAAT 

801 CATCGACCGC GATTTGTACG CCACCGCCGT ATTGTCAGGC AACCGCAACT 

851 TTGACGGCCG TATCCATCCG TATGCGAAAC AGGCTTTCCT CGCTTCGCCT 

901 CCGTTGGTCG TTGCCTACGC GCTGGCAGGC AGCATCCGTT TCGATATTGA 

951 AAACGACGTA CTCGGCGTTG CAGACGGCAA AGAAATCCGC CTGAAAGACA 

1001 TTTGGCCTAC CGATGAAGAA ATCGATGCCA TCGTTGCCGA ATATGTGAAA 

1051 CCGCAGCAAT TTCGCGACGT TTATATCCCG ATGTTCGACA CCGGCACAGC 

1101 GCAAAAAGCA CCAAGCCCGC TGTACGACTG GCGTCCAATG TCTACCTATA 

1151 TCCGCCGCCC ACCTTACTGG GAAGGCGCAC TGGCAGGGGA ACGCACATTA 

1201 AGCGGTATGC GTCCGCTGGC GATTTTGCCC GACAACATCA CCACCGACCA 
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12 51 TCTCTCGCCA TCCAATGCGA TTTTGGCAAG CAGTGCCGCA GGCGAATATT 
1301 TGGCAAAAAT GGGTTTGCCT GAAGAAGACT TCAACTCTTA CGCAACCCAC 

13 51 CGTGGCGACC ACTTGACCGC CCAACGCGCA ACCTTCGCCA ATCCGAAACT 

14 01 GTTTAACGAA ATGGTGAGAA ACGAAGACGG CAGCGTACGC CAAGGTTCGC 
14 51 TGGCACGCGT TGAACCCGAA GGCCAAACCA TGCGCATGTG GGAAGCCATC 
1501 GAAACCTATA TGAACCGCAA ACAGCCGCrc ATCATCATTG CCGGCGCGGA 
1551 CTACGGTCAA GGCTCAAGCC GCGACTGGGC TGCAAAAGGC GTACGCCTCG 
1601 CCGGCGTGGA AGCGATTGTT GCCGAAGGCT TCGAGCGTAT CCACCGCACC 
1651 AACTTGATCG GTATGGGCGT GTTGCCGCTG CAGTTCAAAC CGGGTACCAA 
17 01 CCGCCACACC CTGCAACTGG ACGGTACGGA AACCTACGAC GTTGTCGGCG 
17 51 AACGCACACC GCGCTGCGAC CTGACCCTTG TGATTCACCG 7AAAAACGGC 
1801 GAGACCGTCG AAGTCCCCAT TACCTGCCGC CTCGATACCG CAGAAGAAGT 
1851 GTTGGTATAT GAAGCCGGTG GCGTATTGCA ACGGTTTGCA CAGGATTTTT 
1901 TGGAAGGGAA CGCGGCTTAG 

This corresponds to the amino acid sequence <SEQ ID 356; ORF 099. a>: 

a099.pep 

1 MLGRASMMRL PDI VGVELNG KRKAGITATD IVLALTEFLR KERVVGAFVE 

51 FFGEGARSLS IGDRATISNM TPEFGATAAM FAIDEQTIDY LKLTGRDDAQ 

101 VKLVETYAKT AGLWADALKT AVYPRVLKFD LSSVTRNMAG PSNPHARFAT 

151 ADLAGKGLAK PYEEPSDGQM PDGAVIIAAI TSCTNTSNPR NVVAAALLAR 

201 NANRLGLQRK PWVKSSFAPG SKVAEIYLKE ADLLPEMEKL GFGIVAFACT 

251 TCNGMSGALD PKIQKEI IDR DLYATAVLSG NRNFDGRIKP YAKQA FLASP 

301 PLVVAYALAG SI RFDIENDV LGVADGKEIR LKDIWPTDEE IDAIVAEYVK 

351 PQQFRDVYIP MFDTGTAQKA PSPLYDWRPM STYIRRPPYW EGALAGERTL 

4 01 SGMRPLAILP DNITTDHLSP SNAILASSAA GEYLAKMGL? EEDFNSYATH 

4 51 RGDHLTAQRA TFANPKLFNE MVRNEDGSVR QGSLARVSPE GQTMRMWEAI 

501 ETYMNRKQPL I I IAGADYGQ GSSRDWAAKG VRLAGVEAIV AEGFERIHRT 

551 NLIGMGVLPL QFKPGTNRHT LQLDGTETYD WGERTPRCD LTLVIHRKNG 

601 ETVEVPITCR LDTAEEVLVY EAGGVLQRFA QDFLEGNAA* 

m099/a099 97.5% identity in 639 aa overlap 

10 20 30 40 50 60 

m099.pep MLGRASMMRL PDIVGVELNGKRQAGI TAT D I VLALTEFLRKERVVGAFVE FFGEGARSLS 

t II I I I I I I I I I M t I I i I I I I : I I I I I I I I I I M I I I I I I I i I I I I i i I I i I I i I I ! I I 
a09 9 MLGRASMMRL PDI VGVELNGKRKAGITATD I VLALTEFLRKERVVGAFVE FFGEGARSLS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m099 . pep IGDRAT I SNMTPEFGATAAMFAIDEQT I DYLKLTGRDDAQVKLVETYAKT AGLWADALKT 

1 I i t I i I I i I I I I I i I I I t I I I II I i I I ! I I II i I ! II I I I M II M M f I I i ! i M I I I 
a0 9 9 IGDRAT I SNMTPEFGATAAMFAIDEQT I DYLKLTGRDDAQVKLVETYAKTAGLWADALKT 

70 80 90 100 110 120 

130 140 150 160 170 180 

m099 .pep AVYPRVLKFDLSSVTRNMAGPSNPHARFATADLAAKGLAKPYEEPSDGQMPDGSVIIAAI 
I I I I I I ! i I I I ! I 1 I f I II I I I I I ! i I I M ! I II : i I I I I I ! I I I II ! I I I I I : I I I II i 
a09 9 AVYPRVLKFDLSSVTRNMAGPSNPHARFATADLAGKGLAKPYEEPSDGQMPDGAVIIAAI 

130 140 150 160 170 180 

190 200 210 220 230 240 

m099.pep TSCTNTSNPRNWAAALLARNANRLGLKRKPWVKSSFAPGSKVAEI YLKEAGLLPEMEKL 

I I M I I II I I I I I M I i I I II I I ! I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i 
a099 TSCTNTSNPRNVVAAALLARNANRLGLQRKPWVKSSFAPGSKVAEIYLKEADLLPEMEKL 
190 200 210 220 230 240 

250 260 270 280 290 300 

m099.pep GFGIVAFACTTCNGMSGALDPKIQKEIIDRDLYATAVLSGNRNFDGRIHPYAKQAFLASP 
I I I I II I I I ! I I I I I I I I I I I I I I I I I I I 1 I I I I I II I I II I I I I I I i i I I I I I I I I I II 
a099 GFGIVAFACTTCNGMSGALDPKIQKE 1 1 DRDLYATAVLSGNRNFDGRI HP YAKQAFLASP 

250 260 270 280 290 300 

310 320 330 340 350 360 

mO 9 9 . pep PLWAYALAGS I R FDI ENDVLGVADGKE I RLKDIW PA DEE I DAVVAE YVKPQQFRDVYV P 
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t M I I t i M t I I I i t I i I 1 I I ) I I I II t I ) I 1 1 i 1 I : ! I It M : I I I 1 I M I ) 1 1 ) I I : ) 
a0 99 PLVVAYALAGSIRFDIENDVLGVADGKEIRLKDIWPTDEEIDAIVAEYVKPQQFRDVYIP 
310 320 330 340 350 360 

370 380 390 400 410 420 

m099 . pep MFDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLRGMRPLAILPDNITT^HLSP 
I I I ! ! ! I ! ! I i ! I i i i I I ! i I I I I I i I I I I I ! ! I I I I M f I I M i I I I i I f I M i j I i I 
a0 9 9 MFDTGTAQKAPSPLYDWRPMSTYIRRPPYWEGALAGERTLSGMRPLAILPDNITTDHLSP 

370 380 390 400 410 420 

430 440 450 460 470 480 

m099 .pep SNAILAVSAAGEYLAKMGLPEEDFNSYAT.-IRGDHLTAQRATFANPKLFNEMVKNEDGSVR 
I I I M I I , I I I I I I ! I I I I I I I I i I I ! 1 i I I : I I I I I I I I I II I I I I I I I I : I I I I I 1 ! 
a099 SNAILASSAAGEYLAKMGLPEEDFNSYATHRGDHLTAQRAT FANPKLFNEMVRNEDGSVP, 

430 440 450 460 470 480 

490 500 510 520 530 540 

m09 9 .pep QGSFARVEPEGETMRMWEAIETYMNRKQPLI IIAGADYGQGSSRDWAAKGVRLAGVEAIV 

I II : I I I I I I i : I I i I I I I I I I ! ( ( I I I I I I I ! I I I I I I I I I I I I I I I I i I I I f I I I I II 
aO 9 9 QGSLARVEPEGQTMRMWEAIETYMNRKQPL 1 1 1 AGADYGQGSSRDWAAKGVRLAGVEAI V 

490 500 510 520 530 540 

550 560 570 580 590 600 

m099 . pep AEGFERIHRTNLIGMGVLPLQFKPDTNRKTLQLDGTETYDWGERTPRCDLTLVIHRKNG 
1 I i t I I t 1 1 I I i I I I I I ! I i I I I I I i I M I ' i 1 ! I I I I I I I I I I I I I i ! 1 t i H i M i I 
a0 99 AEGFERIKRTNLIGMGVLPLQFKPGTNRHTLQLDGTETYDVVGERTPRCDLTLVIHRKNG 
550 560 570 5S0 590 600 

610 620 630 640 

m0 99 .pep ETVEVPVTCCLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 
I ) 1 I I I : I I I I I I I I I I 1 I I 1 1 I I I 1 N I I I I 1 I I I 1 I I 
a0 99 ETVEVPITCRLDTAEEVLVYEAGGVLQRFAQDFLEGNAAX 

610 620 630 640 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 357 

gl02 . seq 

1 AtgtCCGCCA AAactccgtc gctcttcggc ggcgcgatga Ttatcgccgg 

51 gaaggttatc ggcgcAGgta tgttccccaa ccccaccqcc aacttggggg 

101 acgggttaat aggctcgctg attgtgctgc tgtacacctg gtttccattc 

151 tcctccggcg ccctcatgat tttggaagtc aacacccata acCCccgagg 

201 ggcaAGtttt gacaccATGg tcAAagacct gctcgGaCGc ggctggaaca 

251 tcatcaacgg catcgccgtc gctttggTCc tatacggctc gacctacgcg 

301 tacattttag tcggcggtga cctGACCGCC AAAGGCAtcg GCAgCGCAGT 

351 AGGCGGCAAA ATTTCgctca CCGTCGGACA actcgtcttc tTCGGCATCC 

401 TCGCCTTTTG CGTATGGGCA TCCGCACGCT TGGTCGACCG CTTTACCGGC 

4 51 GTCCTCATCG GCGGCATGGT ATTAACCTTT ATTTGGGCAA CCGGCGGCCT 

501 GGTTGCCGAT GCCAAACCGT CCGTCCTCTT CGACACCCAA GCCCCCGTCG 

551 GCACCGGCTA CTGGATTTAC GCCGCCACCG CCCTGCCCGT CTGCCTCGCT 

601 TCCTTCGGCT TCCACGGCAA CGTTTCCAGC CTGCTCAAAT ACTTTAAAGG 

651 CGACGcgcCc aaagtGgCGA aATCcatctg gGcaggtaca ttggTTGCCt 

701 tggtaattta cgtccTCTgg caaaccgcca tCcaaagcaa ccTGCcgcgc 

751 aacgagttcg cCCCcgtgat tgccgccgag aggcaactCT CCGTCCTgaa 

801 tgaaacccTG tccaaattcg cccaaaccgg cgatatggat aAaatattgt 

851 ccctatttcc ctacatggca atcgccacct cctttttagg cgTAACctta 

901 ggcctgtttg acaacatcac cgacatcttc aaatggaacg acagtatgtc 

951 cgggcggggc accaaaaccg tcgcgctgaa cttcctgccg CCCCtgattt 

1001 cctggctgct cctccccacc ggcttcttta ccgccattgg tgcgtccggc 

1051 ctggcggcaa ccgtctggga ccaaqGcatc atccccgcca tgctgctcta 

1101 cgtttccccc caaaaaattG gcGcaggcaa gacttataAa gtttaCGGCG 

1151 gcttgtggct gatgttagtc ttccttttci gcatcgccaa catcgccgca 

1201 CAGGTATTGA GccaAatgGa ACtcgtCccc GTATTTAAAG GATAA 



This corresponds to the amino acid sequence <SEQ ID 358; ORF 102.ng>: 

gl02 .pep 
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1 

51 
101 
151 

201 
251 
301 
351 
401 



MSAKTPSLFG GAMI I AGKVI GMFPNPTA NLGDGUGSL IVLLYTWFPF 
DTMVKDLLGR GWHIINGIAV ALVLYGSTYA 



SSGALMI LEV 
YILV GGDLTA 
VLIGGMVLTF 



NTHNPRGASF 
KGIGS AVGGK 
IWATGGLVAD 



SFGFHGNV5S 
NEFAPVIAAE 
GLFDNIADIF 
LAATVWDQGI 
QVLSQMELVP 



LLKYFKGDAP 
RQLSVLNETL 
KWUDSMSGRG 
IPAMLLYVSP 
VFKG* 



ISLTVGQLVF FGILAFC VWA SARLVDRFTG 
AK?S VLFDTO APVGTGYWIY AATALPVCLA 
KVAKSIW AGT LVALVIYVLW QTAI QSNLPR 
SKFAQTGDMD K I LSLFPY MA I ATSFLGVTL 
TKTVALNFLP P LISWLLLPT GFFTAIGA 5G 
QKIGAGKTYK VYGGLW LMLV FLFGIANIAA 



The following partia 

ml02 . seq 

1 ATGCCCAACA 

S 1 CACGGTCATC 

101 TATGGTTTAC 

151 CTTTCCAGCG 

201 CGCAAGTTTC 

2 51 TCATCAACGG 

301 TATATCTTCG 

351 AGGCGGCGAC 

4 01 TCGCCTTTTG 

4 51 GTCCTTATCG 

501 GATTGCCGAT 

551 GCACAAACTA 

601 TCCTTCGGCT 

651 CGACGCGCCC 

701 TGGTAATTTA 

751 AACGAGTTCG 

SOI CGAAACCCTG 

851 CCCTGTTTTC 

901 GGACTCTTCG 

951 CGGCCGCACC 

1001 GCCTGCTCTT 

1051 GCGGCAACCG 

1101 GCGCAAAAAA 

L151 GGCTGATGGT 

1201 GTATTGAGCC 



DNA sequence was identified in N, meningitidis <SEQ ID 359- 



AAACCCCTTC 
GGCGCAGGCA 
CGGCTCGCTG 
GCCTGATGAT 
GACACGATGG 
CATCGCCGTC 
TCGGCGGCGA 
GTTTCACTCA 
CGTATGGGCA 
GCGGCATGGT 
GCCAAGCCGT 
CTGGATTTAC 
TCCACGGCAA 
AAAGTGGCTA 
CGTCCTCTGG 
CCCCCGTCAT 
TCCAAATTCG 
CTATATGGCG 
ACTACATCGC 
AAAACCGCCG 
CCCCACCGGC 
TCTGGACAGG 
TTCGGCGCAG 
TTGGGTCTTC 
AAATGGAACT 



ACTGTTCGGC 
TGCTCGCCAA 
GCCGTGTTGC 
TTTGGAAGTC 
TCAAAGACCT 
GCCTTCGTTT 
CCTGACCGCC 
CCGTCGGACA 
TCCGCACGCT 
ATTGACCTTT 
CCGTCCTCTT 
GCCGCCACCG 
CGTCTCCAGC 
AATCCATCTG 
CAAACCGCCA 
CGCCGCCGAA 
CCCAAACCGG 
ATCGCCACCT 
CGACATCTTC 
CGCTGACCTT 
TTCGTTACCG 
CATCATCCCC 
GCAAAACCTA 
CTTTTCGGCA 
CGTCCCCGTA 



GGCGCGATGA 
CCCGACCGCC 
TGTACACCTG 
AACACCCATT 
GCTCGGACGC 
TATACCTGCT 
AAAGGCTTAG 
ACTCGTCTTC 
TGGTCGACCG 
ATTTGGGCGG 
CGATACCCAA 
CCCTGCCCGT 
CTGCTCAAAT 
GACGGGCACA 
TCCAAGGCAA 
GGGCAAGTCT 
CAATATGGAC 
CGTTTTTAGG 
AAATGGAACG 
CCTGCCGCCC 
CCATCGGCTA 
GCCATGCTGC 
TAAAGTTTAC 
TCGTCAACAT 
TTTAAAGGAT 



TTATCGCCGG 
A^ATCCGGCG 
GTTTTCTATG 
ATCCGCACGG 
GGCTGGAACA 
TACTTACGCT 
GCAGCGCGGC 
TTCGGCATCC 
CTTCACCGGC 
CCGGCGGGCT 
GCCCCCGCCG 
CTGCCTCGCT 
ACTTTAAAGG 
CTGATTGCGC 
CCTGCCGCGC 
CCGTCCTCAT 
AAAATATTGT 
CGTAACGCTC 
ACAGCATCTC 
CTGATTTCCT 
CGTCGGCCTG 
TCTACCGTTC 
GGCGGCTTGT 
CGCCGCACAG 
AA 



This corresponds to the amino acid sequence <SEQ ID 360; ORF 1 02- 

ml02 -pep. 



1 
51 
101 
151 
201 
251 
301 
351 
401 



MPNKTPSLFG GAMIIAGTVI GAGMLANPTA TSGVWFTGSL AVLLYTWFSM 



LSSGLMILEV 
YIFVGGDLTA 
VLIGGMVLTF 



NTHYPHGASF 
KGLGSAAGGD 
IWAAGGLIAD 



SFGFHGNVSS 
NEFAPVIAAE 
GLFDYIADIF 
AATVWTGII P 
VLSQMELVPV 



LLKYFKGDAP 
GQVSVLIETL 
KWNDSI SGRT 
AMLLYRSRKK 
FKG* 



DTMVKDLLGR GWNII NGIAV AFVLYLLTYA 

VSLTVGQLVF FGILAFC VWA SARLVDRFTG 

AKPSVLFDTQ APAGT NYWIY AATALPVCLA 

KVAKS IWTGT LIALVIYVLW QT A I QGN L P R 

SKFAQTG NMD KILSLFSYMA IATS FLGVTL 

KTAALTFLPP LISCLL FPTG FV TAIGYVGL 

FGAGKTYKVY GGLWLMVWVF LFGIVNIAAQ 



ml02/gl02 86.0% identity in 415 aa overlap 

10 20 30 40 50 60 

ml 02 .pep MPNKTPSLFGGAMIIAGTVIGAGMLANPTATSGVWFTGSLAVLLYTWFSMLSSGLMILEV 
I t I I I I I I I I I t I I I I I I I 1 I : MM: I : 111 | I | 1 I I I : I : : M M M 
gl02 MSAKTPSLFGGAMI I AGKVIGAGMFPNPTANLGDGLIGSLI VLLYTWFPFSSGALMILEV 

10 20 30 40 50 60 

70 80 90 100 110 120 

rul02 .pep NTHYPHGASFDTMVKDLLGRGWNI INGIAVAFVLYLLTYAYIFVGGDLTAKGLGSAAGGD 

Ml M M M M f M M M M M M I M I M : I M I I I I ! M I I ! I I ! I I : i I I : I I 
gl02 NTHNPKGASFDTMVKDLLGRGWNIINGIAVALVLYGSTYAYILVGGDLTAKGIGSAVGGK 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 02 .pep VSLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFIWAAGGLIADAKPSVLFDTQ 
: I I I I ! I I I I i I I I I I ! I I I M I I ! I I I I I I I! I M I M I I I I : I I I : I I I I i I I M ! M 
g!02 ISLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFI WATGGLVADAKPSVLFDTQ 

130 140 150 160 170 180 



ml 02 . pep 



190 200 210 220 230 240 

APAGTN Y W I Y AATAL P VC LAS FGFHGNVSSLLKY "KG DAPKVAKS IWTGTLI ALVI Y V L W 
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I I : I I : I I I I I I I I M I I ! j ! I i I I 1 i I II i I I I I I I I I I I I I I 1 i I : I ! I : i i i I I ! I I 
g 1 0 2 A P VGTG Y W I Y AAT A L P VC LA S FG H I G N V S S L L K Y FKG DAP K V A K S I W AG T L V A L V I Y V LW 

ISO 200 210 220 230 240 

250 260 270 230 290 300 

ml 02 . pep QT A I QG N L PRNE FA P V I AAEG QV S V L I ETLS K FAQT G N MD K I L 3 L F ^ YMA I AT S F LG V T L 

| I I ! I : I 1 M I I I I I t I I II . - i ' I ! I I I I I I I I I : I I I I If M I I 1 I I I I I I I I I I 
g!02 QTAI QSNLPRNEFAPV I AAERQLSVLh'ETLSK F7iQ7GDMDKI LSLFPYMA1 ATSFLGVTL 

250 260 270 280 290 300 

310 320 330 340 350 

mi02 . pep glfdyiadifkwwdsisgr-tktaaltflppliscllfptgfvtaigyvglaatvwt-g: 

III! I I I I I I I t i f : I I I I ! I : I t : I I I I I I I I ' ' I I I I I I I I I j i I ! 
gl02 GLFDNI ADI FKWNDSMSGRGTKTVA.LHFLPPLISWLLLr TGFFTAIGASGLAATVWDQGI 

310 320 330 340 350 360 

360 370 380 390 400 410 

ml02 .pep IPAMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGIVNIAAQVLSQMELVPVFKGX 
I I I I ; i I | : I : t I I I I i M I I I I M 1 : M M I I : I I I I I I I I I 1 I I I 1 1 I I I I 
g 1 0 2 I P AM LL Y V S PQK I G AG KT Y K V YG GLW LML - V FL FG I AW I AAOV L 5 QMELV P V FKGX 

370 380 390 400 410 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 361 >: 
al02 . seq 

1 ATGCCCACCA AAACCCCTTC ACTGTTCGGC GGCGCGATGA TTATCGCCGG 

51 CACGNTCATC GGCGCAGGTA TGCTCGCCAA CCCGACCGCC ACATCCGGCG 

101 TATGGTTTAC CGGCTCGCTG GCCGTCTGC TGTACACCTG GTTTTCCATG 

151 CTCTCCAGCG GCCTGATGAT TTTGGAAGTC AACACCCACT ACCCCCACGG 

201 CGCGANCTTC GACACCATGG TTAAAGACCT GCTCGGACGG AGCTGGAACA 

251 TCATCAACGG CATCGCCGTC GCCTTCGTTT TATACCTGCT TACTTACGCT 

3 01 TATATCTTCG TCGGCGGCGA CCTGACCGCC AAAGGCTTAG GCAGCGCGGC 
351 AGGCGGCAAT GTTTCACTCA CCGTCGGACA ACTCGTCTTC TTCGGCATTC 

4 01 TCGCCTTTTG CGTATGGGCA TCCGCACGCT TGGTCGACCG ATTCACCAGC 
4 51 GTCCTCATCG GCGGCATGGT ATTAACCTTT ATTTGGGCAA CCGGCGGCCT 
501 GATTGCCGAT GCCAAACTGC CCGTCCTCTT CGACACCCAA GCCCCTACCG 
551 GCACCAACTA CTGGATTTAT GTCGCCACCG CCCTGCCCGT CTGCCTTGCG 
601 TCATTCGGTT TCCACGGCAA CGTCTCCAGC CTGCTCAAAT ACTTTAAAGG 

6 51 CGACGCGCCC AAAGTGGCTA AATCCATCTG GACGGGCACA CTGATTGCGC 

7 01 TGGTAATTTA CGTCCTCTGG CAAACCGCCA T CC AANGCAA CCTGCCGCGC 

7 51 AACGAGTTCG CCCCCGTGAT TGCCGCCGAA GGGCAAGTCT CCGTCNTGAT 
801 TGAAACCCTG TCCAAATTCG CCCAAACCGG CAATATGGAC AAAATATTGT 

8 51 CCCTGTTTTC CTATATGGCG ATCGCCACCT CGTTTTTAGG CGTAACGCTC 
901 GGACTCTTCG ACTACATCGC CGACATCTTC AAATGGAACG ACAGCGTGTC 
951 CGGCCGCACC AAAACCGCCG CGCTGACCTT CCTGCCGCCT NTAATTTCCT 

1001 GCCTGCTCTT CCCCACCGGC TTTGTTACCG CCATCGGNTA CGTCGGCCTG 

1051 GCGGCAACCG TCTGGACAGG CATCATCCCC GCCATGCTGC TNTACCGTTC 

1101 GCGCAAAAAA TTCGGCGCAG GCAAAACCTA TAAAGTTTAC GGCGGCTTGT 

1151 GGCTGATGGT TTGGGTCTTC CTTTTCGGCA TCNTCAACAT CGCCGCACAN 

1201 GTATTGAGCC AAATGGAACT CGTCCCCGTA TTTAAAGGAT AA 
1202 

This corresponds to the amino acid sequence <SEQ ID 362; ORF 102.a>: 

al02 .pep 

1 

51 
101 
151 
201 
251 
301 
351 
401 



MPTKTPSLFG GAMIIAGTXI GAGMLAN PTA TSGVWFTGSL AVLLYTWFSM 

LSSGLMILEV NTHYPHGAXF DTMVKDLLGR SWNIINGIAV AFVLYLLTYA 

YIF\/GGDLTA KGLGSAAGGN VSLTVGQLVF FGILAFCVWA SARLVDRFTS 

VLIGGMVLTF IWATGGLIAD AKLPVLFDTQ APTGTNYWIY VATALPVCLA 

SFGFHGNVSS LLKYFKGDAP KVAKSIWTGT LIALVIYVLW QTAIQXNLPR 

NEFAPVIAAE GQVSVXIETL SKFAQTGNMD KILSLFSYMA IATSFLGVTL 

GLFDYIADIF KWNDSVSGRT KTAALTFLPP XISCLLFPTG FVTAIGYVGL 

AATVWTGIIP AMLLYRSRKK FGAGKTYKVY GGLWLMVWVF LFGIXNIAA>v 

VLSQMELVPV FKG * 



ml02 / a!02 



95.5% identity in 4 13 aa overlap 
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10 20 30 40 50 60 

m 1 0 2 . pe p M P N KT P S L FG G AM 1 1 AGT V I G AG M LAN PTA7SG VW FTG S L A V L L Y T W F S MLS SG LM I LE V 

I t : I I 1 ! I I II ( I i ! I i ! I 1 I I i M ! I I i i 1 ! 1 I I i i I I I I ! I I I I I I I i I I ! I I ! ! I I 
a 1 0 2 MPTKT PSLFGGAM I IAGTX I GAGMLAN FT AT S G VW FTGS LA VLL YT W FSMLSSG LM I LEV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 02 . pep N T H Y P HG AS F I3TMV KD L LGRG W N 1 1 N G I A V A FV L Y L LT Y A Y I FV GG D LT AKG LG S AAGG D 

I I f I I I I I I M ! I I I I I I I : I I I ! I I I [ I i I ! I i I I I II 1 I I I I I ! I f I I I I t I I I i I : 
a 102 NTHYPKGAXFDTMVKDLLGRSWNI INGIAVAFVLYLLTYAYI FVGGDLTAKGLGSAAGGN 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml02 .pep VSLTVGQLVFFGILAFCVWASARLVDRFTGVLIGGMVLTFIWAAGGLIADAKPSVLFDTQ 
i I I I I I I I I f I I I M I I I I 1 I I I I ! I I I i : I I I I I I ! [ I I I 1 I : I I I ! i I i I I I I I I I 
a 102 VSLTVGQLVFFGILAFCVWASARLVDRFTSVLIGGMVLTFIWATGGLIADAKLPVLFDTQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml02 . pep APAGTNYWI YAATALPVCLASFG FHGN VSSLLKYFKG DA PKVAKS IWTGTLIALVI YVLW 

I I : I I I I I I i : ( I I I I I I I I I I I I I M I I I I I I I I i I I I I I I I I I I I I I I I I I I I I f ! I ! 
a 102 APTGTNYWIYVATALPVCLASFGFHGNVSSLLKYFKGDAPKVAKS IWTGTLIALVI YVLW 

190 200 210 220 230 240 

250 260 270 230 290 300 

ml 02. pep OTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQTGNMDKI LSLFSYMAIATSFLGVTL 

Mill I I I I I I i I I I ! i i I I I I I I I I I ! I I I 1 i ! I I i I ! I i 1 I t I ! I I I [ I I I I I ! I ! 
al02 QTAIQXNLPRNEFAPVIAAEGQVSVXIETLSKFAQTGNMDKILSLFSYMAIATSFLGVTL 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml02 . pep GLFDYIADIFKWNDSISGRTKTAALTFLPPLISCLLFPTGFVTAIGYVGLAATVWTGIIP 
I I I I I 1 M I I I H i I : I 111 it I i I I i M i M I 1 i i M i I 1 I M I M M M t ! i I I 1 M 
al02 GLFDYIADIFKWNDSVSGRTKTAALTFLPPXISCLLFPTGFVTAIGYVGLAATVWTGIIP 

310 320 330 340 350 360 

370 380 390 400 410 

ml 02 . pep AMLL YRS RKKFG AGKT YKV YGGLW LMVWV FL FG I VN I AAQVL SQME LV PV FKGX 

I I I I I M ! I I i I I ! I I I i ! ! ! I I I I I I I I I I I I i I I I I I I I M I I If I I I I I 
a!02 AMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGIXNIAAXVLSQMELVPVFKGX 

370 380 390 400 410 
The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 363>: 

gl05 . seq 

1 Atgtccgcag aaaCATACAc acAAAtcggc tGGgtaggct taggGcaaat 

51 gGgtctgcct atgGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCcgc CAAAGGAGCA 

151 AAAGTTTACG GCagcACCGC CGAACTCGTC CGCGCCTGCC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

3 01 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

3 51 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG ACCCGCCACC AACGGCACAC 
401 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 

4 51 ATATTTT CCC TTGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 
501 AGGCTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTAGGC ATTTTCGGCG 
551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 
601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCTATGTT 
651 TCAAACAAAA AAATCACTAT GGGCAAACCG TGAGTTCCCC CCTGCCTTTG 
701 CACTCAAACA CGCTTCCAAA GAcctTAACC TCGccgtcAA AGAGCTTGAA 
751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 
801 CAAAGCAGTT GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 
851 TGAAATTGGC AGAACACTGA 
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This corresponds to the amino acid sequence <SEQ ID 364; ORF 105.ng>: 

gl05 .pep 

1 MSAETYTQIG WVGLGQMGLP MVTRLLDGGI EVGVYNRS PD KTAPI SAKGA 

51 KVYGSTAELV RAC PVIFLMV SPYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 I FSLVGKKTF HFGDVGKGSG AKLVLNSLLG I FGEAYSEAM LMARQFGIDT 

201 DTI VEAIGGS AMDSPMFQTK KSLWANREFP PAFALKKASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 365>: 



ml05 . seq 












1 


ATGTCCGCAA 


ACGAATACGC 


ACAAATCGGC 


TGGaTAGGCT 


TAGGGCAAAT 


51 


GGGTCTGCCT 


ATGGTAACGC 


GGCTCTTGGA 


CGGCGGCATC 


GAAGTCGGCG 


101 


TATACAACCG 


CTCGCCCGAC 


AAAACTGCCC 


CCATCTCCGC 


CAAAGGCGCA 


151 


AAAGTTTACG 


GCAACACCGC 


CGAACTCGTC 


CGCGACTATC 


CCGTCATTTT 


201 


CCTGATGGTT 


TCCGACTATG 


CCGCCGTGTG 


CGACATCCTG 


AACGGAGTCC 


251 


G CG ACGGATT 


GGCCGGCAAm 


ATCATCGTCA 


ACATGAGCAC 


CATCTCCCCG 


301 


ACCGAAAaGC 


TCGCCGTCAA 


AGCACTTGTC 


GAAGCGCAGm 


GaCAGTTTGC 


351 


CGAAGCACCC 


GTTTCCGGAT 


CGGTCGGGCC 


CGCCACCAAC 


GGCACGCTGC 


401 


TGATTCTGTT 


CGGCGGCAGC 


GAAcCGtTTT 


AAACCCGCTG 


CAAAAAATAT 


451 


TTTCCCTCGT 


C^-GCAAAAAA 


ACCTTCCATT 


TCGGCGATGT 


CGGCAAAGGT 


501 


TCGGGCGCGA 


AACTCGTCTT 


GAACTCGCTC 


TTGGGCATTT 


TCGGCGAaCG 


551 


TAcAGCGAAs 


GmTgCTGATG 


GCGCGGCAGT 


TCGGCATCGA 


TACCGACACC 


601 


ATCGTCGAAG 


CCATCGGsGA 


CTCGGCAATG 


GACTCGCCCA 


TGTTCCAAAC 


651 


CAAAAAATCC 


CTGTGGGCAA 


ACCGCGAATT 


CCCGmCCGmC 


TTCGCCCTCA 


701 


AACACGCCTC 


CAAAGACCTC 


AACCTCGCCG 


TCAAAGAGCT 


TGAACAGGCA 


751 


GGCAACACCC 


TGCCCGCCGT 


CGAAACCGTT 


GCTGCCAGCT 


ACCGCAAAGC 


S01 


AGTCGAAGCC 


GGCTACGGGA 


CACAGGACGT 


TTCCGGCGTT 


TACCTGAAAC 


851 


TGGCAGAACA 


CTGA 









This corresponds to the amino acid sequence <SEQ ID 366; ORF 105>: 

ml05 .pep 

1 MSANEYAQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPI SAKGA 

51 KVYGNTAELV RDY PVIFLMV SPYAAVCDIL NGVRDGLAGX IIVNMSTISP 

101 TEKLAVKALV EAQRQFAEAP VSGSVGPATN GTLLILFGGS EPFXTRCKKY 

151 FPSSAKKPSI SAMSAKVRAR NSSXTRSWAF SANVQRXXLM ARQFGIDTDT 

2 01 IVEAIGDSAM DSPMFQTKKS LWANREFPXX FALKHASKDL NLAVKELEQA 

2 51 GNTLPAVETV AASYRKAVEA GYGTQDVSGV YLKLAEH 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 105 shows 79.9% identity over a 289 aa overlap with a predicted ORF (ORF 105.ng) 
from N. gonorrhoeae; 

ml05/gl05 



10 20 30 40 50 60 

gl05 . pep MS AETYTQ I GWVGLGQMGLPMVTRLLDGG I EVGVYNRS PDKTAP I S AKGAKVYGSTAELV 

III: hlllhiliililllllltililillllililiilililMlitililMilit 

m 1 0 5 MSANEYAQ I GW I GLGQMGL PMVTRLLDGG I EVGVYNRS PDKTAP I SAKGAKV YGNTAELV 

10 20 30 40 50 60 



70 80 90 100 110 120 

gl05 .pep RAC P V I F LMV S D Y AA VCD I LNG VRDG LAG K 1 1 VNM STISPTEN LAV KAL V E AAGGQ F AE A 

I IIIIIIIIIIIIIIIIIIIIIIIIII lllllllllllhlllllllll Mill 
ml 05 RDYPVIFLMVSDYAAVCDILNGVRDGLAGXIIVNMSTISPTEKLAVKALVEAQR-QFAEA 
70 80 90 100 110 



130 140 150 160 170 180 

gl0 5 . pep PVSGSVGPATNGTLLILFGo -EAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 

Illlllllllllllllllllll ^ :| I : I I - h 

ml 0 5 PVSGSVGPATNGTLLI LFGGSEPFXTRCKKY FPSSAKKP -SI SAMSAKVRARNSSXTRSW 

120 130 140 150 160 170 
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190 200 210 220 230 240 

gl05 .pep I FGE AYS E AMLMARQFG I DTDT I VEA I GGS AMDS PM FQT KKS LWANRE F P PAFALKKAS K 

I • : i I I I I I I I M I i I I I 1 I I I I I I I I I I I I I I S I I II I I I I | I ! | I | | I 
ml 0 5 AFSANVQRXXLMAROFGIDTDTIVEAIGDSAMDSPMFOTKKSLWANREFPXXFALKHASK 
l^O 190 200 210 220 230 

250 260 270 280 289 

gl05 . pep DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEH 
I I , , i : i. ' I : l!l!l|[|i!|M 
ml 05 DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGTQDVSGVYLKIAEH 
240 250 260 270 280 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 367>: 
al05 . seq 

1 ATGTCCGCAA ACGAATACAC ACAAATCGGC TGGAT AGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

2 01 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 
251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 
301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

3 51 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG GCCCGCCACC AACGGCACGC 

4 01 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 
4 51 ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 
501 AGGTTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTGGGC ATTTTCGGCG 
551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 
601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCCATGTT 
651 CCAAACCAAA AAATCCCTGT GGGCAAACCG CGAATTCCCA CCCGCCTTCG 
701 CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA 
7 51 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 
801 CAAAGCAGTC GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 
851 TGAAATTGGC AGAACACTGA 

This corresponds to the amino acid sequence <SEQ ID 368; ORF 105.a>: 

al05 . pep 

1 MSANEYTQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDY PVI FLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG I FGE AY SEAM LMARQFGIDT 

2 01 DTIVEAIGGS AMDS PM FQT K KSLWANREFP PAFALKHASK DLMLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH * 

ml05/al05 96.5% identity in 289 aa overlap 

10 20 30 40 50 60 

ml05.pep MSANEYAQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
II I II I : 1 I I 1 I I I I I I i I I I I I I I i M I ) I I 1 I I ! ! I I I I I I I I I I I i I ! I I I j I I I I I 
al05 MSANEYTQIGWIGLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
10 20 30 40 50 60 

70 80 90 100 110 119 

ml05.pep RDYPVI FLMVSDYAAVCDILNGVRDGLAGKI IVNMSTISPTENLAVKALVEAAG-QFAEA 

I I I M I I I I 1 ! I I I I It I f I I i I I I I ! I I I I M I I I 1 I I 1 i I M I I I I i I I I M Mill 
a 10 5 RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA 

70 80 90 100 110 120 

120 130 140 150 160 170 179 

ml05. pep PVSGSVG PATNGTLLILFGGSEAVLNPLQKI FSLVGKKTFHFGDVGKGSGAKLVLNSLLG 

1 M I i I I I ! I I I I I I I I I II II I I I I I I I M I M I I I I I I j I ! I I I I I I i I | I I | | | | | j 
al0 5 PVSGSVG PATNGTLLILFGGSEAVLNPLQKI FSLVGKKTFHFGDVGKGSGAKLVLNSLLG 

130 140 150 160 170 180 

180 190 200 210 220 230 
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ml 0 5 . pe p I FGDV- QRXM LMARQ FGIDTDTIVEAIG D SAM DS PM FQTKK S L WAN RE FPXAFA LKH AS K 

III:: : : I I I I I I I I I I I I I ! I ! I I I I I I t ! I I 1 I I I I I ! I I I I I i | I I I I ' I I .' 1 
a 1 0 5 I FG E A Y S E AM LMARQ FG I DT DT I V E A I G G S AM D S PM FQTK K S LWANR EFPPA FA L KK ASK 

190 200 210 220 230 2-30 

240 250 260 270 280 

ml 0 5 . pep DLNLAVKELEQAGHTLPAVETVAASYRKJWEAGYGEQDVSGV Y LKLAEHX 

I I I I I I I I i I I I ! I i I ! ! I I I I i I I I I I I I I I I I I I I II I I I! I I I I II | 
a 1 0 5 DLN LAV KELE Q AG NTLPAVET V AAS Y RKAVEAG YGEQDVSGVY LKLAE HX 

250 260 270 280 290 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 369>: 

gl05-l . seq 

1 ATGTCCGCAG AAACATACAC ACAAATCGGC TGGGTAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGAGCA 

151 AAAGTTTACG GCAGCACCGC CGAACTCGTC CGCGCCTGCC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

2 51 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG ACCCGCCACC AACGGCACAC 

401 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTTTAAACCC GCTGCAAAAA 

4 51 ATATTTTCCC TTGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGCTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTAGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGG.\:T CGCC7ATGTT 

651 TCAAACAAAA AAATCACTAT GGGCAAACCG TGAGTTCCCC CCTGCCTTTG 

701 CACTCAAACA CGCTTCCAAA GACCTTAACC TCGCCGTCAA AGAGCTTGAA 

7 51 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTT GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAATTGGC AGAACACTGA 



This corresponds to the amino acid sequence <SEQ ID 370; ORF 105-l.ng>: 

gl05-l .pep 

1 MSAETYTQ1G WVGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGSTAELV RACP VIFLMV SDYAAVCDIL NGVRDGLAGK 1IVNMSTISP 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLILFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLWSLLG IFGEAYSEAM LMARQFGIDT 

201 DTI VEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 371 >; 

ml 0 5 - 1 . seq 

1 ATGTCCGCAA ACGAATACGC ACAAATCGGC TGGATAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG GCCCGCCACC AACGGCACGC 

401 TGCTGATTCT GTTCGGCGGC AGCGAAGCcG TTTTAAACCC GCTGCAAAAA 

4 51 ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGTTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTGGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAnCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGsGACTCG GCAATGGACT CGCCCATGTT 

651 CCAAACCAAA AAATCCCTGT GGGCAAACCG CGAATTCCCG CCCGCCTTCG 

701 CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA 

751 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

301 CAAAGCAGTC GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAACTGGC AGAACACTGA 



This corresponds to the amino acid sequence <SEQ ID 372; ORF 105-1>; 

ml 05-1 .pep 

1 MSANEYAQIG WIGLGQMGLP MVTRLLDGGI EVGVYURSPD KTAPISAKGA 
51 KVYGNTAELV RDY PVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTISP 
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101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLI LFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG I FGEAYSEXM LKARQFGI DT 

201 DTIVEAIGDS AMDSPMFQTK KSLWAHREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLALH * 

ml05-l/gl05-l 96. 9% identity in 239 aa overlap 



10 2 0 30 4 0 50 60 

m!05-l .pep MSANEYAQI GWI GLGQMGLPMVTRLLDGGIEVGVYNRSPDKTAF1 SAKGAKVYGNTAELV 
I I I : 1:1! I I : I I I I I I 1 I 1 I I I I I I I I I I ( ! I I I I i I I I I I I I I I I I ! I I I I : II I 1 I 
g 105-1 MSA.ETYTQIGW VGLGQMGLPMVTRLLDGGI EVGVYNRSPDKTAPI SAKGAKVYGSTAELV 

10 20 30 40 50 50 



70 80 90 100 HO 120 

ml05- 1 .pep RDYPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTEIILAVKALVEAAGGQFAEA 
I I I i I I I I I if I I I I I I I I I i I I I I 1 t i I I I I I I I I I I I I ! I I I I I I I I ! I I I I I I I I 
gl0 5-l RACPVIFLMVSDYAAVCDILNGVRDGLAGKIIVNMSTISPTENLAVKALVEAAGGQFAEA 
70 80 90 100 110 120 



130 140 150 160 170 180 

ml05-l .pep PVSGSVGFATNGTLLILFGGSEAVLNPLQKIFSLVGKKTFHFGDVGKGSGAKLVLNSLLG 
I i I I i M 1 I I I I I i i i I M I I I i I I I I I 1 I I t I i i i I I ) i i 1 I i i 1 ! i 1 t i 1 I 1 M | I I t 
g 105-1 PVSGSVGPATNGTLLILFGGSEAVLNPLQKI FSLVGKKTFKFGDVGKG5GAKLVLNSLLG 

130 140 150 160 170 130 



190 200 210 220 230 240 

ml 05-1 .pep I FGEAYSEXMLMARQFGI DT DTI VEA I GDS AMDS PMFQTKKSLWANREFP PAFALKHASK 
I i ! I I ! I ! I I I I I I t I i t I I I I I I I 1 ! I I 1 1 i I I I I I I I I ! I I I i I { t I i { I > I I I I I 
gl05-l I FGEAYSEAMLMARQFGI DTDTI VEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHAS?; 

190 200 210 220 230 240 



250 260 270 280 290 

ml05-l -pep DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
I I i I I I I I i I I I I I I I I I I I I I ! t 1 I I I I I I I I I I ! I M ! I I I ! 1 I I I ! I 
g 1 05 - 1 DLNLAVKELEQAGNTLPAVETVAAS YRKAVEAGYGEQDVSGV YLKLAEHX 

250 260 270 280 290 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 373>: 

al05-l . seq 

1 ATGTCCGCAA ACGAATACAC ACAAATCGGC TGGATAGGCT TAGGGCAAAT 

51 GGGTCTGCCT ATGGTAACGC GGCTCTTGGA CGGCGGCATC GAAGTCGGCG 

101 TATACAACCG CTCGCCCGAC AAAACTGCCC CCATCTCCGC CAAAGGCGCA 

151 AAAGTTTACG GCAACACCGC CGAACTCGTC CGCGACTATC CCGTCATTTT 

201 CCTGATGGTT TCCGACTATG CCGCCGTGTG CGACATCCTG AACGGAGTCC 

251 GCGACGGATT GGCCGGCAAA ATCATCGTCA ACATGAGCAC CATCTCCCCG 

301 ACCGAAAACC TCGCCGTCAA AGCACTTGTC GAAGCCGCAG GCGGACAGTT 

351 TGCCGAAGCA CCCGTTTCCG GATCGGTCGG GCCCGCCACC AACGGCACGC 

4 01 TGCTGATTCT GTTCGGCGGC AGCGAAGCCG TTT7AAACCC GCTGCAAAAA 

451 ATATTTTCCC TCGTCGGCAA AAAAACCTTC CATTTCGGCG ATGTCGGCAA 

501 AGGTTCGGGC GCGAAACTCG TCTTGAACTC GCTCTTGGGC ATTTTCGGCG 

551 AAGCGTACAG CGAAGCGATG CTGATGGCGC GGCAGTTCGG CATCGATACC 

601 GACACCATCG TCGAAGCCAT CGGCGGCTCG GCAATGGACT CGCCCATGTT 

651 CCAAACCAAA AAATCCCTGT GGGCAAACCG CGAATTCCCA CCCGCCTTCG 

701 CCCTCAAACA CGCCTCCAAA GACCTCAACC TCGCCGTCAA AGAGCTTGAA 

7 51 CAGGCAGGCA ACACCCTGCC CGCCGTCGAA ACCGTTGCTG CCAGCTACCG 

801 CAAAGCAGTC GAAGCCGGCT ACGGCGAACA GGACGTTTCC GGCGTTTACC 

851 TGAAATTGGC AGAACACTGA 



This corresponds to the amino acid sequence <SEQ ID 374; ORF 105-1. a>: 

al05-l -pep 

1 MSANEYTQIG WIGLGQMGLP MVTRLLDGGI EVGVYNRSPD KTAPISAKGA 

51 KVYGNTAELV RDY PVIFLMV SDYAAVCDIL NGVRDGLAGK IIVNMSTI5P 

101 TENLAVKALV EAAGGQFAEA PVSGSVGPAT NGTLLI LFGG SEAVLNPLQK 

151 IFSLVGKKTF HFGDVGKGSG AKLVLNSLLG I FGEAYSEAM LMARQFGI DT 

201 DTIVEAIGGS AMDSPMFQTK KSLWANREFP PAFALKHASK DLNLAVKELE 

251 QAGNTLPAVE TVAASYRKAV EAGYGEQDVS GVYLKLAEH * 



alO5-l/ml05-l 99.0% identity in 289 aa overlap 



10 20 30 40 50 60 

al05~l . pep MSANEYTOIGWIGLGQMGLPHVTRLLDGGIEVGVYNRSPDKTAPISAKGAKVYGNTAELV 
I I I I I I : I I I I M t I t I I I I I I I I I I I | j | | I I I | I | ! I | | | | | | | | | | ] | | M | | | | I | 
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ml 05-1 MSANEYAQIGWIGLGQMGLPMVTRLLDGGI EVGVYNRSPDKTAPI SAKGAKV YGNTAELV 

10 20 30 40 50 60 

70 80 90 100 110 120 

a 105-1 . pep RDYPVI FLMVSDYAAVCDI LMGVRDGLAGKI I VNMSTISPTENLAVKALVEAAGGQFAEA 
I t I I | I f t I I I M I I II I t I M ! I I I I I I I I ! I I I ! i I I I I I I i 1 I ! I i ' I I I I j I I I I 
ml 0 5- I RDYPVI FLMVS DYAAVCDI LNGVRDGLAGKI I VNMSTI S PTENLAVKALVEAAGGQFAEA 

70 SO 90 100 110 120 

130 140 150 160 170 180 

al05-l .pep PVSGSVGPATNGTLLI LFGGSEAVLNPLQKI FSLVGKKTFHFGDVGKGSGAKLVLNSLLG 
| | | | j M I | | I I I I I ! I I I I II I I I I I I I I I I I I I I I I M ! I I I I I I I I I I I I I I I II I I 
ml 05-1 pvSGSVGPATKGTLLI LFGGSEAVLNPLQKI FSLVGKKTFHFGDVGKGSGAKLVLNSLLG 

130 "40 150 160 170 180 

190 200 210 220 230 24 0 

al 05-1 .pep I FGEAYSEAMLHARQFGIDTDTI VEAIGGSAMDSPMFQTKKSLWANREFPPAFALKHASK 
| | I I I I I I I I I II I I I I I I ) I ! I II f I I I I I I I I I I M M I I ! I I I I I I I I I i I I I I I 
ml05-l IFGEAYSEXNLMARQFGIDTDTIVEAIGDSAMDSPMFQTKK3LWANREFPPAFALKHASK 
190 200 210 220 230 240 

250 260 270 280 290 

al05-l . pep DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 
I I I I I I I I II I ! M I II 1 I I I I I I ! I I I I I ! I I I I I I I I I I I ! i I II I I I 
ml 05-1 DLNLAVKELEQAGNTLPAVETVAASYRKAVEAGYGEQDVSGVYLKLAEHX 

250 260 270 280 2S0 



The following partial DNA sequence was identified in A', gonorrhoeae <SEQ ID 375>: 

"TTATTTG GGCAACCGGC GGCCTGGTTG CCGATGCCAA 



gl07 . seq 




1 


ATGGTATTAA 


51 


ACCGTCCGTC 


101 


TTTACGCCGC 


151 


GGCAACGTTT 


201 


GgCGAaATCc 


251 


TCTggcaaac 


301 


gtgattgccg 


351 


attcgcccaa 


401 


tggcaatcgc 


451 


atcgccggac 


501 


accgtcgcgc 



This corresponds to the amino acid sequence <SEQ ID 376; ORF 107.ng>: 

gl07 . pep 

1 MVLTFIWATG GLVADA KPSV LFDTQAPVGT GYWIYAATAL PVCLASFGFH 

51 GNVSSLLKYF KGDAPKVAKS IW AGTLVALV IYVLWQTAI Q SNLPRNEFAP 

101 VIAAERQLSV LNETLSKFAQ TGDMDK ILSL FPYMAIATSF LGV TLGLFDN 

151 IAGHLQMERQ YVRAAPKPSR * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 377>: 



ml07 . seq 












1 


ATGGTATTGA 


CCTTTATTTG 


GGCGGCCGGC 


GGGCTGATTG 


CCGATGCCAA 


51 


GCCGTCCGTC 


CTCTTCGATA 


CCCAAGCCCC 


CGCCGGCACA 


AACTACTGGA 


101 


TTTACGCCGs 


CACCGCCCTG 


CCCGTCTGCC 


TCGCTTCCTT 


CGGCTTCCAC 


151 


GGCAACGTCT 


CCAGCCTGCT 


CAAATACTTT 


AAAGGCGACG 


CGCCCAAAGT 


201 


GGCTAAATCC 


ATCTGGACGG 


GCACACTGAT 


TGCGCTGGTA 


ATTTACGTCC 


251 


TCTGGCAAAC 


CGCCATCCAA 


GGCAACCTGC 


CGCGCAACGA 


GTTCGCCCCC 


301 


GTCATCGCCG 


CCGAAGGGCA 


AGTCTCCGTC 


CTCATCGAAA 


CCCTGTCCAA 


351 


ATTCGCCCAA 


ACCGGCAATA 


TGGACAAAAT 


ATTGTCCCTG 


TTTTCCTATA 


401 


TGGCGATCGC 


CACCTCGTTT 


TTAGGCGTAA 


CGCTCGGACT 


CTTCGACTAC 


451 


ATCGCCCATC 


TTCAAATGGA 


ACGACAGCAT 


CTCCGGgCCG 


CACCAAAACC 


501 


GCCGCGCTGA 











This corresponds to the amino acid sequence <SEQ ID 378; ORF 107>: 

ml07 .pep. . 

1 MVLTFI WAAG GLIADA KPSV LFDTQAPAGT NYWI YAXTAL PVCLAS FGFH 
51 GNVSSLLKYF KGDAPKVAKS IWTGTLIALV IYVLWQTAIQ GNLPRNEFAP 
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101 VIAAEGQVSV LIE71.GKFAQ TGNMPK ILSL F5YMAIATSF LGV TLGLFDY 
151 IAHLQMERQH LRAAPKPPR * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from A', gonorrhoeae 

ORF 107 shows 89.4% identity over a 1 70 aa overlap with a predicted ORF (ORF 107.ng) 
from A', gonorrhoeae: 

ml07/gl07 

10 20 30 40 50 60 

m 1 0 7 pep MVLT F I WAAGGL I ADAKPS VLFDTQAPAGTNYW I YAX TAL P VCLAS FG FHGNVSS LLKY F 

i | | I i | I I = I I ! : i I I I I I I I I I I M h I h II I I I I I I I I I I ! I I I I I I I I I I I I I I I 
gl07 MVLTFIWATGGLVADAKPSVLFDTOAPVGTGYWIYAATALPVCLASFGFHGNVSSLLKYF 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml 07 .peo KGDAPKVAKS I WTGTLIALVIYVLWQTAIQGNLPRNEFAPVIAAEGQVSVLIETLSKFAQ 

IllUilllllhllhlillliilliiihl Mil III! Mill hill I I I I I I I I 

g 1 0 7 KGDAPKVAKS I WAGTLVALVI YVLWQTAI QSNLPRNE FAPVI AAERQLS VLNETLS KFAQ 

70 80 90 100 110 120 

130 140 150 160 170 

ml07 -pep TGNMDKILSLFSYMAIATSFLGVTLGLFDYIA-HLQMERQHLRAAPKPPR 

Ihllllllll lllllllllllllllll II llllllhhlilil | 
g!0 7 TGDMDKILSLFPYMAIATSFLGVTLGLFDNIAGHLQMERQYVRAAPKPSR 
130 140 150 ISO 170 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 379>: 

al07 . seq 

1 ATGGTATTAA CCTTTATTTG GGCAACCGGC GGCCTGATTG CCGATGCCAA 

51 ACTGCCCGTC CTCTTCGACA CCCAAGCCCC TACCGGCACC AACTACTGGA 

101 TTTATGTCGC CACCGCCCTG CCCGTCTGCC TTGCGTCATT CGGTTTCCAC 

151 GGCAACGTCT CCAGCCTGCT CAAATACTTT AAAGGCGACG CGCCCAAAGT 

201 GGCTAAATCC ATCTGGACGG GCACACTGAT TGCGCTGGTA ATTTACGTCC 

251 TCTGGCAAAC CGCCATCCAA GGCAACCTGC CGCGCAACGA GTTCGCCCCC 

301 GTGATTGCCG CCGAAGGGCA AGTCTCCGTC CTGATTGAAA CCCTGTCCAA 

3 51 ATTCGCCCAA ACCGGCAATA TGGACAAAAT ATTGTCCCTG TTTTCCTATA 

4 01 TGGCGATCGC CACCTCGTTT TTAGGCGTAA CGCTCGGACT CTTCGACTAC 
4 51 ATCGCCGACA TCTTCAAATG GAACGACAGC GTGTCCGGCC GCACCAAAAC 
501 CGCCGCGCTG ACCTTCCTGC CGCCTCTAAT TTCCTGCCTG CTCTTCCCCA 
551 CCGGCTTTGT TACCGCCATC GGCTACGTCG GCCTGGCGGC AACCGTCTGG 
601 ACAGGCATCA TCCCCGCCAT GCTGCTCTAC CGTTCGCGCA AAAAATTCGG 
651 CGCAGGCAAA ACCTATAAAG TTTACGGCGG CTTGTGGCTG ATGGTTTGGG 
701 TCTTCCTTTT CGGCATCGTC AACATCGCCG CACAGGTATT GAGCCAAATG 
7 51 GAACTCGTCC CCGTATTTAA AGGATAA 

This corresponds to the amino acid sequence <SEQ ID 380; ORF 107.a>: 

al07 .pep 

1 MVLTFIWATG GLIADAKLPV LFDTQAPTGT NYWIYVATAL PVCLASFGFH 

51 GNVSSLLKYF KGDAPKVAKS IWTGTLIALV IYVLWQTAIQ GNLPRNEFAP 

101 VIAAEGQVSV LIETLSKFAQ TGNMDKILSL FSYMAIATSF LGVTLGLFDY 

151 IADIFKWNDS VSGRTKTAAL TFLPPLISCL LFPTGFVTAI GYVGLAATVW 

201 TGI I PAMLLY RSRKKFGAGK TYKVYGGLWL MVWVFLFGIV NIAAQVLSQM 

251 ELVPVFKG* 

ml07/al07 94.8% identity in 154 aa overlap 

10 20 30 40 50 60 

ml07 .pep MVLT FIWAAGGLI ADAKPS VLFDTQAPAGTNYW I YAXTALPVC LAS FGFHGNVSSLLKYF 

I I I f t I i I : I i I I I I I I I i f i I i I I : I I 1 I i i f : I I f I I ! I I I M I I I I I I I I I I M 
a!07 MVLTFIWATGGLIADAKLPVLFDTQAPTGTNYWIYVATALPVCLAS FGFHGNVSSLLKYF 

10 20 30 40 50 60 
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70 80 90 100 110 120 

ml07 . pep KGDAPKVAKSIWTGTLIALVI YVLWQTAIQGNLPRNEFAPVI AAEGQVSVLI ETLSKFAQ 

! I I ! I ! i I I f I I I I I i I I I I I I i [ I I I M I I I I I I I I I I I t I I I I i I i I ! I I I I I I I I I I 
a 107 KGDAPKVAKS I WTGTL IALVI YVLWQTAIQGNLPRNEFAPVI AAEGQVSVLI ETLSKFAQ 

70 30 90 100 110 120 

130 140 150 160 170 

m 1 0 7 . pe p TGNM D K I L S L F S YMA I AT S FLG VT LG L FD Y I AH LQM E RQ HLRAAP KPPRX 

I I! I i I I ! I I I I I I I I ! I f ! I I I I I I I f ! It I : 
a 107 TGHMDKI LSLFSYMAIATSFLGVTLGLFDYIADI FKWNDSVSGRTKTAALTFLPPLISCL 

130 140 150 160 170 180 

a 107 LFPTGFVTAIGYVGLAATVWTGI I PAMLLYRSRKKFGAGKTYKVYGGLWLMVWVFLFGI V 

190 200 210 220 230 240 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 38 1>: 

gl08 . seq 

1 ATGttgccgg gCTTCAACCG GATATTCAaa cggTTTGCTC CAACACTCGG 

51 AAcggCGCAT AAAACGCCgc ccTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATCCTA TCGCCATAAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGCG GCAATCTTAC TCGGCGGGTG 

201 CGCCGCCGGC GGCAACACAT TCGGCAGCTT AGACGGCGGC ACGGGTATGG 

251 GTGGCAGCAT CGTCAAAATG ACGGTAGAAA gccAATGCCG TGCGGAATTG 

3 01 GACAGGCGCA GCGAATGGCG TTTGACCGCG CTGGCGATGA GTGCCGAAAA 
351 ACAGGCGGAA TGGGAAAACA AGATTTGCGG CTGCGCTACC GAAGAAGCAC 

4 01 CTAACCAGCT GACCGGCAAC GATGTGATGC AGATGCTGAa ccagtccacG 
4 51 CGCaatcagg cacTtgccgc CCtgaccgTC AAAacggtTT CcgcctgcTT 
501 CAaacgcctg tACCGCTAa 

This corresponds to the amino acid sequence <SEQ ID 382; ORF 108.ng>: 

gl08 -pep 

1 MLPGFNRIFK RFAPTLGTAH KTPPFALSRT GRLIRSYRHK RRGFNRKGIE 

51 MNKT LSILPA AILLGGCAAG G NTFGSLDGG TGMGGSIVKM TVESQCRAEL 

101 DRRSEWRLTA LAMSAEKQAE WENKICGCAT EEAPNQLTGN DVMQMLNQST 

151 RNQALAALTV KTVSACFKRL YR* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 383>: 

mi08 . seq 

1 ATGTTGCCGG GCTTCAACCG GATATTCAAA CGGTTTGTTC CAACACTCGG 

51 AACGGCGCAT AAAACGCCGC CCTTCGCGTT ATCCCGAACG GGGCGGCTAA 

101 TCAGATTCTA TCGCCATAAA AGGCGGGGTT TCAACCGAAA AGGAATTGAG 

151 ATGAATAAAA CCTTGTCTAT TTTGCCGGTG GCAATCTTAC TCGGCGGCTG 

201 CGCCGCCGGA GGCGGTAACA CATTCGGCAG CTTAGACGGT GGCACAGGCA 

2 51 TGGGCGGCAG CATCGTCAAA ATGGCGGTTG GGAGCCAATG CCGTGCGGAA 
301 TTGGACAAAC GCAGCGAATG GCGTTTGACC GCGCTGGCGA TGAGTGCCGA 

3 51 AAAAC AGGCG GAGTGGGAAA ACAAGATTTG CGCTTGCGTC GCCCAAGAAG 
401 CACCCGAACG GATGACCGGC AACGATGTGA TGCAGATGCT GGCTCCGTCC 

4 51 ACGCGCAATC AGGCACTTGC CGCCCTGACC GCCAAAACGG TTTCCGCCTG 
501 CTTCAAACAC CTGTACCGCT AA 

This corresponds to the amino acid sequence <SEQ ID 384; ORF 108>: 

ml08 -pep 

1 MLPGFNRIFK RFVPTLGTAH KTPPFALSRT GRLIRFYRHK RRGFNRKGIE 

51 MNKT LSILPV AILLGGCAAG G GNTFGSLDG GTGMGGSIVK MAVGSQCRAE 

101 LDKRSEWRLT ALAMSAEKQA EWENKICACV AQEAPERMTG NDVMQMLAPS 

151 TRNQALAALT AKTVSACFKH LYR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 108 shows 89.6% identity over a 1 73 aa overlap with a predicted ORF (ORF 108.ng) 
from N. gonorrhoeae: 

ml08/g!08 
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10 20 30 40 50 60 

ml 08 Dep MLPGFNRIFKRFVPTLGTAHKTPPFALSRTGRLIRFYRKKHRGFNRKGIEMNKTLSILPV 

I i I I I I I I I I I I • I I I I I I I 1 I I I I ! I !! 1 I I I M I I I I I I M I I I I I II I I I I I i ! i 
gl 08 MLPGFNRI FKRFAPTLGTAHKTPPFALSRTGRLIRSYPJ^KRRGFNRKGIEMIvKTLSILPA 

10 20 30 40 50 60 

70 80 90 100 110 120 

mlOtl . pep AILLGGCAAGGGNTFGSLDGGTGMGGSIVKMAVGSQCRAELDKRSEWRLTALAMSAEKQA 

I i! I IIMIIIIIIIMIMIIhl IMMIIMIMIIMiMMMll 

glOS AI LLGGCAAGG- NTFGSLDGGTGMGGSIVKMTVESOCRAELDRRSSWRLTALAMSAEKQA 

70 80 90 100 110 

130 140 150 160 170 

ml 08 . pep EWENKICACVAQEAPERMTGNDVMQMLAPSTRNQALAALTAKTVSACFKHLYRX 

|||lil|:|:::|lh-liiillill I I 1 i I I 1 I 1 I N 1 I i I II I I : I I I 
gl08 EWENKICGCATEEAPNQLTGNDVMOMLNQSTRNQALAALTVKTVSACFKRLYRX 
120 130 140 150 160 170 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 385>: 

al08 . seq 



1 


ATGTTGCCGG 


51 


AACGGCGCAT 


101 


TCAGATTCTA 


151 


ATGAATAAAA 




CGCCGCCGGG 


251 


TGGGCGGCAG 


301 


TTGAACAAAC 


351 


AAAACAGGCG 


401 


CACCCAACCA 


451 


ACGCGCAATC 


501 


CTTCAAACAC 



TCGCCATAAA AGGCGGGGTT TCAACCGAAA 



CATCGTCAAA ATGGCGGTAG AAAGCCAATG 
GCAGCGAATG GCGTTTGACC GCGCTGGCGA 



CAACACTCGG 
GGGCGGCTAA 
AGGAATTGAG 
TCGGCGGCTG 
GGCACAGGTA 
CCGTGCGGAA 
TGAGTGCCGA 
GCCCAAGAAG 
GGATCCGTCC 
TTTCCGCCTG 



This corresponds to the amino acid sequence <SEQ ID 386; ORF 1 08. a>: 

al08 .pep 

1 MLPGFNRIFK KFVPTLGTAH KTPPFALSRT GRLIRFYRHK RRGFNRKGIE 

51 MNKT LSILFV AILLG GCAAG GGNTFGSLDG GTGMGGSIVK MAVESQCRAE 

101 LNKRSEWRLT ALAMSAEKQA EWENKICACV AQEAPNQLTG NDVMQMLDPS 

151 TRNQALAALT AKTVSACFKH LYR* 



ml08/al08 



96.5% identity in 173 aa overlap 



10 20 30 40 50 60 

mi08 .pep MLPGFNRI FKRFVPTLGTAHKTPPFALSRTGRLIRFYRHKRRGFNRKGIEMNKTLSILPV 

1 It I I I i I ! ! I I I ( I I t I I I i I I I ! i U I I i I ( I ( I I 11 i I I i I i I I 1 I Ml I i I I i t I I 
a 108 MLPGFNRI FKRFVPTLGTAHKTPPFALSRTGRL I RFYRHKRRGFNRKG I EMNKT LSI LPV 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 08 .pep AILLGGCAAGGGNTFGSLDGGTGMGGS I VKMAVGSQCRAELDKRSEWRLT ALAMSAEKQA 

I I I I I I i I M i I I I !! I I I I I 1 I I M 1 i I 1 I I I I I I I M I : I I I M I I I I M I ! I I I M 
a 108 AILLGGCAAGGGNTFGSLDGGTGMGGSIVKMAVESQCRAELNKRSEWRLTALAMSAEKQA 

70 80 90 100 110 120 

130 140 150 160 170 

ml08 . pep EWENKICACVAQEAPERMTGNDVMQMLAPSTRN'QALAALTAKTVSACFKHLYRX 
I I M I I I I I I I I I I I : : : I I I I I I M I I I I f I I I I I I ! I I I I If I M I j I j ! i 
a 108 EWENKICACV AQEAPNQLTGNDVMQMLDPSTRNQALAALTAKTVSACFKHLYRX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 387>: 
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gl09 . seq 

I ATGTATTATC GCCGGGTTGT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GATCGTAGGC GTATGCTTAC CGCTTTTGGA AGCGGGCATG 

101 GAAATGACGC GCAAAGGCAA AACCACCCAA TCCGCCGCCA TCGTGGTGTT 

151 CTCTTCCGTC TGGTCAATCC GG7TTTCGGC TGGGCGTTGA CGATGCTGTT 

201 GGATAATTTG GGCTTAATCG GCTGCAAAGA ACGCAGCGCG CAATTAGGTT 

251 TTGTCGGACG AGTATTGATA CCCGCAGTAG GTTTCTTAAT CTTGTGTGTG 

3 01 GCGATGGGTG CGGTCGGGAT GCTGCCCGGT ATCCCTCCGT TTTTGGAGCA 

3 51 GTTCAAATCT TTGGGCTAG 

This corresponds to the amino acid sequence <SEQ ID 388; ORF 109.ng>: 

gl0 9 . pep 

1 MYYRRWGLS DGLGDLAAGI DRRRMLTAFG SGHGNDAQRQ NHPIRRKRGV 

51 LFRLVNPVFG WALTMLLDNL GLIGCKERSA QLGFVGR VLI PAVGFLILCV 

101 AMGA VGMLPG IPPFLEQFKS LG 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 389>: 

ml09 . seq 

1 ATGTATTATC GCCGGGTTAT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GAGCGTAGCC TTGGTCGTAG GCGTATACTT ACCGCTTTTG 

101 GAAGCGGGCA TGGAAATGAC GCGCAAAGGC AAAACCACCC AATCCGCCGC 

151 CATCGTGGTG TTCTCTTlCG CCTTGTCAAT CCGGTTTTCG GCTGGGCGTT 

201 GACGATGCTG TTGGATAATT TGGGCTTAAT CGGCTGCAAA GAGCGCAGTG 

2 51 CGCAATTAGG TTTCGCCGGA CGCGTGTTGA TACCCGCAGT AGGTTTCTTG 

301 ATCTTGTGTG TGGCGATGGG TGCGGTCGGG ATGCTGCCCG GTATCCCGCC 

351 GTTTTTGGAA CACTTCAAAT CTTTGGGCTA G 

This corresponds to the amino acid sequence <SEQ ID 4; ORF 109>: 

ml 0 9 -pep 

1 MYYRRVMGLS DGLGDLAAGI ERSLGRRRIL TAFGSGHGND AQRQNHPJRR 

51 HRGVLFRLVN PVFGWALTML LDNLGLIGCK ERSAQLGFAG R VLIPAVGFL 

1°1 ILCVAMGA VG MLPGIPPFLE HFKSLG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 109 shows 92.9% identity over a 126 aa overlap with a predicted ORF (ORF 109.ng) 
from N. gonorrhoeae: 

m!09/gl09 



10 20 30 40 50 60 

ml09 . pep MYYRRVMGLSDGLGDLAAGIERSLGRRRILTAFGSGHGNDAQRQNHPIRRHRGVLFRLVN 

ihiiiiiiiiiiiiiimiMiiiiiiiim 

gl0 9 MY YRRVVGLSDGLGDLAAG I DR RRMLTAFGSGHGNDAQRQNHPIRRHRGVLFRLVN 

10 20 30 40 50 



70 80 90 100 110 120 

ml 0 9 .pep P V FGW ALTM L LDN LG L I G C KE RS AQ LG F AGR V LIPAVGFLIL CV AMG A V GM L PGIPPFLE 

lllllllllllll!MIIIMI!llllhl!lll!l!M!l!M!l!IIIIIIIIIIMI 

gl09 PVFGWALTMLLDNLGLIGCKERSAQLGFVGRVLI PAVGFLILCV AMGAVGMLPGI PPFLE 

60 70 80 90 100 110 



ml 0 9. pep HFKSLGX 

HUM 
gl0 9 QFKSLGX 
120 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 391>: 

al09 . seq 

1 ATGTATTATC GCCGGGTTGT GGGGCTATCC GATGGACTTG GCGATTTGGC 

51 AGCCGGTATT GAGCGTAGCC TTGGTCGTAG GCGTATACTT ACCGCTTTTG 

101 GAAGCGGGCA TGGAAATGAC GCGCAAAGGC AAAACCACCC AATCCGCCGC 

151 CACCGTGGTG TTCTCTTCCG CTTGGTCAAT CCGGTTTTCG GCTGGGCGTT 
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201 GACGATGCTG TTGU-.TAATT TGGGCTTAAT CGGCTGCAAA GAGCGCAGCG 

2 51 CGCAATTAGG TTTCACCGGA CGCGTATTGA TACCCGTAGT AGGTTTCTTG 
301 ATCTTGTGTG TGGCGATGGG TGCGGTCGGG ATGCTGCCCG GTATCCCGCC 

3 51 GTTTTTGGAG CACTTCAAAT CTTTGGGCTA G 

This corresponds to the amino acid sequence <SEQ ID 392; ORF 109>: 
al 0 9 . pep 

1 MYYRRVVGLS DGLGDLAAGI ER5LGRRRIL TAFGSGHGMD AQRQNKPIRR 
51 HRGVLFRLVN PVFGWALTML LDNLGLIGCK ERSAQLGFTG R VLIPWGFL 
101 ILCVAMGA VG MLPGI PPFLE HFKSLG* 

ml09/al09 97.6% identity in 126 aa overlap 

10 20 30 40 50 60 

ml 0 9. pep MY YRRVMGLS DGLGDLAAGI ERSLGRRR I LTAFGSGHGNDAQRQNH PI RRHRGVLFRLVN 

II 1 I I I : I t I M I I I M I I I I M i II I I i i ! i I I i , I I II I I I I I M j I i i i 11 M 1 It ) 
al0 9 MYYRRWGLSDGLGDLAAGIERSLGRRRILTAFGSGHGNDAQRQNHPIRRHRGVLFRLVN 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 09 , pep PVFGWALTMLLDNLGLIGCKERSAQLGFAGRVLIFAVGFLILCVAWGAVGMLPGI PPFLE 

M I I II I I I I II f i M I I I I I I I 1 I I i i : 1 I I I i I : I I I I I I i I I II I I I I I ! I I I i I I I 
al09 PVFGWALTMLLDNLGLIGCKERSAQLGFTGRVLI PWGFLILCVAMGAVGMLPGI PPFLE 

70 80 90 100 110 120 



m!09.pep HFKSLGX 
I I I I I I I 

al09 HFKSLGX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 393>: 

gill . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACCTT ATCCGCGCCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGaacaaacC GCGCAaaccg 

101 TTACCCTGCA AGGCGAAACG ATGGGTACGA CCtATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCCCCT GCC AAAATAC AAAAGCGCAT 

2 01 TGATGATGCG CTTAAAGAAG TCAACCGGCA GATGtccaCC TACCAGACCG 
251 ATTCCGAAAT CAGCCGGTTt atacagacan atgctggaga gctcttcgcg 
301 tntcatgcag nttctataac tgattccgcc gaagactgtc tgcctaatac 
351 gcctatctca tcggcgctct ga 

This corresponds to the amino acid sequence <SEQ ID 394; ORF 1 1 l.ng>: 

gill . pep 

1 MPSETRLPNL IRALIFALGF IFLNA CSEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AKIQKRIDDA LKEVNRQMST YQTDSEISRF IQTAGELFAH 

101 ASITDSAEDC LPNTPISSAL * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 395>: 

mill . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCGTCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGCGAAACG ATGGGCACGA CCTATAyCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AwAAACGCAT 

201 CGATGACGCG CTTAAAGAAk TCAACCGGyA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

3 01 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC GCCTGAACCG 

3 51 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

4 01 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

4 51 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA 

5 01 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 
551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 
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601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 

6 51 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCGTGGCGC ATCGGTATCG 
701 AGCAGCCCAA TATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG 
751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 
801 TAAAAACGGC AAACGCCTCT CCCATATCAT CAACCCGAAC AACAAACGAC 
351 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 
901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 
951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

1001 ATAAAGGCGG cTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGcTC 

1051 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 396; ORF 1 1 1>: 

mill . pep 

1 MPSETRLPNF IRVLI FALGF IFLNA CSEQT AQTVTLQGET MGTTYXVKYL 

51 SNNRDKLPSP AEIXKRIDDA LKEXNRXMST YQPDSEI SRF NQHTAGKPLR 

101 ISSDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKI ILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELKGK GKNARGEPWR IGIEQPNIVQ GGNTQT IVPL 

251 NNRSLATSGD YRIFHVDKNG KRLSHIINPN NKRPISHNLA SISWADSAM 

301 TADGLSTGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 
351 R* 

ORF 1 1 1 shows 88.7% identity over a 97 aa overlap with a predicted ORF (ORF 1 1 1 .ng) 
from N. gonorrhoeae: 

mill . pep/gill . pep 

10 20 30 40 50 60 

mill .pep MPSETRLPNFIRVLIFALGFIFLNACSEQTAQTVTLQGETMGTTYXVKYLSNNRDKLPSP 

illllllliOhiitillliilillllitiMiittlMlMlhlllilllilllill 

g _ I M r S ETRLPNL I RAL I FALGFIFI^ACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 

10 20 30 40 50 60 

70 80 90 100 110 120 

mill .pep AEIXKRIDDALKEXKRXMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 

hi imnm it iiiii mini i =ii: 

gill AKIQKRIDDALKEVNRQMSTYQTDSEISRFIQTXAGELFAXHAXSITDSAEDCLPNTPIS 
70 80 90 100 110 120 

130 140 150 160 170 180 

mill .pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGI DKI ILKQGKD YASLSKTHPK 

gill SALX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 397>: 

alll . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCACCT TGATATTTGC 

51 CCTGAGTTTT ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGTGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAGCGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC ACCTGAACCG 

351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CG^CCCCTTG GTCAACCTTT 

4 01 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

4 51 ATCAAACAAG CAGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA 

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 

601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 

651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCTTGGCGC ATCGGCATCG 

701 AACAGCCCAA CATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG 

7 51 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 
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801 TAAAAGCGGC AAACGCCTCT CCCATATCAT TAATCCGAAC AACAAACGAC 

8 51 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 

951 CT7AAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGCTC 

10 51 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 398; ORF 1 1 1 .a>: 

alll -pep 

1 MPSETRLPNF IRTLI FALS F IFLNA CSEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEISRF NCHTAGKPLR 

101 ISSDFAHVTA EAVHLNRLTH GALDVTVGPL VMLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGV DOTAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQI I VPL 

251 NNRSLATSGD YRIFHVDKSG KRLSHIINPN NKRPI SHNLA SISVVADSAM 

301 TADGL5TGLF VLGETEALKL AEREKLAVFL IVRDKGGYRT AMSSEFEKLL 

351 R* 



mlll/alll 97.7% identity in 351 aa overlap 



10 20 30 40 50 60 

mill .pep MPSETRLPKFIRVLIFALGFIFLNACSEQTAQTVTLQGETMGTTYXVKYLSNNRDKLPSP 
I I i I I j I I I I I I : I I I I ! : II I ! I I f I I I I I i I I I I If I I i I I I I : I I I I I I I I I I 1 I I ! 
alll MPSETRLPNFIRTLIFALSFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNMRDKLPSP 

10 20 30 40 50 60 



70 80 90 100 110 120 

mill .p&p AEIXKRIDDALKEXNRXMSTYQPDSEISRFNQHTAGKPLRISSDFA.HVTAEAVRLNRLTH 
III f I I I I I I I I II I I I I I I I I (I I I I I I I I I I I I I I I I I I I I I I II I I I : M I I I I 
alll AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVHLNRLTH 

70 30 90 100 110 120 



130 140 150 160 170 180 

mill .pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKI ILKQGKDYASLSKTHPK 

II I I I I I I I I I ! I I I I I I I I I I i I I I I I M II ! I I I I I I I II l I I I I 1 I I I I I I I M I I I 
alll GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKI ILKQGKDYASLSKTHPK 

130 140 150 160 170 180 



190 200 210 220 230 240 

mill .pep AYLDLSSIAKGFGVDKVAGELEKYGIQHYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 
I I t I I I I 1 I 1 i M I 1 t M 1 1 M I I M I 1 I 1 1 I I I ; I M I i I t I 1 I I I I I I 11 ! 1 I I j I 1 I 
alll AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELKGKGKNARGEPWRIGIEQPNIVQ 

190 200 210 220 230 240 



250 260 270 280 290 300 

mill .pep GGNTQIIVPLNNRSLATSGDYRIFHVDKNGKRLSHIINPNNKRPISHNLASISWADSAM 
i ! M I I i I I i I i ! I I M (I i i I I I I i II : I I I I ! I I 1 I M I I ! M i I I I I I i I I \\ M I I 
alll GGNTQI IVPLNNRSLATSGDYRIFHVDKSGKRLSHIINPNNKRPISHNLASISWADSAM 

250 260 270 280 290 300 



310 320 330 340 350 

ml 11. pep TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 
1 I I I I M I I II I I I I I I I I I I I I I I I I I II I I I I I I I I ! II I I I II ! I I I I I 
alll TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 

310 320 330 340 350, 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 399>: 

glll-l . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACCTT ATCCGCGCCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAacCG 

1C1 TTACCCTGCA AGGCGAAACG ATGGGTACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCCCCT GCCAAAATAC AAAAGCGCAT 

201 TGATGATGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TACCAGACCG 
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251 ATTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ATTTCGCACA CGTTACCGCC GAAGCCGTCC GCCTGAACCG 

351 CCTGACTCAC GGCGCACTGG ACGTAACCGT CGGCCCTTTG GTCAACCTTT 

401 GGGGGTTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

4 51 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGCAACA 

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAA GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 

601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCggcGAGTT 

651 GCACGGCAAA GGCAAAAATG CGCACGGCGA ACCGTGGCGC ATCGGTATAG 

701 AGCAACCCAA TATcatccaa ggcggcaata cgcAGattat cgtcccgctg 

751 aaCaaccgtt cgcttgccac ttccggcgAT taccgtaTTT tccacgtcgA 

801 TAAAAACGGC Aaacgccttt cccacATCAT CAATCCCAAC AACAAACGAC 

851 CCATCAGcCA CAAcctcgcc tcCATCAgCg TGGTCTCAGA C ACTGCAATG 

901 ACGGCGGACG GTTTATCCAC A^GATTATTT GTTTTAGGCG AAACCGAAGC 

951 CTTAAGGCTG GCAGAACAAG AA_\AACTCGC TGTTTTCCTA ATTG7CCGGG 

1001 ATAAGGACGG CTACCGCACC GCCATGTCTT CCGAATTTGC CAAGCTGCTC 

1051 CGCTAA 



This corresponds to the amino acid sequence <SEQ ID 400; ORF 1 1 l-l.ng>: 

glll-l.pep 

1 MPSETRLPNL IRALI FALGF IFLKAC SEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AKIQKRIDDA LKEVNRQMST YQTDSEI SRF NQHTAGKPLR 

101 ISSDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILQQGKD Y^SLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNAHGEPWR IGIEQPNIIQ GGNTQI IV PL 

251 NNRSLATSGD YRI FHVDKNG KRLSHI INPN NKRPISHNLA SISVVSDSAM 

301 TADGLSTGLF VLGETEALRL AEQEKLAVFL IVRDKDGYRT AMSSEFAKLL 

351 R* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 401 >: 

mlll-l. saq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCGTCT TGATATTTGC 

51 CCTGGGTTTC ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGCGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAACGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC GCCTGAACCG 

351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

4 01 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

4 51 ATCAAACAGG CGGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA 

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 

601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 

651 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCGTGGCGC ATCGGTATCG 

701 AGCAGCCCAA TATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG 

751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 

801 TAAAAACGGC AAACGCCTCT CCCATATCAT CAACCCGAAC AACAAACGAC 

851 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 

951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG ATTGTCAGGG 

1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGCTC 

1051 CGCTAA 

This corresponds to the amino acid sequence <SEQ ID 402; ORF 1 1 1-1>: 

ml 1 1 - 1 . pep 

1 MPSETRLPNF IRVLI FALGF IFLNAC SEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEISRF NQHTAGKPLR 

101 ISSDFAHVTA EAVRLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKIILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQI IVPL 

251 NNRSLATSGD YRI FHVDKNG KRLSHI INPN NKRPISHNLA SISVVADSAM 

301 TADGLSTGLF VLGETEALKL AEREKLAVFL I VRDKGGYRT AMSSEFEKLL 

351 R* 



mlll-l/glll-1 96.6% identity in 351 aa overlap 



10 20 30 40 50 60 

ml 11-1 . pep MPSETRLPNFIRVLI FALGFI FLNA.CSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 
I M i I I I II : I I : I I I I I I I I I I I I I I I I I I I I I I II I I I I I ! ! I I I I I f I I I I I I I I I I 
g 1 1 1 - 1 MPSETRLPN L I RAL I FALG F I FLN AC SEQTAQTVT LQG ETi-'GTT Y T V K Y LSNNRDKLPSP 
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10 20 30 40 50 60 

70 80 90 100 110 120 

ml 11-1 . pep AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLPJSS.0FAHV7AEAVRLNRLTH 
1 : I It M I M I I M I i I lit I I i I I I I 1 I I I I I ! i ! I I I ! Ill I 1 111 1 ! I I i I t i i I I 
gl 1 1 -1 AKIQKRIDDALKEVNRQMSTYQTDSEISRFNQHTAGKPLRISSDFAHVTASAVRLNRLTK 
l'j 80 90 100 110 120 

130 140 150 160 170 180 

ml 11-1 .pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKOAAS YTGI DKI I LKQGKDYASLSKTHPK 
I I t 1 { I I I I ( I I I I (1 I I I I H I I I I I ( I I I I I I I I I I I I I I I I (: I I ( I I I I I I ! I I I I 
gl 11-1 GALDVTVGPLVNLWGFGPDKS VTREPS PEQIKQAAS YTGI DK 1 1 LQQGKDYA5LSKTHPK 

130 140 150 160 170 ISO 

190 200 210 220 230 240 

ml 11-1 .pep AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQ 
M I I I I I I I I I I I I I I I I I I i I I I I I I i I I I I II I I I I I I I M I : I i I f I I 1 I I t I I I : I 
gl 1 1-1 AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEI GGELHGKGKNAHGEPWRI GIEQPNI iq 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 11-1 .pep GGNTQI I VPLNNRSLATSGDYRI FHVDKNGKRLSHI INPNNKRPI SHNLASI 5VVADS AM 
M I I I I I I ! I I I I I I ! I I I I I I I I I I I I [ I I I f I I I I I I ! I II I I I i I I I I I I I I : j I I I 
g 1 1 1 - 1 GGNTQI IVPLNNRSLATSGDYRI FHVDKNGKRLSHI INPNNKRPI SHNLASI SWSDSAM 

250 260 270 280 290 300 

310 320 330 340 350 

ml 11-1 .pep TADGLSTGLFVLGETEALKLAEREKLAVFLIVRDKGGYRTAMSSEFEKLLRX 
I I I I I I I I I I I I I i I I I I : i I I : I I I 1 I I I I f I t I I 1 I I I I I I I I I I I I I 
glll-1 TADGLSTGLFVLGETEALRLAEQEKLAVFLIVRDKDGYRTAMSSEFAKLLRX 

310 320 330 340 350 

glll-l/p44550 

sp | P44550) YOJL_HAEIN HYPOTHETICAL LIPOPROTEIN HI0172 PRECURSOR >gi | 1074292 I pir | | C64 144 
hypothetical protein HI0172 - Haemophilus influenzae (strain Rd KW20) >gi| 1573128 (U32702) 
lipoprotein, putative [Haemophilus influenzae Rd] Length = 346 
Score = 349 bits (885), Expect = 2e-95 

Identities ~ 177/328 (531), Positives = 240/328 (72%), Gaps = 4/328 (1%) 



Query : 


23 


LNACSEQTAQTVTLQGETMGTTYXVKYLSNNRDKLPSPAEIXKRI DDALKEXNRXMSTYQ 


82 






L AC ++T + -r + L G+TMGTTY VKYL + S + + 1+ LK+ N MSTY + 




Sbjct : 


17 


LAACQKET-KVISLSGKTMGTTYHVKYLDDGSITATS-EKTHEEI EAILKDVNAKM5TYK 


74 


Query : 


83 


PDSEI SRFNQHT-AGKPLRISSDFAHVTAEAVRLNRLTHGALDVTVGPLVNLWGFGPDKS 


141 






DSE+SRFNQ+T P + IS+DFA V AEA+RLNt+T GALDVTVGP+VNLWGFGP+K 




Sbjct : 


75 


KDSELSRFNQNTQVNTPIEISADFAKVLAEAIRLNKVTEGALDVTVGPVVNLWGFGPEKR 


134 


Query : 


142 


VTREPSPEQI KQAASYTGIDKI I LKQGKDYASLSKTHPKAYLDLSSIAKGFGVDKVAGEL 


201 






++P+PEQ+ + ++ GIDKI L K+ A+LSK P+ Y+DLSSI AKGFGVD+VA *L 




Sbjct : 


135 


PEKQPTPEQLAERQAWVGIDKITLDTNKEKATLSKALPQVYVDLSSIAKGFGVDQVAEKL 


194 


Query : 


202 


EKYGIONYLVEIGGELHGKGKNARGEPWRIGIEQPNIVQGGNTQIIVPLNNRSLATSGDY 


261 






E+ QNY+VEIGGE+ KGKN G+PW+I IE+P + + + LNN +A+SGDY 




Sbjct : 


195 


EQLNAQN YMVEI GGEI RAKGKN I EGKPWQI AI EKPTTTGERAVEAVI GLNNMGMASSGDY 


254 


Query: 


262 


RI FHVDKNGKRLSHI INPNNKRPI SHNLASI SVVADSAMTADGLSTGLFVLGETEALKLA 


321 






RI+ ++NGKR +H I+P PI H+LASI+V+A ++MTADGLSTGLFVLGE +AL++A 




Sbjct : 


255 


RI Y- FEENGKRFAHEI DPKTGYPIQHHLASITVLAPTSMTADGLSTGLFVLGEDKALEVA 


313 


Query: 


322 


EREKLAVFLI VRDKGGYRTAMSSEFEKL 34 9 








E+ LAV+LI+R G+ T SS F+KL 




Sbjct : 


314 


EKNNLAVYLI IRTDNGFVTKSSSAFKKL 341 





The following partial DNA sequence was identified in N. meningitidis <SEQ ID 403>: 

alll-l . seq 

1 ATGCCGTCTG AAACACGCCT GCCGAACTTT ATCCGCACCT TGATATTTGC 

51 CCTGAGTTTT ATCTTCCTGA ACGCCTGTTC GGAACAAACC GCGCAAACCG 

101 TTACCCTGCA AGGTGAAACG ATGGGCACGA CCTATACCGT CAAATACCTT 

151 TCAAATAATC GGGACAAACT CCCCTCACCT GCCGAAATAC AAAAGCGCAT 

201 CGATGACGCG CTTAAAGAAG TCAACCGGCA GATGTCCACC TATCAGCCCG 

251 ACTCCGAAAT CAGCCGGTTC AACCAACACA CAGCCGGCAA GCCCCTCCGC 

301 ATTTCAAGCG ACTTCGCACA CGTTACTGCC GAAGCCGTCC ACCTGAACCG 
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351 CCTGACACAC GGCGCGCTGG ACGTAACCGT CGGCCCCTTG GTCAACCTTT 

4 01 GGGGATTCGG CCCCGACAAA TCCGTTACCC GTGAACCGTC GCCGGAACAA 

4 51 ATCAAACAAG CAGCATCTTA TACGGGCATA GACAAAATCA TTTTGAAACA 

501 AGGCAAAGAT TACGCTTCCT TGAGCAAAAC CCACCCCAAG GCCTATTTGG 

551 ATTTATCTTC GATTGCCAAA GGCTTCGGCG TTGATAAAGT TGCGGGCGAA 

601 CTGGAAAAAT ACGGCATTCA AAATTATCTG GTCGAAATCG GCGGCGAGTT 

551 GCACGGCAAA GGCAAAAACG CGCGCGGCGA ACCTTGGCGC ATCGGCATCG 

701 AACAGCCCAA CATCGTCCAA GGCGGCAATA CGCAGATTAT CGTCCCGCTG 

751 AACAACCGTT CGCTTGCCAC TTCCGGCGAT TACCGTATTT TCCACGTCGA 

301 TAAAAGCGGC AAACGCCTCT CCCATATCAT TAATCCGAAC AACAAACGAC 

8 51 CCATCAGCCA CAACCTCGCC TCCATCAGCG TGGTCGCAGA CAGTGCGATG 

901 ACGGCGGACG GCTTGTCCAC AGGATTATTC GTATTGGGCG AAACCGAAGC 

951 CTTAAAGCTG GCAGAGCGCG AAAAACTCGC TGTTTTCCTG A7~GTCAGGG 

1001 ATAAAGGCGG CTACCGCACC GCCATGTCTT CCGAATTTGA AAAACTGCTC 

1051 CGCTAA 



This corresponds to the amino acid sequence <SEQ ID 404; ORF 1 1 1-1. a>: 

alll-l .pep 

1 MPSETRLFNF IRTLI FALSF I FLNA.C SEQT AQTVTLQGET MGTTYTVKYL 

51 SNNRDKLPSP AEIQKRIDDA LKEVNRQMST YQPDSEI SRF NQHTAGKPLR 

101 I SSDFAHVTA EAVHLNRLTH GALDVTVGPL VNLWGFGPDK SVTREPSPEQ 

151 IKQAASYTGI DKI ILKQGKD YASLSKTHPK AYLDLSSIAK GFGVDKVAGE 

201 LEKYGIQNYL VEIGGELHGK GKNARGEPWR IGIEQPNIVQ GGNTQI I VPL 

251 NNRSLATSGD YRI FHVDKSG KRLSHTINPN NKRPISHNLA SISVVADSAM 

301 TADGLSTGLF VLGETEALKL AEREKLAVFL I VRDKGGYRT AMSSEFEKLL 

351 R* 



alll-l/mlll-1 98.9% identity in 351 aa overlap 



10 20 30 40 50 60 

alll-l .pep MPSETRLPNFIRTLIFALSFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSMMRDKLPSP 
! I I i I I j I I I ! I : ! I I I I : I I I ! I I I I I i 1 ! I I I I i I 1 I I I ! ! i I I I I j I f M I I ! i I I I 
mlll-1 MPSETRLPNFIRVLIFALGFIFLNACSEQTAQTVTLQGETMGTTYTVKYLSNNRDKLPSP 

10 20 30 40 50 60 



80 90 100 110 120 

dlll-1 .pep AE1QKRIDDALKEVNRQMST YQPDSEI SRFNQHTAGKPLRI SSDFAHVTAEAVHLNRLTH 
i [ ! ! t I I t H I t I i 1 II I I i I I i I I I I I I I I I I I I I I I I 1 t 1 ( I ! 1 I 1 I M [ 1 : I I I I II 
mlll-1 AEIQKRIDDALKEVNRQMSTYQPDSEISRFNQHTAGKPLRISSDFAHVTAEAVRLNRLTH 
70 80 90 100 110 120 



130 140 150 160 170 180 

alll-l .pep GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGIDKIILKQGKDYASLSKTHPK 
I | i I I I I I I I I I f I I I i ! I I I II I I I I i ! I I I I I t I I I i I I I I I i 1 I I I I I ! I I 1 I I I I I 
mlll-1 GALDVTVGPLVNLWGFGPDKSVTREPSPEQIKQAASYTGI DKI ILKQGKDYASLSKTHPK 

130 140 150 160 170 180 



190 200 210 220 230 240 

alll-l .pep AYLDLSS IAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPMI VQ 
I I I il I I I I i I I I i I I I I I I ! M 1 1 M I I I I I I I i I I ! I I I I I I I I M it I I I I I ! I It I 
mlll-1 AYLDLSSIAKGFGVDKVAGELEKYGIQNYLVEIGGELHGKGKNARGEPWRIGIEQPNTVQ 

190 200 210 220 230 240 



250 260 270 280 290 300 

alll-l .pep GGKTQIIVPLNHRSLATSGDYRI FHVDKSGKRLSHIINPNNKRPI SHN LASTS WADS AM 
M I ! i I I If I I ! I I I I I I I I I I I I I I I I : I I I 1 I M I I I i I I I I I I I f I I ! I ! I I II 11 I 
mlll-1 GGNTQI I VPLNNRSLATSGD YRI FHVDKNGKRLSHI INPNNKRPI SHNLAS ISVVADSAM 

250 260 270 280 290 300 



310 320 330 340 350 

alll-l .pep tadglstglfvlgetealklaereklavflivrdkggyrtamssefekllrx 
1 1 1 1 1 u I i I i i i m 1 1 1 ( 1 i i i 1 1 i i 1 1 i i i 1 1 i I i i i i i i i i i i i i 1 1 i i 
mlll-1 tadglstglfvlgetealklaereklavfli vrdkggyrtamssefekllrx 

310 320 330 340 350 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 405>: 

g!14 . seq 

1 ATGGCTTCCA TCACTTCGCC GCTGCACGGG GCGCAGCAGG AATGCAGCAA 
51 GACTTTTTTA TGTCCGCCGG GCGGGACGAG TATGGGGCGG TCAATGTCGG 
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101 TAACGGTAGG TTTGTTTTGT GTTTCCATTA ACTTAACAAT ATCTGTCGAA 

151 TACGGTCAAA GCGGCTATTT TACCAGAGCC GCCGAATGTA AAACAGGGTG 

201 TCAGGGCATC AGCCCGAGCT GCCTGAACGA ACGGACGGTT TGCGAGGTAA 

251 CGATAAAATG GTCGAGCAGC GAAACATCAA CCAGCGACAT GGCCTGTGCC 

301 AGCCGCCTTG TGAACATGAT GTCTTCCTGC GAAGGTTCAG GCGAGCCGCC 

3 51 CGGATGGTTG TGCGCGATAA TCAGGCTGTC GGCATATTCG TCCAATGCCA 

401 GTTTGACGAT TTCGCGGATG TAA 

This corresponds to the amino acid sequence <SEQ ID 406; ORF 1 14.ng>: 
gll4 . pep 

1 MASITSPLHG AQQECSKTFL CPPGGTSMGR SM SVTVGLFC VSINLTISV E 

51 YGQSGYFTRA AECKTGCQGI SPSCLNERTV CEVTIKWSS5 ETSTSDMACA 

101 SRLVNMMSSC EGSGEPPGWL CAIIRLSAYS SNA3LTISRM * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 407>: 

ml 14 . seq 

1 ATGGCTTCCA TCACTTCGCC GCTGCACGGG GCGCACAGAG AATGCAGCAA 

51 GACTTTTTTA TGTCCACCGG GCGGGACGAG TATAGGGCGG TCAATGTCGG 

101 TAACGGTAGG TTTGTTTTGT GTTTCCATTA ACTTAACAAT ATCTGTTGAA 

151 TACGGTTGAA GCGGCTATTT TAT C AG AG CC GCCGCATGTA AAACAGAGTG 

201 TCAGGGCATC AACCCGAGCT GTCTGAACGA ACAGACGCTT TGCGAkGTAA 

251 CGATAAAATG GTCGAGCAGC GACACATCGA CCAGCGACAT TGCCTGTGCC 

3 01 AGCCGCCTTG TGAACATGAT GTCTTCCTGC GAArGTTCsG GCGAGCCGcC 
351 CGgATGGTTG TGCGCAATAA TCAGGCTGTC GGCATATTCG TCCAATGCCA 

4 01 GTTTGACGAT TTCGCGGATG TAA 

This corresponds to the amino acid sequence <SEQ ID 408; ORF 1 14>: 

mll4 . pen 

1 MASITSPLHG AHRECSKTFL CPPGGTSIGR SM SVTVGLFC VSINLTISV E 

51 YGXSGYFIRA AACKTECQGI NPSCLNEQTL CXVTIKWSSS DTSTSDIACA 

101 SRLVNMMSSC EXSGEPPGWL CAIIRLSAYS SNASLTISRM * 



mll4/gll4 90.0% identity over a 140 aa overlap 

10 20 30 40 50 60 

ml 14 .pep MAS ITS PLHGAHRE CS KTFLCPPGGTSIGRSMSVTVGLFCVS INLTISVE YGXSGYFIRA 

I I I | I I I I I I I = : I I I I I I I II I I II I : I ! I I I I I I I I I i I I II I ! I I I I I I I I I I II 
gll4 MASITSPLHGAQQECSKTFLCPPGGTSMGRSMSVTVGLFCVSINLTISVEYGOSGYFTRA 
10 20 30 40 50 60 



70 80 90 100 110 120 

ml 14 . pep AACKTECQGINPSCLNEOTLCXVTIKWSSSDTSTSDIACASRLVNMMSSCEXSGEPPGWL 

i iii iiiMiiiih hi mm im i ii hi mi m in in i ii inn 

gll4 AECKTGCQGI SPSCLNERTVCEVTIKWSSSETSTSDMACASRLVNMMSSCEGSGEPPGWL 

70 80 90 100 110 120 



130 140 
mll4.pep CAIIRLSAYSSNASLTISRMX 

MIMIIIIIIIMIIimi 

gl 14 CAI IRLSAYSSNASLTISRMX 

130 140 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 409> 

all4 . seq 

1 ATGCCGGAGG CAAGCATCGC CTCCATCACT TCGCCGCTGC ACGGGGCGCA 

51 ACAGGAATGC AGCAAGACTT TTTTATGTCC GCC^GGCGGG ACGAGTATGG 

101 GGCGGTCAAT GTCGGTAACG GTAGGTTTGT TTTGTGTTTC CATTAACTTA 

151 ACGATATCTG TCGAATACGG TTGAAGCGGC TATTTTATCA GAGCCGCCGC 

201 ATGTAAAACA GGGTGTCAGG GCATCAGCCC GAGCTGCCTG AACGAACGGA 

2 51 CGGTTTGCGC CGTTACGATA AAATGGTCGA GCAGCGACAC ATCGACCAGC 

301 GACATTGCCT GTGCCAGCCG CCTTGTGAAC ATGATGTCTT CCTGCGAAGG 

351 TTCGGGCGAG CCGCCCGGAT GGTTGTGCGC GATAATCAGG CTGTCGGCAT 

4 01 ATTCGTCCAA. TGCCAGTTTG ACAATTTCAC GGATGTAA 
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This corresponds to the amino acid sequence <SEQ ID 410; ORF 1 14.a>: 

all 4 . pep 

1 MPEASIASIT SPLHGAQQEC SKTFLCPPGG TSMGRSM SVT VGLFCVSINL 

SI TISV EYG *SG YFIRAAACKT GCQGISP5CL NERTVCAVTT KWSSSDTSTS 

101 DIACASRLVN MMSSCEGSGE PPGWLCAIIR LSAYSSNASL TISRM* 



mll4/all4 92.9% identity in 140 aa overlap 



10 20 30 40 50 

ml 14 . pep MASITSPLHGAKRECSKTFLCPPGGT"SIGRSMSVTVGLFCVS INLTISVEYGXSG 

: I I 1 I I I I M I : : I I ! I! I I I I I I I , ( : II I I I I I I I I I I I I I I I I i I M ! i [ I i 
all 4 MPEASIASITSP 1 HGAQQECSKTFLCPPGGTSMGRSMSVTVGLr CVSINLTI SVEYGXSG 

10 20 30 40 50 60 



60 70 80 90 100 110 

ml 1 4 . pep YFIRAAACKTECQGINPSCLNEQTLCXVTIKWSSSDTSTSDIACASRLVNMMSSCEXSGE 
MINIMI. M I I : I I I I I I : i : I I I I I I I I ! II I I I I I I I II M I I I I I I I I II! 
all 4 YFIRAAACKTGCQGI SPSCLNERTVCAVTIKWSSSDTSTSDIACASRLVNMMSSCEGSGE 

70 80 90 100 110 120 



120 130 140 

mll4 .pep PPGWLCAIIRLSAYSSNASLTISRMX 
I ! I I I I I I I M I I I I I I I I I I I I II I 
a 1 1 4 P PGW LCAI I RLS AY S SNASLT I SRMX 

130 140 



The following partial DNA sequence was 


gll7 . seq 






1 


atggtcgacg 


aactcgacCT 


51 


TGCCGACATC 


GGACGCTACG 


101 


GCTGCAACAG 


CACCGTCGCC 


151 


AAACTTACCC 


ACTTCGCCCG 


201 


CGCACAGCAA 


GCGGAAACCA 


251 


Acatccgcgt 


cgtaTTAATC 


301 


ttTTtaaGCA 


ACGCCCCCGA 


351 


aacccTCGAC 


ATCTTCGCCC 


401 


TCAAATGGCA 


GCTCGAAGAT 


451 


TACCGCGAAA 


TCGCCCTGCT 


501 


ATACATCGAA 


AACTTCCTCG 


551 


ATATCCACTT 


TGAAGTCGCC 


601 


AAAAAAATGG 


TGAAGAAAAA 


651 


CGCCGTGCGG 


ATTCTGGTCG 


701 


gcaTCGTCCA 


CAGCCTCTGG 


751 


ATCGCCAACC 


CCAAAGgcaA 


801 


cggcccGGAa 


gacaaaggtg 


851 


accAATTCaa 


CgaatTcggT 


901 


ggcaaaggcg 


attccGCCtA 


951 


CTTGGACTGG 


CGCGAAAATA 


1001 


CCGCCTTCAA 


AACCGAGCTT 


1051 


CACGGCAAAG 


TCCTCTCTCT 


1101 


CTACGCCCTG 


CACAGCAGCA 


1151 


AaggGCAGAT 


TGTGCCGCTG 


1201 


GAAATcatta 


cCGCcaaAGA 


1251 


AGGctgGGtc 


aAATCCGGCA 


1301 


GCCAGcaaAa 


cgCcgaCACC 


1351 


AAGCAGCTTG 


CCAAACTCAC 


1401 


aaATCTCGGC 


tacaaAAAGC 


1451 


gcgaaatttc 


caaccgcgcc 


1501 


ccgccccCCG 


TGCCCGTCAG 



identified in N. gonorrhoeae <SEQ ID 41 1>: 

GCTGCCCGAT GCCGTCGCCG CCACCCTGCT 
TCCCCGATTG GAACCTATTG GTTTCCGAGC 
GAGCTGGTCA AAGGTGtgga CGAAGTGCAG 
GGTGGACAGC CTCGCCACGC CGGAAGAACG 
TGCGGAAAAT GCTGCTGGCg atggttaccg 
AAACTGGCGA TGCGTacgcg caccCTGcta 
CAGCCCTGAA AAACgcgccG TCgccaaAga 
CGCTCGCCAA CCGCTTGGGC GTGTGGCAGC 
TTGGGCTTCC GCCATCAAGA ACCCGAAAAA 
TTTGGACGAA AAACGCACCG AACGCCTCGA 
ATATCCTGCG TACGGAACTC AAAAAATACA 
GGCCGTCCGA AACACATCTA CTCCATTTAC 
ACTCAGCTTC GACGgccTGT TCGACATCCG 
ATACCGTCCC CGaGTGTTAC ACCACGCTGG 
CAGCCCATTC CGGGCGagtt CGAcgactAC 
CGgttATAAA AGtTTGCACA CCGTCATCGT 
tggaaGtgCA AATCCGCACC TTCGAtatGC 
gtcgccgCCC ACTGGCGtta caaagaaggc 
cgaacaaAAA ATcgccTggt TGCgccaACT 
TGGCGGAAAG CGGCAAGGAA GACCTCGCCG 
TTCAACGACA CG ATTTATGT TTTGACCCCG 
GCCAACGGGC GCAACCCCCA TCGACTTCGC 
TcggCGACCG CTGCCGGGGC GCGAAAGTCG 
TCCACCCCGC TCGAAAACGG ACAGCGCGTC 
AGGGCATCCT TCCGTCAACT GGCTTTACGA 
AGGCCATCGG caaAATCCGC GCCTAcatCC 
GTGCGCGAAG AAGGCCGTGT CCAACTCGAC 
GC^,AAACCC AACCTGCAAG AGCTTgccga 
cagaagacct ccacacCGCc gtcggacaag 
atCcaaaaag cctgcggcac GCTgaacgaa 
CGCAACCACC ATCGTCAAAC AGTCCAAAAT 
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1551 CAAAAAAGGT GGC.ViAACCG GCGTGCTCAT CGACGGCGAA GACGGCTTGA 

1601 TGACCACGCT TGCCAAATGC TGCAAACCCG CGCCGCCCGA CGATATTGCC 

16 51 GGCTTCGTTA CCCGCGAGCG CGGCATTTCC GTCCACCGCA AAACCTGCCC 

1701 CTCTTTCCGA CACCTTGCCG AACACGCGCC CGAAAAAGTA CTGGACGCAA 

1751 GTTGGGCGGC GTTGCAGGAA GGGCAAGTGT TCGCCGTCGA TATCGAAATC 

1S01 CGC'JCCCAAG ACCGCTCCGG GCTTTTGCGC GACGTATCCG ACGCGCTCGC 

18 51 CCGCCACAAA CTCAACGTTA CCGCCGTGCA AACCCAGTCC CGCGACTTGG 

1901 AAGCCAGCAT GAGGTTCACG CTCGAAGTCA AACAAGtCAA CGacCTCCCG 

1951 CGCGTCCTCG CCGGCCTCGG CGATGTCAAA GGCGTATTGA GCGTTACCCG 

2001 GCTTTAA 

This corresponds to the amino acid sequence <SEQ ID 412; ORF 1 1 7.ng>: 

gll7 . pep 

1 MVDELDLLPD AVAATLLADI GRYVPDWNLL VSERCNSTVA ELVKGVDEVQ 

51 KLTHFARVDS LATPEERAQQ AETMRK MLLA MVTDIRWLI KLAM RTRTLL 

101 FLSNAPDSPE KRAVAKETLD I FAPLANRLG WQLKWQLED LGFRHQEPEK 

151 YREIALLLDE KRTERLEYIE NFLDILRTEL KKYNIKr EVA GRPKHIYSIY 

201 KKMVKKKLSF DGLFDIRAVR ILVDTVPECY TTLGIVHSLVJ QPI PGEFDDY 

251 IANPKGNGYK SLHTV1VGPE DKGVEVQIRT FDMHQFNEFG VAAHWRYKEG 

3 01 GKGDSAYEQK IAWLRQLLDW RENMAESGKE DLAAAFKTEL FNDTIYVLTP 

3 51 HGKVLSLPTG ATPIDFAYAL HSSIGDRCRG AKVEGQIVPL STPLENGQRV 

4 01 EI ITAKEGHP SVNWLYEGWV KSGKAIGKIR AYIRQQNADT VREEGRVQLD 

4 51 KQLAKLTPKP NLQELAENLG YKKPEDLYTA VGQGEISNRA IQKACGTLNE 

5 01 PPPVPVSATT IVKQSKIKKG GKTGVLIDGE DGLMTTLAKC CKPAPPDDIA 
551 GFVTRERGIS VHRKTCPSFR HLAEHAPEKV LDASWAALQE GQVFAVDIEI 
601 RAQDRSGLLR DVSDALARHK LNVTAVQTQS RDLEASMRFT LEVKQVNDLP 

6 51 RVLAGLGDVK GVLSVTRL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 41 3>: 

mll7.seq (partial) 

1 . . GTGAAACTCA AGAAATACAA TGTCCATTTC GAAGTCGCCG GCCGCCCGAA 

51 ACACATCTAC TCCATTTACA AAAAAATGGT GAAGAAAAAA CTCAGCTTCG 

101 ACGGCCTCTT TGACATCCGC GCCGTGCGAA TTCTGGTTGA TACCGTCCCC 

151 GAGTGTTACA CCACGCTGGG TATCGTCCAG AGCCTCTGGC AGCCCATTCC 

201 CGGCGAGTTC GACGACTACA TCGCCAATCC CAAAGGCAAC GGCTATAAAA 

251 GTTTGCACAC CGTCATCGTC GGCCCGGAAG ACAAAGGCGT GGAAGTACAA 

301 ATCCGCACCT TCGATATGCA CCAATTCAAC GAATTCGGTG TCGCCGCCCA 

351 CTGgCGTTAC AAAGAGGGCG GCAAGGGCGA TTCCGCCTAC GAACAGAAAA 

4 01 TCGCCTGGTT GCGCCAACTC TTGGACTGGC GCGAAAACAT GGCGG AAAGC 

4 51 GGCAAGGAAG ACCTCGCCGC CGCCTTCAAA ACCGAGCTTT TCAACGACAC 

501 GATTTATGTT TTGACCCCGC ACGGCAAAGT CCTCTCCCTG CCCACGGGCG 

551 CGACCCCCAT CGACTTCGCC TACGCCCTGC ACAGCAGCAT CGGCGACCGT 

601 TGCCGCGGTG CGAAAGTCGA AGGGCAGATT GTGCCGCTGT CCACCCCGCT 

651 CGAAAACGGA CAGCGCGTCG AAATCATTAC CGCCAAAGAA GGG CATCCTT 

701 CCGTCAACTG GCTTTACGAA GGCTGGGTCA AATCCAACAA GGCAATCGGC 

751 AAAATCCGCG CCTACATCCG CCAGCAAAAC GCCGACACCG TGCGCGAAGA 

801 AGGCCGCGTC CAACTCGACA AACAGCTTGC CAAACTCACG CCCAAACCCA 

8 51 ACCTGCAAGA GCTTGCCGAA AATCTCGGCT ACAAAAAGCC AGAAGACCTC 

901 TACACCGCCG TCGGACAAGG CGAAATTTCC AACCGCGCCA TCCAAAAAGC 

951 CTGCGGCACg CTGAACGAAC CGCCGCCCGT ACCCGTCAGC GAAACCACCA 

1001 TCGTCAAACA GTCCAAAATC AAAAAAGGCG GCAAAAACGG CGTGCTCATC 

1051 GACGGCGAAG ACGGTCTGAT GACCACGCTT GCCAAATGCT GCAAACCCGC 

1101 GCCGCCCGAC GATATTATCG GCTTCGTTAC CCGCGAGCGC GgCATTTCAG 

1151 TGCACCGCAA AwyyTkCy CG TCTTTCCAAC ACCTCGCCGA ACACGCGCCC 

12 01 GAwAAAGTGC TGGACGCAAG CTGGGCGGCA TTGCAGGAAG GACAAGTATT 

12 51 CGCCGTCGAT ATCGAAATCC GCGCCCAAGA CCGCTCCGGG CTTTTGCGCG 

13 01 ACGTATCCGA CGCGCTCGCC CGCCACAAAC TCAACGTTAC CGCCGTGCAA 

13 51 ACCCAGTCCC GCGACTTGGA AGCCAGCATG AGGTTCACGC TCGAAGTCAA 

14 01 ACAAGTCAAC GACCTCCCGC GCGTCCTCGC CAGCCTCGGC GACGTCAAAG 
14 51 GCGTATTGAG CGTTACCCGG CTTTAA 

This corresponds to the amino acid sequence <SEQ ID 414; ORF 1 17>: 

mll7.pep (partial) 
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101 
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VPLSTPLENG 


251 


KIRAYIRQQN 
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SIYKKMVKKK LSFDGLFDIR AVRILVDTVP 
DDYIANPKGN GYKSLKTVIV GPEDKGVEVQ 
KEGGKGDSAY EQKIAWLRQL LDWRENMAES 
LTPHGKVLSL PTGATPIDFA YALHSSIGDR 
QRVEIITAKE GHPSVNWLYE GWVKSNKAIG 
QLDKQLAKLT PKPNLQELAE NLGYKKPEDL 
LNEPPPVPVS E7TIVKQSKI KKGGKNGVLI 
DIIGFVTRER GISVHRKXXX SFQHLAEHAP 
IEIRAQDRSG LLRDVSDALA RKKLNVTAVQ 
DLPRVLASLG DVKGVLSVTR L* 

Computer analysis of this amino acid sequence gave \he following results; 
Homology with a predicted ORF from N. gonorrhoeae 

ORP 1 1 7 shows 97.6% identity over a 490 aa overlap with a predicted ORF (ORF 1 1 7.ng) 
from N. gonorrhoeae: 

mll7/gll7 

10 20 30 

ml 17 .pep VKLKKYNVHFEVAGRPKHI YSIYKKMVKKKL 

:imhll!lillillil!l!imilii 

gill EKYREIALLLDEKRTERLEYIENFLDILRTELKKYNIHFEVAGRPKHI YSIYKKMVKKKL 

150 160 170 180 190 200 

40 50 60 70 SO 90 

ml 17 .pep SFIXSLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 

IIIMIIIIMIIIIIMMIIIMII'lllltilllllMIIMIMIIlllllllMI 

gl!7 SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 
210 220 230 240 250 260 

100 110 120 130 140 150 

n.-17 . pep PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 

IMIIIMIMMIIIIIIIIIIIIIMIMIIMIIIIIIIIIIIMIIIIIMIIIII 

gll7 PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
270 280 290 300 310 320 

160 170 180 190 200 210 

mll7 .pep KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 

IIMllllllliillllllllllinillilllilllllllilliMllllillllltll 

g 1 1 7 KEDLAAAFKTELFNDTI YVLTPHGKVLSLPTGATP I DFAYALHSS I GDRCRGAKVEGQI V 

330 340 350 360 370 380 

220 230 240 250 260 270 

ml 17 .pep PLSTPLENGQRVEIITAKEGHPSVNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQ 

IIMIIIIIIllllllllllMlllllllllllhlllllllllllllllllllllllll 

g 1 1 7 PLSTPLENGQRVE 1 1 TAKEGHPS VNVILYEGWVKSGKAI GKI RAY I RQQNADTVREEGRVQ 

390 400 410 420 430 440 

280 290 300 310 320 330 

mll7 .pep LDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSE 

llllllllllinillMlMllllilllllMlllMlllllllllllllllllllll 

gll7 LDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGOGEISNRAIQKACGTLNEPPPVPVSA 
450 460 470 480 490 500 

340 350 360 370 380 390 

mil 7 .pep TTIVKOSKIKKGGKNGVLIDGEDGLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKXXXS 

iiiiiiMiiiiihiiiiiiMMiiiiMiiiiiiiiii imiimmih i 

gll7 TTIVKQSKIKKGGKTGVLIDGEDGLMTTLAKCCKPAPPDDIAGFVTRERGISVHRKTCPS 
510 520 530 540 550 560 



ml 17 . pep 



400 410 420 430 440 450 

FOHLAEHAPXKVLDASWAALOEGQVFAVDI E I RAO DRSGLLRDVS DALARH KLNV T AV 0 T 
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MINIM IIIIIIIIIIIIIIMIIIIIIIIIllllllllllMMlMiiiMiii 

FRHLAEHAPEFCVLDASWAALQEGQVFAVDIEIRAQDR3GLLRDVSDALARHKLNVTAVQT 
570 580 590 600 610 620 

460 470 480 490 

OSRDLEASMRFTLEVKQVNDLPRVLASLGDVKGVLSVTRLX 

IMIMIIHMIIIIIIMIIIIIhlllllfllMllii 

QSRDLEASMRFTLEVKQVNDLPRVLAGLGDVKGVLSVTRLX 
630 640 650 660 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 415> : 

all7 . seq 

1 ATGGTTCATG AACTCGACCT GCTCCCCGAT GCCGTCGCCG CCACCCTGCT 
51 TGCCGACATC GGACGCTACG TCCCCGACTG GAACCTATTG GTTTCCGAAC 
101 GCTGCAACAG TACCGTCGCC GAGCTGGTCA AAGGTGTGGA CGAAGTGCAG 

151 AAACTCACCC ACTTCGCCCG GGTGGACAGC CTCGCCACGC CGGAAGAACG 
2 01 CGCCCAGCAG GCAGAAACTA TGCGGAAAAT GCTGCTGGCG ATGGTTACCG 

2 51 ACATCCGCGT CGTGTTAA.TC AAACTGGCGA TGCGTACGCG CACCCTGCAA 

301 TTTTTAAGCA ACGCCCCCGA CAGCCCCGAA AAACGCGCCG TCGCCAAAGA 

351 AACCCTCGAC ATCTTCGCCC CGCTCGCCAA CCGTTTGGGC GTGTGGCAGC 

4 01 TCAAATGGCA GCTCGAAGAT TTGGGCTTCC GCCATCAAGA ACCCGAAAAA 

4 51 TACCGCGAAA TCGCCCTGCT TTTGGACGAA AAACGCACCG AACGCCTCGA 

501 ATACATCGAA AACTTCCTTA ATATCCTGCG TACGGAACTC AAAAAATACA 

551 ATATCCACTT TGAAGTCGCC GGCCGTCCGA AACACATCTA CTCCATTTAC 

601 AAAAAAATGG TGAAGAAAAA ACTCAGCTTC GACGGGTTGT TCGACATCCG 

651 CGCCGTGCGG ATTCTGGTTG ATACCGTCCC CGAGTGTTAC ACCACACTGG 

V01 GCATTGTCCA CAGCCTCTGG CAGCCCATTC CCGGCGAGTT CGACGACTAC 

7 51 ATCGCCAACC CGAAAGGCAA CGGCTATAAA AGTTTGCACA CCGTCATCGT 
801 CGGCCCGGAA GACAAAGGCG TGGAAGTGCA AATCCGCACC TTCGATATGC 

8 51 ACCAATTCAA CGAATTCGGT GTCGCCGCGC ACTGGCGTTA CAAAGAGGGC 
901 GGCAAAGGCG ATTCCGCCTA CGAACAAAAA ATCGCCTGGT TACGCCAACT 
951 TTTGGACTGG CGCGAAAACA TGGCGGAAAG CGGCAAGGAA GACCTCGCCG 

1001 CCGCCTTCAA AACCGAGCTT TTCAACGACA CGATTTATGT TTTGACCCCG 

1051 CACGGCAAAG TCCTCTCCCT GCCCACAGGC GCGACCCCCA TCGACTTCGC 

1101 CTACGCCCTG CACAGCAGCA TCGGCGACCG TTGCCGCGGT GCGAAAGTCG 

1151 AAGGGCAGAT TGTGCCGCTG TCCACCCCGC TCGAAAACGG ACAGCGTGTC 

1201 GAAATCATTA CCGCCAAAGA AGGGCATCCT TCCGTCAACT GGCTTTACGA 

12 51 AGGCTGGGTC AAATCCAACA AGGCAATCGG CAAAATCCGC GCCTACATCC 

1301 GCCAGCAAAA CGCCGACACC GTGCGCGAAG AAGGCCGCGT CCAACTCGAC 

1351 AAACAGCTTG CCAAACTCAC GCCCAAACCC AACCTGCAAG AGCTTGCCGA 

14 01 AAATCTCGGC TACAAAAAGC CAGAAGACCT CTACACCGCC GTCGGACAAG 

14 51 GCGAAATTTC CAACCGCGCC ATCCAAAAAG CCTGCGGCAC GCTGAACGAA 

1501 CCGCCGCCCG TACCCGTCAG CGAAACCACC ATCGTCAAAC AGTCCAAAAT 

1551 CAAAAAAGGC GGCAAAAACG GCGTGCTCAT CGACGGCGAA GACGGTCTGA 

1601 TGACCACGCT TGCCAAATGC TGCAAACCCG CGCCGCCCGA CGACATTGTC 

1651 GGCTTCGTTA CCCGCGATCG CGGCATTTCG GTACACCGCA AAACCTGCCC 

1701 CTCTTTCCGA CACCTCGCCG AACACGCGCC CGAAAAAGTA CTGGACGCAA 

17 51 GTTGGGCGGC GTTGCAGGAA GGACAAGTGT TCGCCGTCGA TATCGAAATC 

18 01 CGCGCCCAAG ACCGCTCCGG GCTTTTGCGC GACGTATCCG ACGCGCTCGC 

18 51 CCGCCACAAA CTCAACGTTA CCGCCGTGCA AACCCAGTCC CGCGACTTGG 
1901 AAGCCAGCAT GAGGTTCACG CTCGAAGTCA AACAAGTTAC CGACCTCCCA 

19 51 CGCGTCCTCG CCAGCCTCGG CGACGTCAAA GGCGTATTGA GCGTTACCCG 

20 01 GCTTTAA 

This corresponds to the amino acid sequence <SEQ ID 416; ORF 1 17.a>: 

all 7 .pep 

1 MVHELDLLPD AVAATLLADI GRYVPDWNLL VSERCNSTVA ELVKGVDEVQ 

51 KLTHFARVDS LATPEERAQQ AETMRK MLLA MVTDIRWLI KLA MRTRTLO 

101 FLSNAPDSPE KRAVAKETLD IFAPLANRLG VWQLKWQLED LGFRHQEPEK 

151 YREIALLLDE KRTERLEYIE NFLNILRTEL KKYNIHFEVA GRPKHIYSI Y 

201 KKMVKKKLSF DGLFDIRAVR ILVDTVPECY TTLGI VHSLW QPI PGEFDDY 

251 IANPKGNGYK SLHTVIVGPE DKGVEVQIRT FDMHQFNEFG VAAKWRYKEG 

301 GKGDSAYEQK IAWLRQLLDW RENMAESGKE DLAAAFKTEL FNDTIYVLTP 

351 HGKVLSLPTG ATPIDFAYAL HSS IGDRCRG AKVEGQI VPL STPLENGQRV 



WO 99/57280 



gll7 

ml 1 7 . pep 
gl!7 



WO 99/57280 



PCT/US99/09346 



333 



401 
451 
501 
551 
601 
651 



EI ITAKEGHP 
KQLAKLTPKP 
PPPVPVSETT 
GFVTRDRGIS 
RAQDRSGLLR 
RVLASLGDVK 



SVNWLYEGWV 
NLQELAENLG 
IVKQSKIKKG 
VHRKTCPSFR 
DVSDALARHK 
GVLSVTRL* 



KSNKAIGKIR 
YKKPEDLYTA 
GKNGVLIDGE 
HLAEHAPEKV 
LNVTAVQTQS 



AYIRQQNADT 
VGQGE ISNRA 
DGLMTTLAKC 
LDASWAALQE 
RDLEASMRFT 



VREEGRVQLD 
IQKACGTLNE 
CKPAPPDDIV 
GQVFAVDIEI 
LEVKQVTDLP 



mll7/all7 



98.0% identity in 490 aa overlap 



10 20 30 

ml 17 . pep " . " K \ K K Y N V H FE V AG R P K H I Y S I Y K KM V KK K L 

: , ! I I I : I II I I M II i I I if I I M I I I I I 
a 117 EKYREIALLLDLKRTERLEYIENFLNILRTELKKYNIHFEVAGRPKHI YSIYKKMVKKKL 

150 160 170 180 190 200 

40 50 60 70 80 90 

ml 17 -pep SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLHTVIVG 
If I I i I I I I I It I I I 1 I ! t M I ! i I I i M II I I ( I I I I I I M I I I I II I I I If I I I I I I I 
all 7 SFDGLFDIRAVRILVDTVPECYTTLGIVHSLWQPIPGEFDDYIANPKGNGYKSLilTVIVG 
210 220 230 240 250 260 

100 110 120 130 140 150 

ml 17 .pep PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 
I I M I I [ I I I I I I II I I I I I ! t I I I II I I I I I ! I I I I I I I I I I M I i I ! I 1 I I I I I I I I I 
all 7 PEDKGVEVQIRTFDMHQFNEFGVAAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESG 

270 280 290 300 310 320 

160 170 180 190 2C0 210 

ml 17 . pep KEDLAAAFKTELFNDTIYVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 
I I I I I I I i I i I I I M I I I I I I II I I I I I II I I I I 1 I I 1 I I I I I I I M I I I I II If M I I I 
all 7 KEDLAAAFKTELFNDTI YVLTPHGKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIV 

330 340 350 360 370 380 

220 230 240 250 260 270 

ml 17 .pep PLSTPLENGQRVEIITAKEGHPSVNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQ 
M ! M I I I I I I I I I I I I I I I I I I I I I I I I II I ! I I I I I I I I II I I I I I I I I ! I I I I I II I 
all7 PLSTPLENGQRVEI ITAKEGHPSVNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQ 

390 400 410 420 430 440 

280 290 300 310 320 330 

ml 17 .pep LDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSE 
I | | I M II I I I I I I I I I I I I I I I I I I I I I I I i M I M I I I I I I I I I I I I I 1 I I M I I II I 
all7 LDKQLAKLTPKPNLQELAENLGYKKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSE 
450 460 470 480 490 500 

340 350 360 370 380 390 

ml 17 .pep TTIVKQSKIKKGGKNGVLIDGEDGLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKXXXS 
I I I I I I I I I I I II I I I I I I I I I I I I I I I I I i I I I I I M I I I : I I I I I : I I I I I I I I : I 
all7 TTIVKQSKIKKGGKNGVLIDGEDGLMTTLAKCCKPAPPDDIVGFVTRDRGISVHRKTCPS 
510 520 530 540 550 560 

400 410 420 430 440 450 

ml 17 .pep FQHLAEHAPXKVLDASWAALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQT 
I : | I M I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I 
all7 FRHLAEHAPEKVLDASWAALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQT 
570 580 590 600 610 620 



460 470 480 490 

ml 17 . pep QSRDLEASMRFTLEVKQVNDLPRVLASLGDVKGVLSVTRLX 
I I I I I I I I I II ! I I I I I 1 : I ' ' I I I I I I I I I I I I I I I i I I I 
al 17 QSRDLEASMRFTLEVKQVTDLtRVLASLGDVKGVLSVTRLX 
630 640 650 660 
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The following partial DNA sequence was identified in A ; . gonorrhoeae <SEQ ID 41 7>: 

gll7-l .seq 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG CAAAGCGCGA CCCTGCAAGA 

51 ATTGCGCGAA T^GTTCGACA GCTACTGCGC CGC~CTGCCG GACAACGATA 

101 AAAACCTCAT CGGTACCGCA TGGTCGCTGG CGCAGGAACA TTATCCTGCC 

15! GATGCCGCCA CGCCGTATGG CGAGCCGCTG CCCGACCACT TCCTCGGCGC 

201 GGCGCAAATG GTCGACGAAC TCGACCTGCT GCCCGATGCC GTCGCCGCCA 

251 CCCTGCTTGC CGACATCGGA CGCTACGTCC CCGATTGGAA CCTATTGGTT 

301 TCCGAGCGCT GCAACAGCAC CGTCGCCGAG CTGGTCAAAG GTGTGGACGA 

351 AGTGCAGAAA CTTACCCACT TCGCCCGGGT GGACAGCCTC GCCACGCCGG 

4 01 AAGAACGGGC ACAGCAAGCG GAAACCATGC GGAAAATGCT GCTGGCGATG 

4 51 GTTACCGACA TCCGCGTCGT ATTAATCAAA CTGGCGATGC GTACGCGCAC 

501 CCTGCAATTT TTAAGCAACG CCCCCGACAG CCCTGAAAAA CGCGCCGTCG 

551 CCAAAGAAAC CCTCGACATC TTCGCCCCGC TCGCCAACCG CTTGGGCGTG 

601 TGGCAGCTCA AATGGCAGCT CGAAGATTTG GGCTTCCGCC ATCAAGAACC 

651 CGAAAAATAC CGCGAAATCG CCCTGCTTTT GGACGAAAAA CGCACCGAAC 

701 GCCTCGAATA CATCGAAAAC TTCCTCGATA TCCTGCGTAC GGAACTCAAA 

7 51 AAATACAATA TCCACTTTGA AGTCGCCGGC CGTCCGAAAC ACATCTACTC 

801 CATTTACAAA AAAATGGTGA AGAAAAAACT CAGCTTCGAC GGCCTGTTCG 

851 ACATCCGCGC CGTGCGGATT CTGGTCGATA CCGTCCCCGA GTGTTACACC 

901 ACGCTGGGCA TCGTCCACAG CCTCTGGCAG CCCATTCCCG GCGagttCGA 

951 cgactACATC GCCAACCCCA AAGgcaACGg tt ATAAAAGt TTGCACACCG 

1001 TCATCGTcgg cccGGAagaa aaaggtgtgg aagtgcAAAT CCGCACCTTC 

1051 GATATGCacc AATTCaaCga ATTCGGTGTC GCCGCCCACT GGCGTTACAA 

1101 AGAAGGCGGC AAAGGCGATT CCGCCTACGA ACAAAAAATC GCCTGGTTGC 

1151 GCCAACTCTT GGACTGGCGC GAAAATATGG CGGAAAGCGG CAAGGAAGAC 

1201 CTCGCCGCCG CCTTCAAAAC CGAGCTTTTC AACGACACGA TTTATGTTTT 

12 51 GACCCCGCAC GGCAAAGTCC TCTCTCTGCC AACGGGCGCA ACCCCCATCG 

1301 ACTTCGCCTA CGCCCTGCAC AGCAGCATCG GCGACCGCTG CCGGGGCGCG 

1351 AAAGTCGAAG GGCAGATTGT GCCGCTGTCC ACCCCGCTCG AAAACGGACA 

14 01 GCGCGTCGAA ATCATTACCG CCAAAGAAGG GCATCCTTCC GTCAACTGGC 

14 51 TTTACGAAGG CTGGGTCAAA TCCGGCAAGG CCATCGGCAA AATCCGCGCC 

1501 TACATCCGCC AGCAAAACGC CGACACCGTG CGCGAAGAAG GCCGTGTCCA 

1551 ACTCGACAAG CAGCTTGCCA AACTCACGCC CAAACCCAAC CTGCAAGAGC 

1601 TTGCCGAAAA TCTCGGCTAC AAAAAGCCAG AAGACCTCTA CACCGCCGTC 

1651 GGACAAGGCG AAATTTCCAA CCGCGCCATC CAAAAAGCCT GCGGCACGCT 

1701 GAACGAACCG CCGCCCGTGC CCGTCAGCGC AACCACCATC GTCAAACAGT 

17 51 CCAAAATCAA AAAAGGTGGC AAAACCGGCG TGCTCATCGA CGGCGAAGAC 
1801 GGCTTGATGA CCACGCTTGC CAAATGCTGC AAACCCGCGC CGCCCGACGA 

18 51 TATTGCCGGC TTCGTTACCC GCGAGCGCGG CATTTCCGTC CACCGCAAAA 
1901 CCTGCCCCTC TTTCCGACAC CTTGCCGAAC ACGCGCCCGA AAAAGTACTG 

19 51 GACGCAAGTT GGGCGGCGTT GCAGGAAGGG CAAGTGTTCG CCGTCGATAT 
2001 CGAAATCCGC GCCCAAGACC GCTCCGGGCT TTTGCGCGAC GTATCCGACG 
2051 CGCTCGCCCG CCACAAACTC AACGTTACCG CCGTGCAAAC CCAGTCCCGC 
2101 GACTTGGAAG CCAGCATGAG GTTCACGCTC GAAGTCAAAC AAGTCAACGA 
2151 CCTCCCGCGC GTCCTCGCCG GCCTCGGCGA TGTCAAAGGC GTATTGAGCG 
2201 TTACCCGGCT TTAA 



This corresponds to the amino acid sequence <SEQ ID 418; ORT 117-l.ng>: 

gll7-l .pep 

1 MTAISPIQDT OSATLOELRE WFDSYCAALP DNDKNLIGTA WSLAQEHYPA 

51 DAATPYGEPL PDHFLGAAQM VDELDLLPDA VAATLLADIG RYVPDWNLLV 

101 SERCNSTVAE LVKGVDEVQK LTHFARVDSL ATPEERAQQA ETMRKMLLAM 

151 VTDIRVVLIK LAM RTRTLQF LSNAPDSPEK RAVAKETLDI FAPLANRLGV 

201 WQLKWQLEDL GFRHQEPEKY REIALLLDEK RTERLEYIEN FLDILRTELK 

251 KYNIHFEVAG RPKHIYSIYK KMVKKKLSFD GLFDIRAVRI LVDTVPECYT 

301 TLGIVHSLWQ PIPGEFDDYI ANPKGNGYKS LHTVIVGPEE KGVEVQIRTF 

351 DMHQFHEFGV AAHWRYKEGG KGDSAYEQKI AWLRQLLDWR ENMAESGKED 

4 01 LAAAFKTELF NDTI YVLTPH GKVLSLPTGA TPIDFAYALH SSIGDRCRGA 

4 51 KVEGQIVPLS TPLENGQRVE I I TAKEGHPS VNWLYEGWVK SGKAIGKIRA 

501 YIRQQNADTV REEGRVQLDK QLAKLTPKPN LQELAENLGY KKPEDLYTAV 

551 GQGEISNRAI QKACGTLNEP PPVPVSATTI VKQSKIKKGG KTGVLIDGED 

601 GLMTTLAKCC KPAPPDDIAG FVTRERGISV HRKTCPSFRH LAEHAPEKVL 

651 DASWAALQEG QVFAVDIEIR AQDRSGLLRD VSDALARHKL NVTAVQTQSR 

701 DLEASMRFTL EVKQVNDLPR VLAGLGDVKG VLSVTRL* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 419>: 

m!17-l . seq 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG CAAAGCGCGA CTCTGCAAGA 
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51 ATTGCGCGAA TGGTTCGACA GCTACTGCGC CGCTCTGCCG GACAACGATA 

101 AAAACCTCAT CGGTACCGCA TGGTTGCTGG CGCAGGAACA TTACCCCGCC 

151 GATGCCGCCA CGCCGTATGG CGAGCCGCTG CCCGACCACT TCCTCGGCGC 

201 GGCGCAAATG GTTCATGAAC TCGACCTGCT CCCCGATGCC GTCGCCGCCA 

251 CCCTGCTTGC CGACATCGGA CGCTACGTCC CCGACTGGAA CCTATTGGTT 

301 TCCGAACGCT GCAACAGTAC CGTCGCCGAG CTGGTCAAAG GTGTGGACGA 

251 AGTGCAGAAA CTCACCCACT TCGCCCGGGT GGACAGCCTC GCCACGCCGG 

4 01 AAGAACGCGC CCAGCAGGCA GAAACTATGC GGAAAATGCT GCTGGCGATG 

4 51 GTTACCGACA TCCGCGTCGT GTTAATCAAA CTGGCGATGC GTACGCGCAC 

501 CCTGCAATTT TTAAGCAACG CCCCCGACAG CCCCGAAAAA CGCGCCGTCG 

551 CCAAAGAAAC CCTCGACATC TTCGCCCCGC TCGCCAACCG TTTGGGCGTG 

601 TGGCAGCTCA AATGGCAGCT CGAAGATTTG GGCTTCCGCC ATCAAAAGCC 

651 CG AAAAATAC CGCGAAATCG CGCTGCTTTT GGACGAAAAA C"-:ACCGAAC 

701 GCCTCGAATA CATCGAAAAC TTCCTCAACA TCCTGCGCGG TGAACTCAAG 

751 AAATACAATG TCCATTTCGA AGTCGCCGGC CGCCCGAAAC ACATCTACTC 

801 CATTTACAAA AAAATGGTGA AGAAAAAACT CAGCTTCGAC GGCCTCTTTG 

8 51 ACATCCGCGC CGTGCGAATT CTGGTTGATA CCGTCCCCGA GTGTTACACC 

901 ACGCTGGGTA TCGTCCACAG CCTCTGGCAG CCCATTCCCG GCGAGTTCGA 

951 CGACTACATC GCCAATCCCA AAGGCAACGG CTATAAAAGT TTGCACACCG 

1001 TCATCGTCGG CCCGGAAGAC AAAGGCGTGG AAGTACAAAT CCGCACCTTC 

1051 GATATGCACC AATTCAACGA ATTCGGTGTC GCCGCCCACT GGCGTTACAA 

1101 AGAGGGCGGC AAGGGCGATT CCGCCTACGA ACAGAAAATC GCCTGGTTGC 

1151 GCCAACTCTT GGACTGGCGC GAAAACATGG CGGAAAGCGG CAAGGAAGAC 

1201 CTCGCCGCCG CCTTCAAAAC CGAGCTTTTC AACGACACGA TTTATGTTTT 

1251 GACCCCGCAC GGCAAAGTCC TCTCCCTGCC CACGGGCGCG ACCCCCATCG 

1301 ACTTCGCCTA CGCCCTGCAC AGCAGCATCG GCGACCGTTG CCGCGGTGCG 

1351 AAAGTCGAAG GGCAGATTGT GCCGCTGTCC ACCCCGCTCG AAAACGGACA 

14 01 GCGCGTCGAA ATCATTACCG CCAAAGAAGG GCATCCTTCC GTCAACTGGC 

14 51 TTTACGAAGG CTGGGTCAAA TCCAACAAGG CAATCGGCAA AATCCGCGCC 

1501 TACATCCGCC AGCAAAACGC CGACACCGTG CGCGAAGAAG GCCGCGTCCA 

1551 ACTCGACAAA CAGCTTGCCA AACTCACGCC CAAACCCAAC CTGCAAGAGC 

1601 TTGCCGAAAA TCTCGGCTAC AAAAAGCCAG AAGACCTCTA CACCGCCGTC 

1651 GGACAAGGCG AAATTTCCAA CCGCGCCATC CAAAAAGCCT GCGGCACGCT 

17 01 GAACGAACCG CCGCCCGTAC CCGTCAGCGA AACCACCATC GTCAAACAGT 

1751 CCAAAATCAA AAAAGGCGGC AAAAACGGCG TGCTCATCGA CGGCGAAGAC 

1801 GGTCTGATGA CCACGCTTGC CAAATGCTGC AAACCCGCGC CGCCCGACGA 

IzdI 7ATTATCGGC TTCGTTACCC GCGAGCGCGG CATTTCAGTG CACCGCAAAA 

1901 CCTGCCCGTC TTTCCAACAC CTCGCCGAAC ACGCGCCCGA AAAAGTGCTG 

1951 GACGCAAGCT GGGCGGCATT GCAGGAAGGA CAAGTATTCG CCGTCGATAT 

2001 CGAAATCCGC GCCCAAGACC GCTCCGGGCT TTTGCGCGAC GTATCCGACG 

2051 CGCTCGCCCG CCACAAACTC AACGTTACCG CCGTGCAAAC CCAGTCCCGC 

2101 GACTTGGAAG CCAGCATGAG GTTCACGCTC GAAGTCAAAC AAGTCAACGA 

2151 CCTCCCGCGC GTCCTCGCCA GCCTCGGCGA CGTCAAAGGC GTATTGAGCG 

2201 TTACCCGGCT TTAA 

This corresponds to the amino acid sequence <SEQ ID 420; ORF 1 1 7-1 >: 

mll7-l .pep 

1 MTAISPIQDT QSATLQELRE WFDS YCAALP DNDKNLIGTA WLLAQEHYPA 

51 DAATPYGEPL PDHFLGAAQM VHELDLLPDA VAATLLADI G RYVPDWNLLV 

101 SERCNSTVAE LVKGVDEVQK LTHFARVDSL ATPEERAQQA ETMRK MLLAM 

151 VTDIRWLIK LA MRTRTLQF LSNAPDSPEK RAVAKETLDI FAPLANRLGV 

201 WQLKVfQLEDL GFRHOKPEKY REIALLLDEK RTERLEYIEN FLNILRGELK 

251 KYNVHFEVAG RPKHIYSIYK KMVKKKLSFD GLFDIRAVRI LVDTVPECYT 

301 TLGI VHSLWQ PIPGEFDDY1 ANPKGNGYKS LHTVI VGPED KGVEVQIRTF 

3 51 DMHQFNEFGV AAHWRYKEGG KGDSAYEQKI AWLRQLLDWR ENKAESGKED 
401 LAAAFKTELF NDTI YVLTPH GKVLSLPTGA TPIDFAYALH SSIGDRCRGA 

4 51 KVEGQIVPLS TPLENGQRVE I ITAKEGHP3 VNWLYEGWVK SNKAIGKIRA 
501 YIRQONADTV REEGRVQLDK QLAKLTPKPN LGELAENLGY KKPEDLYTAV 
551 GQGEISNRAI QKACGTLNEP PPVPVSETTI VKQSKI KKGG KNGVLIDGED 
601 GLMTTLAKCC KPAPPDDI TG FVTRERGISV HRKTCPSFQH LAEHAPEKVL 
651 DASWAALQEG QVFAVDIEIR AQDRSGLLRD VSDALARHKL NVTAVQTQSR 
701 DLEASMRFTL EVKQVNDLPR VLASLGDVKG VLSVTRL* 

mll7-l/gll7-l 98. 2% identity in 737 aa overlap 

10 20 30 40 50 60 

ml 17-1 .pep MTAISPIQDTQSATLOELREWFDSYCAALPDNDKNLIGTAWLLAQEHYPADAATPYGEPL 
I I I I I I I I I I I I I I i I I I I I I I I I I \ I I f I I I I I I I I I I t I \ I I t I I I I I i I M ! I I I I 
g 117-1 MTAISPIQDTQSATLQELREWFDSYCAALPDNDKNLIGTAWSLAQEHYPADAATPYGEPL 

10 20 30 40 50 60 

70 80 90 100 110 120 
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ml 1 7 - I . pep PDKFLGAAQMVHELDLLPDAVAATLLADI GRyVPDWNLLVSERCNSTVAEL VKGVDEVQK 
I ! I f M I I I I f I I I I f I I t f i I I i I I I I I I I I ! I I I I ! i 1 I I I I I I I I ! I I I I I II i I t 
ail 7-1 PDHFLGAAOMVDELDLLPDAVAATLLADIGRYVPDWNLLVSERCKSTVAELVKGVDEVQK 
70 80 90 IOC 110 120 

130 140 150 16C 170 ISO 

ml 17-1 . pep LTHFARVDSLATPEERAOOAETMRKMLLAHVTDIRVVLIKLAMRTRrLOFLSNAPDSPEK 
1 ! I ! ! i 1 ! II I I 1 ! I M t ! i ) I tt I i it 1 II 1 t I I i M i t I I 1 1 I i i II I I I M I 1 1 I I 
gl 17-1 LTKFARVDSLATPEERAQQAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSNAPDSPEK 
130 140 1 50 160 170 180 

190 200 210 220 230 240 

ml 17-1 .pep RAVAKETLDIFAPLANRLGVWQLKWQLEDLGFRHQKPEKVREIALLLDEKRTERLEYIEN 
I t f I I I I i I I I I f I I I I I I I I I I I I I I I i I I I I I I : I I I II I I f I i M I I U I I I I I I I I 
g i 1 7 - 1 RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQEPEKY'REIALLLDEKRTERLEYIEN 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml!7-l . pep FLNI LKGELKKYNVHFEVAGRPKHI YSI YKKMVKKKLSFDGLFDI RAVRI LVDT V PEC YT 
I I : I I I I ! I I I I : I I I I ! I I I I I I I I I I I I I I i I I i I I I I I I M I I I I I I M M I i I i I 
gll7-l FLDILRTELKKYNIHFEVAGRPKHI YSI YKKMVKKKLSFDGLFDI RAVRI LVDT VP EC YT 

250 260 270 280 290 300 

310 320 330 340 350 360 

mll7-l .pep TLGIVHSLWQPI PGEFDDYIANPKGNGYKSLHTVIVGPEDKGVEVQIRTFDMHQFNEFGV 
I I M I I I I II I i I I I li I I M I ( I i I I I II ! I I I I I ! M : M I I I ! I I I i I II I I II I I ! 
gll7-l TLGI VHSLWQPI PGEFDDYI ANPKGNGYKSLHTVI VGPEEKGVE7QI PTFDMHQFNEFGV 

310 320 330 340 350 360 

370 330 390 400 410 420 

mlI7-l .pep -.AHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPH 
I I I I I I II I I I I ! I I I I I I II I II I M I I I I i I I ! II I I I I I I f I II I I I I I I i i I I I I I 
all 7-1 AAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTIYVLTPH 

370 3S0 390 400 410 420 

430 440 450 460 470 480 

ml 17-1 .pep GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQI VPLSTPLENGQRVEIITAKEGHPS 
I I 1 I I i I I 1 I I t I 1 I I I I i ! M ! I I I I } i M I I I I I I I II i I I ! 1 i I ( 11 I 1 1 I I I I i I I 
gll7-l GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQI VPLSTPLENGQRVEIITAKEGHPS 

430 440 450 460 470 480 

490 500 510 520 530 540 

mll7-l . pep VNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 
I I I I I I I I ! I I : I II I I I I I II II I i I I I I I I I I I I I I I I i I I I I I I I I ! I I I I I I I I I I 
gll7-l VNWLYEGWVKSGKAIGKIRAYIRQONADTVREEGRVQLDKOLAKLTPKPNLQELAENLGY 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 17-1 .pep KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSETTIVKQSKIKKGGKHGVLIDGED 
I 1 I I 1 I I I I I I I ! I I 1 1 I it I 1 I I I I i M I i 1 I I 1 1 I I 1 I 1 I II I 1 1 I I I : I I I I I I i ! 
g 117-1 KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSATTIVKQSKIKKGGKTGVLIDGED 

550 560 570 580 590 600 

610 620 630 640 650 660 

m!17-l .pep GLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKTCPSFQHLAEHAPEKVLDASWAALQEG 
I I i I I I I I II I i I M I I I I I I I I I I I I I I I I I I I ! I I : M I I I ! I I I I I I I I I I I I I I I 
gl!7-l GLMTTLAKCCKPAPPDDIAGFVTRERGISVHRKTCPSFRHLAEKAPEKVLDASWAALQEG 

610 620 630 640 650 660 

670 680 690 700 710 720 

ml 1 7 - 1 . pep OVFAVDI EIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR 
I I I I I I II I i I I I I I I I i II I M I i I I I I I I I I I I I I I I I I I I I I I I I I I I M II I ) I I I 
gll7-l QVFAVDIEIRAQDRSGLLRDVSDALARKKLNVTAVQTQSRDLEASMRFTLEVKQVNDLPR 

670 680 690 700 710 720 

730 

ml 17-1. pep VLASLGDVKGVLSVTRLX 
I t I : t I i I I I I II 1 I I I I 
g!17-l VLAGLGDVKGVLSVTRLX 
730 



mll7-l/RelA 
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spl P55133 | RELA_VIBSS GTP PYROPHOSPHOKI NASE ( ATP : GTP 3 ' -PYROPHOSPHOTRANSFEPA5E) ( PPGPP 
SYNTHETASE I) >gi I 537617 (U1376S) ppGpp synthetase I [Vibrio sp.) Length = 744 
Score = 536 bits (1366), Expect = e-151 

Identities - 288/685 (42%), Positives = 432/685 {63°o), Gaps = 31/685 (4%) 

Query; 7 4 LDLLPDAVAATLLADI GR Y V PDWNL LVS ERCN S TV AELVKGVDEVQKLT HEAR VDSL 130 

L + D + A LL + G Y D + E + T + LV + GV-f^+ *+ ++ S 
Sb}ct: 63 LSMDADTLIAALL YPLVEGGCYSTD ALKEEYSGTILHLVQGVEOMCAIS OLKST 121 

Query: 131 ATPEERAQOAETMRKMLLAMVTDIRVVLIKLAMRTRTLQFLSMAPDSPEKRAVAKETLDI 190 

A +A Q + +R+MLL+MV D R V+IKLA R L + + + PD -RA A + E +1 
Sbjct: 122 AEETAQAAQVDNIRRMLLSMVDDFRCVVIKLAERICNLREVKDCPDEV-RRAAAQECANI 180 

Query: 191 FAPLANRLGVWQLKWQLEDLGr f.HQKPEKYREIALLLDEKRTERLEYIENFLNI LRGELK 250 

+APLANRLG+ QLKW++ED FK^Q P^ Y++IA L E+R +R + :± +F++ L +K 
Sbjct: 181 YAPLANRLGIGQLKWEIEDYAFRYQHPDTYKQIAKQLSERRIDREDYITHFVDDLSDAMK 240 

Query: 251 KYMVHFEVAGRPKHI YSI YKKMVKKKLSFDGLFDIRAVRILVDTVPEC YTTLGI VHSLWQ 310 

N+ EV GRPKHIYSI++KM KK L FD LFD+RAVRI + + + +CY LG+VH+ + •*- 
Sbjct: 241 ASMIRAEVQGRPKHIYSIWRKMQKKSLEFDELFDVRAVRIVAEELQDCYAALGVVHTKYR 300 

Query: 311 PI PGEFDDYIANPKGNGYKSLHTVIVGPEDKGVEVQIRTFDMHQFHEFGVAAHWRYKEG- 369 

+ P EFDDY+ANPK NGY + S+HTV-f+GPE K +E+QIRT MH+ +E GVAAHW+YKEG 
Sbjct: 301 HLPKEFDDYVANPKPNGYQSIH7WLGPEGKTIEIQIRTKQMHEESELGVAAHWKYKEGT 360 

Query: 370 --GKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTI YVLTPHGKVLSLP 42 7 

G SAY++KI WLR+LL W + E M+ + SG ++ ++ + + F+D +Y TP G V+ LP 
Sbjct: 361 ASGGAQSAYDEKINWLRKLLAWQEEMSDSG--EMLDELRSQVFDDRVYAFTPKGDWDLP 418 

Query: 428 TGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEI ITAKEGHPSVNWLYE- 486 

+ ATP+DFAY +HS +G RC GAKVEG+IVP + L+ G +VEIIT KE +PS +WL 
Sbjct: 419 SNATPLDFAYHIHSEVGHRC1GAKVEGR2VPFTYHLQMGDQVE1ITQKEPKPSRDV3LNPH 473 

Query: 487 -GWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKL— TPKPNLQELAENLGYKKP 54 3 

G+V S ++A K+ A+ R+Q+ D G+ L + +L K+ T K A- K P 

Sbjct: 479 LGFVTSSRARAKVHAWFRKQDRDKNI I AGKEILEAELVKI HATLKDAQYYAAKRFNVKS P 538 

Query: 544 EDLYTAVGQGEIS-NRAIQKACGTLNEPPPVPVSETTIVKQSKI KKGGKNGV 594 

E+LY +G G+ + N + I +N^P + + K S+ KK +- V 

Sbjct: 539 EELYAGIGSGDLRINQVINHINALVNKPTAEEEDQQLLEKLSEASNKQATSHKKPQRDAV 593 

Query: 595 LIDGEDGLMTTLAKCCKPAPPDDIIGFVTRERGISVHRKTCPSFQKLAEHAPEKVLDASW 654 

+ -T-+G D LMT LA+CC + P P DDI GFVT+ RGISVHR C + L HAPE+ + + D W 
Sbjct: 599 VVEGVDNLMTHLARCCQPIPGDDIQGFVTQGRGISVHRMDCEQLEELRHHAPERIIDTVW 658 

Query: 655 AALQEGQVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQ--3RDLEASMRFTLEV 712 

G + + + + A +R+GLL+++++ L K+ V + + + + + + M F LE + 
Sbjct: 65 9 GGGFVGN- YTITVRVTASERNGLLKELTNTLMNEKVKVAGMKSRVDYKKQMSIMDFELEL 717 

Query: 713 KQVNDLPRVLASLGDVKGVLSVTRL 7 37 

+ L RVL + VK V RL 
Sbjct: 718 TDLEVLGRVLKRIEQVKDVAEAKRL 7 42 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 42 1>: 

all7-l.seq 

1 ATGACCGCCA TCAGCCCGAT TCAAGACACG CAAAGCGCGA CTCTGCAAGA 

51 ATTGCGCGAA TGGTTCGACA GCTACTGCAC CGCGCTGCCG AACAACGATA 

101 AAAAACTTGT CTTAGCCGCC CGTTCGCTGG CGGAAGCACA TTACCCCGCC 

151 GATGCCGCCA CGCCGTATGG CGAACCGCTG CCCGACCACT TCCTCGGCGC 

201 GGCGCAAATG GTTCATGAAC TCGACCTGCT CCCCGATGCC GTCGCCGCCA 

251 CCCTGCTTGC CGACATCGGA CGCTACGTCC CCGACTGGAA CCTATTGGTT 

301 TCCGAACGCT GCAACAGTAC CGTCGCCGAG CTGGTCAAAG GTGTGGACGA 

351 AGTGCAGAAA CTCACCCACT TCGCCCGGGT GGACAGCCTC GCCACGCCGG 

4 01 AAGAACGCGC CCAGCAGGCA GAAACTATGC GGAAAATGCT GCTGGCGATG 

4 51 GTTACCGACA TCCGCGTCGT GTTAATCAAA CTGGCGATGC GTACGCGCAC 

501 CCTGCAATTT TTAAGCAACG CCCCCGACAG CCCCGAAAAA CGCGCCGTCG 

551 CCAAAGAAAC CCTCGACATC TTCGCCCCGC TCGCCAACCG TTTGGGCGTG 

601 TGGCAGCTCA AATGGCAGCT CGAAGATTTG Gl. .TTCCGCC ATCAAGAACC 

651 CGAAAAATAC CGCGAAATCG CCCTGCTTTT GGACGAAAAA CGCACCGAAC 

701 GCCTCGAATA CATCGAAAAC TTCCTTAATA TCCTGCGTAC GGAACTCAAA 

751 AAATACAATA TCCACTTTGA AGTCGCCGGC CGTCCGAAAC ACATCTACTC 

801 CATTTACAAA AAAATGGTGA AGAAAAAACT CAGCTTCGAC GGGTTGTTCG 
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651 


ACATCCGCGC 


CGTGCGGATT 


901 


ACACTGGGCA 


T7GTCCACAG 


951 


CGACTACATC 


GCCAACCCGA 


1001 


TCATCGTCGG 


CCCGGAAGAC 


1051 


GATATGCACC 


AATTCAACGA 


1101 


AGAGGGCGGC 


AAAGGCGATT 


1151 


GCCAACTTTT 


GGACTGGCGC 


1201 


CTCGCCGCCG 


CCTTCAAAAC 


1251 


GACCCCGCAC 


GGCAAAGTCC 


1301 


ACTTCGCCTA 


CGCCCTGCAC 


1351 


AAAGTCGAAG 


GGCAGATTGT 


1401 


GCGTGTCGAA 


ATCATTACCG 


1451 


TTTACGAAGG 


CTGGGTCAAA 


1501 


TACATCCGGC 


AGCAAAACGC 


1551 


ACTCGACAAA 


CAGCTTGCCA 


1601 


TTGCCGAAAA 


TCTCGGCTAC 


1651 


GGACAAGGCG 


AAATTTCCAA 


1701 


GAACGAACCG 


CCGCCCGTAC 


1751 


CCAAAATCAA 


AAAAGGCGGC 


1801 


GGTCTGATGA 


CCACGCTTGC 


1351 


CATTGTCGGC 


TTCGTTACCC 


1901 


CCTGCCCCTC 


TTTCCGACAC 


1951 


GACGCAAGTT 


GGGCGGCGTT 


2001 


CGAAATCCGC 


GCCCAAGACC 


2051 


CGCTCGCCCG 


CCACAAACTC 


2101 


GACTTGGAAG 


CCAGCATGAG 


2151 


CCTCCCACGC 


GTCCTCGCCA 


2201 


TTACCCGGCT 


TTAA 



CTG^j : TGATA CCGTCCCCGA GTGTTACACC 
CCTCTGGCAG CCCATTCCCG GCGAGTTCGA 
AAGGCAACGG CTATAAAAGT TTGCACACCG 
AAAGGCGTGG AAGTGCAAAT CCGCACCTTC 
ATTCGGTGTC GCCGCGCACT GGCGTTACAA 
CCGCCTACGA ACAAAAAATC GCCTGGTTAC 
GAAAACATGG CGGAJ-AGCGG CAAGGAAGAC 
CGAGCTTTTC AACGACACGA TTTATGTTTT 
TCTCCCTGCC CACAGGCGCG ACCCCCA7CG 
AGCAGCATCG GCGACCGTTG CCGCGGTGCG 
GCCGCTGTCC ACCCCGCTCG AAAACGGACA 
CCAAAGAAGG GCATCCTTCC GTCAACTGGC 
TCCAACAAGG CAATCGGCAA AATCCGCGCC 
CGACACCGTG CGCGAAGAAG GCCC r GTCCA 
AACTCACGCC CAAACCCAAC CTGCAAGAGC 
AAAAAGCCAG AAGACCTCTA CACCGCCGTC 
CCGCGCCATC CAAAAAGCCT GCGGCACGCT 
CCGTCAGCGA AACCACCATC GTCAAACAGT 
AAAAACGGCG TGCTCATCGA CGGCGAAGAC 
CAAATGCTGC AAACCCGCGC CGCCCGACGA 
GCGATCGCGG CATTTCGGTA CACCGCAAAA 
CTCGCCGAAC ACGCGCCCGA AAAAGTACTG 
GCAGGAAGGA CAAGTGTTCG CCGTCGATAT 
GCTCCGGGCT TTTGCGCGAC GTATCCGACG 
AACGTTACCG CCGTGCAAAC CCAGTCCCGC 
GTTCACGCTC GAAGTCAAAC AAGTTACCGA 
GCCTCGGCGA CGTCAAAGGC GTATTGAGCG 



This corresponds to the amino acid sequence <SEQ ID 422; ORF 1 17-1 ,a>: 

all7-l .pep 

1 MTAISPIQDT QSATLQELRE WFDSYCTALP NNDKKLVLAA RSLAEAHYPA 

51 DAATPYGEPL PDHFLGAAQM VHELDLLPDA VAATLLADIG RYVPDWNLLV 

101 SERCNSTVAE LVKGVDEVQK LTHFARVDSL ATPEERAQQA ETMRKMl,LAM 

151 VTDIRVVLIK LAMRTRTLQF LSNAPDSPEK RAVAKETLDI FAPLANRLGV 

201 WQLKWQLEDL GFRHQEPEKY REIALLLDEK RTERLEYIEN FLHILRTELK 

251 KYNIHFEVAG RPKHIYSIYK KMVKKKLSFD GLFDIRAVRI LVDTVPECYT 

301 TLGIVHSLWQ PIPGEFDDYI ANPKGNGYKS LHTVIVGPED KGVEVQIRTF 

351 DMHQFNEFGV AAHWRYKEGG KGDSAYEQKI AWLRQLLDWR ENMAESGKED 

401 IAAAFKTELF NDTIYVLTPH GKVLSLPTGA TPIDFAYALH SSIGDRCRGA 

451 KVEGQIVPLS TPLENGQRVE IITAKEGHPS VNWLYEGWVK SNKAIGK1RA 

501 YIRQQNADTV REEGRVQLDK QLAKLTPKPN LQELAENLGY KKPEDLYTAV 

551 GOGEISNRAI QKACGTLNEP PPVPVSETTI VKQSKI KKGG KNGVLI DGED 

601 GLMTTLAKCC KPAPPDDIVG FVTRDRGISV HRKTCPSFRH LAEHAPEKVL 

651 DASWAALQEG QVFAVDIEIR AQDRSGLLRD VSDALARHKL NVTAVQTQSR 

701 DLEASMRFTL EVKQVTDLPR VLASLGDVKG VLSVTRL* 



all7-l/m!17-l 31.1% identity in 737 aa overlap 



10 20 30 40 50 60 

ml 17-1 . pep MTAISPIQDTOSATLQELREWFDSYCAALPDNDKNLIGTAWLLAQEHYPADA^TPYGEPL 
I I M t I I I I I I I 1 I I I ! I I I 1 I f I I I : I I I : II I : I : : I I I : ! I ! I I I I I I I I I j I 
a 117-1 MTAISPIQDTQSATLQELREWFDSYCTALPHNDKKLVLAARSLAEAHYPADAATPYGEPL 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 17-1 .pep PDKFLGAAQMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCHSTVAELVKGVDEVQK 
I M I I I I I I I I I I f I II I I I I It I t I ! I I I I I I I I I I I I I 1 I I 1 I I I I I I [ I I i I t I [ | I 
a!17-l PDHFLGAAOMVHELDLLPDAVAATLLADIGRYVPDWNLLVSERCNSTVAELVKGVDEVQK 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 1 7 - 1 . pep LTHFARVDSLATPEERAQQAETMRKMLLAMVTDI RVVLI KLAMRTRTLQFLSNAPDSPEK 
I I I ! I I II I I I i I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I M M I I ! I I I II I 
a 1 1 7 - 1 LT HFARVDSLAT PEERAQOAETMRKMLLAMVTDI RVVLI KLAMRTRTLQFLSNAPDS PEK 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 1 7 - 1 . pep RAVAKETLDI FAPLANRLGVWQLKWOLEDLGFRHQK PEK YREI ALLLDEKRTERLEYXEN 
I I I I ( II I I I I I ! I I II 1 I I I I [ I I I I I I i 1 1 I i i : I I ! I i tt i I I ■ M ! I I I ! ! I I ! I I 
all 7-1 RAVAKETLDI FAPLANRLGVWQLKWQLEDLGFRHQEPEKYREIALLLDEKRTERLEYI EN 

190 200 210 220 230 240 
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250 260 270 280 290 300 

ml 1 7 - 1 . pep FLNI LRGELKKYNVHFEVAGRPKHI YS 1 YKKMVKKKLSFDGLFDIRAVRI LVDTVPEC YT 
I ! I I I I I I I i I ! : I I I ! I I I I I I I I I I I I I ! t I I I I I I I I ! I I I I II i I t I I ! t I I I I I 
all 7-1 FLNI LRTELKKYNIHFEVAGRPKHI YS I YKKMVKKKLS FDGLFDI RAVRI LVDTVPEC YT 

2S0 260 270 280 290 300 

310 320 330 340 350 360 

ml 1 7- 1 . pep TLGI VHSLWQPI PGEFDDYI ANPKGNGYKSLHTVIVGPEDKGVEVQIRTFDMHQFNEFGV 
I I I I I I 1 M I I I I I I i I I ! I I I I I I I i I I I I I I I 1 I ! 1 I I I I I I I I ! I I I I I I I i I I I i I 
a 1 1 7- 1 TLGI VHSLWQPI PGEFDDYI AN PKGNGYKSLHT VI VGPEDKGVEVQIRT FDMHQFNEFGV 

310 320 330 340 350 360 

370 380 390 4C0 410 420 

ml 17-1 .pep AAHWRYKEGGKGDSAYEQKIAWLRQLLDWRENMAESGKEDLAAAFKTELFNDTI YVLTPH 
] 1 I I ! 1 I 1 I i I II I I I t i ! I I I 1 I I I I II I i I 111 I ) ) II I I I I I I I I I I i 1 i II I I I II 
a 1 1 7 - 1 AAHWRYKEGGKGDSAYEQKI AWLRQLLDWRENMAESGKEDLAAAFKTELFNDTI YVLTPH 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml 1 7- 1 . pep GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEI ITAKEGHPS 
I I I I I I I I I I I I I I I ! I ! I . I I I I I ! I I I I I I I i I I I I I I I t I I I I I I I I I I I I I I ! I I I 
all7-l GKVLSLPTGATPIDFAYALHSSIGDRCRGAKVEGQIVPLSTPLENGQRVEI ITAKEGHPS 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 17-1 .pep VNWLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 
! I ! I I II I I I i I I I ( I I II ! I I I I I I I I i I I I I I I M I I I I I i I I ! If I I I I I I I I I I I I 
ain-1 VNVJLYEGWVKSNKAIGKIRAYIRQQNADTVREEGRVQLDKQLAKLTPKPNLQELAENLGY 

490 500 510 520 530 540 

550 560 570 580 590 600 

mll7-l . pep KKPEDLYTAVGQGEISNRAIQKACGTLNEPPPVPVSETTIVKOSKIKKGGKNGVLIDGED 
I ! I I I I I I I I f I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I ! ! I I I I I I I I I I I I I 
all7-l KKPEDLYTAVGOGEISNRAIQKACGTLNEPPPVPVSETTI VKQSKIKKGGKNGVLI DGED 

550 560 570 580 590 600 

610 620 630 640 650 660 

nil 1 7-1 , pep GLMTTLAKCCKPAPPDDI IGFVTRERGISVHRKTCPSFQHLAEHAPEKVLDASWAALQEG 
! I I I I I I i I I I I I I I I i 1 : I ( i \ I : (I I I t II I I I I I I : 1 I t t I II i i II I t I I I I II I ! 
a 117-1 GLMTTLAKCCKPAPPDDI VGFVTRDRGISVHRKTCPSFRHLAEHAPEKVLDASWAALQEG 

610 620 630 640 650 660 

670 680 690 700 710 720 

ml 17-1 .pep QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVOTQSRDLEASMRFTLEVKQVNDLPR 
I I I I 1 I !l I I I i I I I 11 I I I M I 1 M 1 I 1 II I I I 1 I I 1 1 I I I I 1 1 I 1 I 1 1 I I 1 1 I : II I I 
a 117-1 QVFAVDIEIRAQDRSGLLRDVSDALARHKLNVTAVQTQSRDLEASMRFTLEVKQVTDLPR 

670 680 690 700 710 720 

730 

mll7-l.pep VLASLGDVKGVLSVTRLX 
I I ! I I I I I II I I If I I I I 
a!17~l VLASLGDVKGVLSVTRLX 
730 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 423>: 

gll8 . seq 

1 ATGTGCGAGT TCAAGGATTT TAGAAGAAAC ATCCCTTGTT TTGAAGAGTA 

51 TGACGAAAAT TCATTTATTG GCAAATGGTA TGATGACGGG GTGTGGGATG 

101 ATGAAGAATA TTGGAAGCTG GAGAATGATT TAATcgaGGT TAGGAGAAAA 

151 TATCCTTATC CGATGGATAT ACCAAGGGAT ATTGTGATTG GAATCGGTAC 

201 CATTATTGAT TTTTTAATGG TTCCAAATTG GGAGCTTTTT GAAATTAAAG 

251 CTTCCCCTTG GTTGCCTGAT AGCGTGGGAA TTCATGAACG TTATGAAAGA 

301 TTCACAACGA TGCTCCGTTA TATTTTTACC GAGAAAGACA TAGTCAACGT 

3 51 GCGATTTGAT TATTACAaCA AAAAATAG 

This corresponds to the amino acid sequence <SEQ ID 424; ORF 1 1 8.ng>: 

gll8 . pep 

1 MCEFKDFRRN IPCFEEYDEN SFIGKWYDDG VWDDEEYWKL ENDLIEVRRK 

51 YPYPMDIPRD IVIGIGTIID FLMVPNW ELF EIKASPWLPD SVGIHERYEP, 

101 FTTMLRYI FT EKDIVNVRFD YYNKK* 
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70 30 90 100 HO 120 

ml 18. pep VVIGIGTI I DFLMVPNWKLFEIKASPWLPDSVGIHERYERFTTMLRYI FTEKDIVNVRFD 

: I M I I I I I ! I I I I I ! i : M I M | I ! I ! ( I i M I I I I I 1 i I l I f 1 ! ! ! I I I i I i I I l i i I 
all8 IVIGIGTI IDFLMVPNWELFEIKASPWLPDSVGIHERYERFTTMLRYI FTEKDI VNVRFD 

70 80 90 100 110 120 



ml 18. pep YYNKKX 
Mill! 

all 8 YYNKKX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 429>: 



gl20 - seq 












1 


ATGATGAAGA 


CTTTTAAAAA 


TATATTTTCC 


GCCGCCATTT 


TGTCCGCCGC 


51 


CCTGCCGTGC 


GCGTATGCGG 


CAAGGCTACC 


CCAATCCGCC 


GTGCTGCACT 


101 


ATTCCGGCAG 


CTACGGCATT 


CCCGCCACGA 


TGACATTTGA 


ACGCAGGGGC 


151 


AATGCTTACA 


AAATCGTTTC 


GACGATTAAA 


GTGCCGCTAT 


ACAATATCCG 


201 


TTTCGAATCC 


GGCGGTACGG 


TTGTCGGCAA 


TACCCTGCAC 


CCTGCCTACT 


251 


ATAAAGACAT 


ACGCAGGGGC 


AAACTGTATG 


CGGAAGCCAA 


ATTCGCCGAC 


301 


GGCAGCGTAA 


CCTACGGCAA 


AGCGGGCGAG 


AGCAAAACCG 


AGCAAAGCCC 


351 


CAAGGCTATG 


GATTTGTTCA 


CGCTTGCCTG 


GCAGTTGGCG 


GCAAATGACG 


401 


CGAAACTCCC 


CCCGGGTCTG 


AAAATCACCA 


ACGGCAAAAA 


ACTTTATTCC 


451 


GTCGGCGGCC 


TGAATAAGGC 


GGGTACGGGA 


AAATACAGCA 


Taggcggcgt 


501 


gGAAACCGAA 


GTCGTCAAAT 


ATCGGGTGCG 


GCGCGGCGAC 


GATACGGTAA 


551 


CGTATTTCTT 


CGCACCGTCC 


CTGAACAATA 


TTCCGGCACA 


AATCGGCTAT 


601 


ACCGAcgaCG 


GCAAAACCTA 


TACGCTGAAG 


CTCAAATCGG 


TGCAGATCAA 


€51 


CGGACAGGCC 


GCCAAACCGT 


AA 







This corresponds to the amino acid sequence <SEQ ID 430; ORF 120.ng>: 

gl20.pep 

1 MMKTFKNIFS AAIL5AALPC AYA ARLPQSA VLHYSGSYGI PATMTFERSG 

51 NAYKIVSTIK VPLYNIRFES GGTWGNTLH PAYYKDIRRG KLYAEAKFAD 

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS 

151 VGGLNKAGTG KYSIGGVETE WKYRVRRGD DTVTYFFAPS LNNIPAQIGY 

201 TDDGKTYTLK LKSVQINGQA AKP* 

The following partial DNA sequence was identified in N, meningitidis <SEQ ID 43 1>: 



ml20 . seq 












1 


ATGATGAAGA 


CTTTTAAAAA 


TATATTTTCC 


GCCGCCATTT 


TGTCCGCCGC 


51 


CCTGCCGTGC 


GCGTATGCGG 


CAGGGCTGCC 


CCAATCCGCC 


GTGCTGmACT 


101 


ATTCCGGCAG 


CTACGGCATT 


CCCGCCACGA 


TGACATTTGA 


ACGCAGCGGC 


151 


AATGCTTACA 


AAATCGTTTC 


GACGATTAAA 


GTGCCGCTAT 


ACAATATCCG 


201 


TTTCGAGTCC 


GGCGGTACGG 


TTGTCGGCAA 


TACCCTGCAC 


CCTACCTACT 


251 


ATAGAGACAT 


ACGCAGGGGC 


AAACTGTATG 


CGGAAGCCAA 


ATTCGCCGAC 


301 


GGCAGCGTAA 


CTTACGGCAA 


AGCGGGCGAG 


AGCAAAACCG 


AGCAAAGCCC 


351 


CAAGGCTATG 


GATTTGTTCA 


CGCTTGCCTG 


GCAGTTGGCG 


GCAAATGACG 


401 


CGAAACTCCC 


CCCGGGGCTG 


AAAATCACCA 


ACGGCAAAAA 


ACTTTATTCC 


451 


GTCGGCGGTT 


TGAATAAGGC 


GGGTACAGGA 


AAATACAGCA 


TAGGCGGCGT 


501 


GGAAACCGAA 


GTCGTCAAAT 


ATCGGGTGCG 


GCGCGGCGAC 


GATGCGGTAA 


551 


TGTATTTCTT 


CGCACCGTCC 


CTGAACAATA 


TTCCGGCACA 


AATCGGCTAT 


601 


ACCGACGACG 


GCAAAACCTA 


TACGCTGAAA 


CTCAAATCGG 


TGCAGATCAA 


651 


CGGCCAGGCA 


GCCAAACCG 









This corresponds to the amino acid sequence <SEQ ID 432; ORF 120>: 

ml20 .pep 

1 MMKTFKNIFS AAILSAALPC A* A AGLPQSA VLXYSGSYGI PATMTFERSG 

51 NAYKIVSTIK VPLYNIRFES GGTWGNTLH PTYYRDIRRG KLYAEAKFAD 

101 GSVTYGKAGE SKTEQSPKAM DLFTLAWQLA ANDAKLPPGL KITNGKKLYS 

151 VGGLNKAGTG KYSIGGVETE WKYRVRRGD DAVMYFFAPS LNNIPAQIGY 

201 TDDGKTYTLK LKSVQINGQA AKP 
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10 20 30 <i0 50 60 

70 30 90 100 110 120 

ml20 . pep VPLYNIRFESGGTVVGNTLHPTYYRDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM 
M I I I I I I I I I I I I I I f I I I I I ! I ! I I ! I I I ! I i I I I t I If I I I if I I I I I I ! I ! I I I ! I 
a 120 VPLYNIRFSSGGTVVGNTLHPT. YRDIRRGKLYAEAKFADGSVTYGKAGESKTEQSPKAM 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml20 . pep DLFTLAWQLAANDAKLPPGLKITNGKKLYSVGGLNKAGTGKYSIGGVETEVVKYRVRRGD 
I f I I I I ! I I 1 I I I I I I i M I I i I I I II I I II I ! I I I ! I I I M I I 1 I f ! I I I I I I I i i I I ( 
al20 DLFTLAWQLAANDAKLPPGLKITNGKKLYSVGGLNKAGTGKYSIGGVETEVVKYRVRRGD 

130 140 150 160 170 180 

190 200 210 220 

ml2 0 . pep DAVMYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKPX 
I I f I I i I I M I I { I II f i I I I ! I II I I I i ! I II I I I ! ! I I I II i 
a 120 DAVMYFFAPSLNNIPAQIGYTDDGKTYTLKLKSVQINGQAAKPX 

190 200 210 220 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 435>: 

gl21 . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATTATG TCGGGAACCA GTATGGACGG 

51 GGCGGATGCC GTGCTGGTAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCTGACC GGTTGCGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCAC AGACGAACTG CACCGCAGCA GGATGTTGTC 

2 01 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

2 51 GTCAAAACCT CGCTCCGTGC GACATTACCG CCCTCGGCTG CCACGGGCAA 
301 ACCGTCCGAC ACGCGCCGGA ACACGGTtac AGCATACAGC TTGCCGATTT 

3 51 GCCGCTGCTG GCGGAACTGa cgcggatttT TACCGTCggc gacttcCGCA 

4 01 GCCGCGACCT TGCTGCCGGC GGacaAGGTG CGCCGCTCGT CCCCGCCTTT 
4 51 CACGAAGCCC TGTTCCGCGA TGACAGGGAA ACACGCGTGG TACTGAACAT 
501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGGCGCA CCCGCCTTCG 
551 GCTTCGACAC AGGGCCGGGC AATATGCTGA TGGAcgcgtg gacgcaggca 
601 cacTGGcagc TGCCTTACGA CAAAAacggt gcAAAGgcgg cacAAGGCAA 
651 catatTGCcg cAACTGCTCG gcaggctGCT CGCCcaccCG TATTTCTCAC 

7 01 AACCCcaccc aaAAAGCACG GGgcGCGaac TgtttgcccT AAattggctc 
751 gaaacctAcc ttgacggcgg cgaaaaccga tacgacgtat tgcggacgct 
801 ttcccgattc accgcgcaaA ccgTttggga cgccgtctca CACGCAGCGG 

8 51 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 
901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 
951 CACCGCCGAA CTGAACCTCG ATCCTCAATG GGTGGAGGCG gccgCATTtg 

1001 cgtggttggC GGCGTGTTGG ATTAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCGACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 

This corresponds to the amino acid sequence <SEQ ID 436; ORF !21.ng>: 

gl21 -pep 

1 METQLYIGIM SGTSMDGADA VLVRMDGGKW LGAEGHAFTP YPDRLRRKLL 

51 DLQDTGTDEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPC DITALGCHGQ 

101 TVRHAPEHGY SIQLADLPLL AELTRIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEAL FRD ORE TRVVLNIGGI ANISVLPPGA PAFGFDTGPG NMLMDAWTQA 

201 HWQLPYDKNG AKAAQGNILP QLLGRLLAHP YFSQPHPKST GRELFALNWL 

251 ETYLDGGENR YDVLRTLSRF TAQTVWDAVS HAAADARQMY ICGGGIRNPV 

301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWLAACW INRIPGSPHK 

351 ATGASKPCIL GAGYYY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 437>: 

ml21 . seq 

i ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CCAATTGCTG 
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310 320 330 340 350 360 

ml 21 .pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRIPGSPHKATGASKPCIL 
I M I I I I I I i M I ! I II t i : I I i I I M i M i i i ! I I I t I i I M I M ! ti I I I I I I i ! i I 
g 1 2 1 LMADLAE CFGTRVS LH STAELNLDPQW VEAAAFAWLAACW I N'R I PGS PHKATGASKPC I L 

310 320 330 340 350 360 

ml 21. pep KAGYYYX 
I I I ) I I 

g!21 GAGYYYX 



The following partial DNA sequence 



al21 . seq 






1 


ATGGAAACAC 


AGCTTTACAT 


51 


GGCGGATGCC 


GT ACT GAT AC 


101 


AAGGGCACGC 


CTTTACCCCC 


151 


GATTTGCAGG 


ACACAGGCGC 


201 


GCAAGAACTC 


AGCCGCCTGT 


251 


GTCAAAACCT 


CGCGCCGTCC 


301 


ACCGTCAGAC 


ACGCGCCGGA 


351 


GCCGCTGCTG 


GCGGAACGGA 


401 


GCCGCGACCT 


TGCGGCCGGC 


451 


CACGAAGCCC 


TGTTCCGCGA 


501 


CGGCGGGATT 


GCCAACATCA 


551 


GCTTCGACAC 


AGGACCGGGC 


601 


CACTGGCAGC 


TTCCTTACGA 


651 


CATATTGCCG 


CAACTGCTCG 


701 


AACCCCACCC 


TAAAAGCACG 


751 


GAAACCTACC 


TTGACGGCGG 


801 


TTCCCGATTC 


ACCGCGCAAA 


851 


CAGATGCCCG 


TCAAATGTAC 


901 


TTAATGGCGG 


ATTTGGCAGA 


951 


CACCGCCGAA 


CTGAACCTCG 


1001 


CATGGATGGC 


GGCGTGTTGG 


1051 


GCAACCGGCG 


CATCCAAACC 


1101 


A 





identified in A ; . meningitidis <SEQ ID 439>: 



CGGCATCATG TCGGGAACCA GCATGGACGG 
GGATGGACGG CGGCAAATGG CTGGGCGCGG 
TACCCCGGC GGTTACGCCG CAAATTGCTG 
GGACGAACTG CACCGCAGCA GGATGTTGTC 
ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 
GACATTACCG CCCTCGGCTG CCACGGGCAA 
ACACAGTTAC AGCGTACAGC TTGCCGATTT 
CTCAGATTTT TACCGTCGGC GACTTCCGCA 
GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 
CGACAGGGAA ACACGCGCGG TACTGAACAT 
GCGTACTCCC CCCCGACGCA CCCGCCTTCG 
AATATGCTGA TGGACGCGTG GATGCAGGCA 
CAAAAACGGT GCAAAGGCGG CACAAGGCAA 
ACAGGCTGCT CGCCCACCCG TATTTCGCAC 
GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 
CGAAAACCGA TACGACGTAT TGCGGACGCT 
CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 
ATTTGCGGCG GCGGCATCCG CAATCCTGTT 
ATGTTTCGGC ACACGCGTTT CCCTGCACAG 
ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 
GTCAACCGCA TTCCCGGTAG TCCGCACAAA 
GTGTATTCTG GGCGCGGGAT ATTATTATTG 



This corresponds to the amino acid sequence <SEQ ID 440; ORF 121.a>: 

al21 . pep 

1 METQLY1GIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSQEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVPAF 

151 HEALFRDDRE TRAVLN I GG I ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 

201 HWQLPYDKNG AKAAQGNILP QLLDRLLAHP YFAQPHPKST GRELFALNWL 

2 51 ETYLDGGENR YDVLRTLSRF TAQTVFDAVS HAAADARQMY ICGGGIRNPV 
301 LMADLAECFG TRVSLHSTAE LNLDPQWVEA AAFAWMAACW VNRIPGSPHK 

3 51 ATGASKPCIL GAGYYY* 

ml21/al21 74.0% identity in 366 aa overlap 



10 20 30 40 50 60 

ml21 . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I I I I I t I ! I I I I M I II t I ! I II I 1 I M I I I M I I ! M I I M I M M : 1 M 1 i 1 M I I I I 
a 1 2 1 METQLYI GIMSGTSMDGADAVL IRMDGGKWLGAEGHAFTPYPGRLRRKLLDLQDTGADEL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 21. pep HRSRILSOELSRLYAQTAAELLCSQNLAPSDITALGCHGOTVRHAPEHGYSIQLADLPLL 
It i I : I M I I I M i I (I I I I I I I I I I i II I I I ! i I i ! I \ i I I I I I I I i : | I : | 1 j | | f | | 
al21 HRSRMLSOELSRLYAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHSYSVQLADLPLL 

70 SO 90 100 110 120 
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ml21-l . pep METQLYIGIMSGTSMDGADAVLIRMDGGKWLGAEGHAFTPYPGRLRRQLLDLQDTGADEL 
I I I I I ! I I I ) I I I I i I I I I I I I : I 1 I I I I I I I ! I I I ! I I 1 I I ( i I t : 1 II 1 t I I I : ! I I 
g 12 1 METQLY I GIMSGTSMDGADAVLVRMDGGKWLGAEGHAFTP YPDRLRRKLLDLQDTGTDEL 

10 20 30 40 50 60 



70 80 90 100 110 120 

.t.121-1 . pep HRSRILSQEL5RLVAQTAAELLCSQNLAPSDITALGCHGQTVRHAPEHGYSI0LADLPLL 
j I M : i 1 I i M i i I 1) 1 1 I 1 1 i i 1 ! I i I ) i i i M i i t i I I i 1 I t I i 1 I 1 I I I I I I I I I 1 
gl21 HRSRMLSOELSRLYAQTAAELLCSONLAPCDITALGCHGQTVRHAPEHGYSIOLADLPLL 

70 8G 90 100 110 120 



130 140 150 160 170 180 

ml21-l -pep AERTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDNRETRAVLNIGGIANISVLPPDA 
! I I I I I I I I I I I I I I I I I I I I I I I I J I M I I I I I I I : I II I : I I I I I M I I I ! I I 1 I 
gl21 AELTRIFTVGDFRSRDLAAGGQGAPLVPAFHEALFRDDRETRWLNIGGIANISVLPPGA 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml21-l.pep PAFGFDTGPGNMLMDAWTQAHWQLPYDKKGAKAAQGNILPOLLDRLLAHPYFAQPHPKST 
I I I I I I I I I I I I I I I [[ j I I < I i I I I I ! II I I I ! M I I I I I I I I j I ! I ! I I : I I I I I I I 
gl21 PAFGFDTGPGMMLMDAWTQAHWQLPYDKNGAKAAOGNILPQLLGRLLAHPYFSOPHPKST 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml21-l .pep GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVCDAVSHAAADARQMYICGGGIRNPV 
I S i i I I I M I I I I I II I ! I I t I I I I I I ! I I I ! I ! I I i 1 II I I I M I I I I i 1 I I M i I I i 

qlZi GRELFALNWLETYLDGGENRYDVLRTLSRFTAQTVWDAVSHAAADARQMYICGGGIRNPV 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml2 1 - 1 . pep LMADLAECFGTRVSLHSTADLNLDPQWVEAAXFAWLAACWINRI PGS PHKATGA3KPCI L 
I I I i I I 1 1 I i I i I I I I ] ) I : ) I II I M I i I i I ) 1 I I I I I I I )! I I I I I 1 I I I I I II I I I 
gl21 LMADLAECFGTRVSLHSTAELHLDPQWVEAAAFAWLAACW1NRIPGSPHKATGASKPCIL 

310 320 330 340 350 360 

ml21-l.pep XAGYYYX 
i I I I I I 

gl21 GAGYYYX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 443>: 

al21-l . seq 

1 ATGGAAACAC AGCTTTACAT CGGCATCATG TCGGGAACCA GCATGGACGG 

51 GGCGGATGCC GTACTGATAC GGATGGACGG CGGCAAATGG CTGGGCGCGG 

101 AAGGGCACGC CTTTACCCCC TACCCCGGCA GGTTACGCCG CAAATTGCTG 

151 GATTTGCAGG ACACAGGCGC GGACGAACTG CACCGCAGCA GGATGTTGTC 

201 GCAAGAACTC AGCCGCCTGT ACGCGCAAAC CGCCGCCGAA CTGCTGTGCA 

251 GTCAAAACCT CGCGCCGTCC GACATTACCG CCCTCGGCTG CCACGGGCAA 

301 ACCGTCAGAC ACGCGCCGGA ACACAGTTAC AGCGTACAGC TTGCCGATTT 

351 GCCGCTGCTG GCGGAACGGA CTCAGATTTT TACCGTCGGC GACTTCCGCA 

401 GCCGCGACCT TGCGGCCGGC GGACAAGGCG CGCCGCTCGT CCCCGCCTTT 

451 CACGAAGCCC TGTTCCGCGA CGACAGGGAA ACACGCGCGG TACTGAACAT 

501 CGGCGGGATT GCCAACATCA GCGTACTCCC CCCCGACGCA CCCGCCTTCG 

551 GCTTCGACAC AGGACCGGGC AATATGCTGA TGGACGCGTG GATGCAGGCA 

601 CACTGGCAGC TTCCTTACGA CAAAAACGGT GCAAAGGCGG CACAAGGCAA 

651 CATATTGCCG CAACTGCTCG ACAGGCTGCT CGCCCACCCG TATTTCGCAC 

701 AACCCCACCC TAAAAGCACG GGGCGCGAAC TGTTTGCCCT AAATTGGCTC 

751 GAAACCTACC TTGACGGCGG CGAAAACCGA TACGACGTAT TGCGGACGCT 

801 TTCCCGATTC ACCGCGCAAA CCGTTTTCGA CGCCGTCTCA CACGCAGCGG 

851 CAGATGCCCG TCAAATGTAC ATTTGCGGCG GCGGCATCCG CAATCCTGTT 

901 TTAATGGCGG ATTTGGCAGA ATGTTTCGGC ACACGCGTTT CCCTGCACAG 

951 CACCGCCGAA CTGAACCTCG ATCCGCAATG GGTAGAAGCC GCCGCGTTCG 

1001 CATGGATGGC GGCGTGTTGG GTCAACCGCA TTCCCGGTAG TCCGCACAAA 

1051 GCAACCGGCG CATCCAAACC GTGTATTCTG GGCGCGGGAT ATTATTATTG 

1101 A 



This corresponds to the amino acid sequence <SEQ ID 444; ORF 121-1 .a>: 

al21-l .pep 

1 METQLYIGIM SGTSMDGADA VLIRMDGGKW LGAEGHAFTP YPGRLRRKLL 

51 DLQDTGADEL HRSRMLSOEL SRLYAQTAAE LLCSQNLAPS DITALGCHGQ 

101 TVRHAPEHSY SVQLADLPLL AERTQIFTVG DFRSRDLAAG GQGAPLVpAF 

151 HEALFRDDRE TRAVLM1GGI ANISVLPPDA PAFGFDTGPG NMLMDAWMQA 
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g!22 . pep 

1 MALLSIRKLH KQYGSVTAIQ SLDLDLEKGE VIVLLGPSGC GKSTLLRCVN 

SI GLEPHQGGSI VMDGVGEFGK DVSWQTARQK VGMVFQSNEL FAHMTVIENI 

101 FLGPVKEQNR DRAEAEAQAG KLLERVGLLD RKNAYPRELS GGQKQRIAIV 

151 RALCLNPEVI LLDEITAALD PEMVREVLEV VLELAREGMS MLIVTHEMGF 

201 ARKVADRIVF MDKGGIVESS DPETFFSAPK SERARQFLAG MDY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 447>: 

ml22 . seq 

1 GTTGTCATGA TTAAAATCCG 

51 TATTTTGCGC GGCATCGATT 

101 TCCTCGCGcC TTCCGGCTCA 

151 GCGTTGGAAA TGCCCGAAGA 

201 GCTGAAAATC GATTTTTCTA 

2 SI TGCGCCGCAA ATCAkGCATG 

3 01 AAAACCGCCT TGGAAAACGT 

3 51 GCCTGCCGCC CAAGCGCGCG 

4 01 GCTTGGGCGA CAAAGTGGAT 
4 51 CAGCAGCGCG TCGGCATTGC 
501 GCTGTTTGAC GAACCGACTT 
551 TTTTGGATmC CATGAAGGAA 
601 GTTACGCATG AAATCAAGTT 
651 GATGGACrGC GGCGTTATTG 
701 ACCACCCCAA ACACGAACGG 
751 ACCAAGATTT GA 

This corresponds to the amino acid sequence <SEQ ID 448; ORF 122>: 

ml 2 2 -pep 

1 WMIKIRNIH KTFGENTILR GIDLDVCKGQ WVILGPSGS GKTTFLRCLN 
51 ALEMPEDGQI EFDNERPLKI DFSKKPSKHD ILALRRKSXM VFQQYNLFPH 
101 KTALENVMEG PVAVQGKPAA QAREEALKLL EKVGLGDKVD LYPYQLSGGQ 
151 QC'VGIARAL AIQPELMLFD EPTSALDPEL VQDVLDXMKE LAQEGWTMW 
201 VTHEIKFALE VATTXWMDX GVIVEQGSPQ DLFDHPKHER TRRFLSQIQS 
251 TKI * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 122 shows 47.2% identity over a 246 aa overlap with a predicted ORF (ORF 122.ng) 
from N. gonorrhoeae: 

ml22/gl22 

10 20 30 40 50 60 

WMI KIRN I HKT FGENT I LRGIDLDVCKGQVW I LGPSGSGKTTFLRCLNALEMP EDGQ I 
:::::||::|| :| | : : : : | | | : | | : | : | : | | | | || I : I : I M = I : ! I = ht 
MALLSIRKLHKQYGSVTAIQSLDLDLEKGEVIVLLGPSGCGKSTLLRCWGLEPHQGGSI 
10 20 30 40 50 60 

70 80 90 100 110 120 

EFDNERPLKIDFSKKPSKHDIIJVLRRKSXMVFQQYNLFPHKTALEMVMEGPVAVQGKPAA 

••!•• •• I 1 •■ : hi llih --11 i h- ••!!■- II! I- I 

VMDGVGEFGKDVSWQTA RQKVGMVFQSNELF AHMTVI EN I FLGPVKEQNRDRA 

70 80 90 100 110 

130 140 150 160 170 180 

QAREEALKXiLEKVGLGDKVDLYPYQLSGGQC^RVGIARALAIOPELMLFDEPTSALDPEL 
:h h I llhll I h : II : M I I h I h" : h I I I : = I h' : h I I 1 = 1(111 = 
EAEAQAGKLLERVGLLDRKNAYPRELSGGQKQRIAIVPJUiCLNPEVILLDEITAALDPEM 
120 130 140 ISO 160 170 

190 200 210 220 230 240 

VQDVLDXMKEIAOEGWTM\AAn , HEIKFALEVATTXVVMDXGVIVEOGSPODLFDHPKHER 
l-lh : llhll : h : II I h II hi I II I I I h" : : h : h I I I I 
VREVLEWLELAREGMSMLIVTHEMGFARKVADRIVFMDKGGIVESSDPETFFSAPKSER 



CAATATCCAT AAGACCTTTG GCGAAAACAC 
TGGATGTGTG CAAAGGGCAG GTGGTCGTCA 
GGCAAAACGA CGTTTCTGCG ATGCCTAAAC 
CGGACAAATC GAGTTCGACA ACGAGCGACC 
AAAAACCAAG CAAACACGAT ATTTTGGCAC 
GTGTTTCAAC AATACAAyCT CTTTCCGCAC 
AATGGAAGGA CCGGTTGCCG TACAgGGCAA 
AAGAGGCTCT GAAACTGCTG GAAAAAGTCG 
TTGTATCCCT ACCAGCTTTC CGGCGGTCAG 
CCGCGCATTG GCGATTCAGC CTGAACTGAT 
CCGCGCTCGA TCCTGAATTG GTGCAAGATG 
TTGGCGCAAG AAGGCTGGAC CATGGTTGTC 
CGCCTTAGAA GTGGCAACCA CCGwCGTCGT 
TCGAACAAGG CAGCCCGCAA GATTTGTTCG 
ACGCGGAGAT TTTTAAGCCA AATCCAATCT 



ml22 -pep 
gl22 

ml22 -pep 
gl22 

ml22 .pep 
gl22 

m!22 . pep 
g!22 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 45 1>: 

gl22-l . seq 

I ATGATTAAAA TCCGCAATAT CCATAAGACC TTTGGCGAAA ACACCATTTT 

51 GCGCGGCATC GATTTGGATG TGGGCAAAGG GCAGGTGGTC GTCATCCTCG 

101 GGCCTTCCGG CTCGGGTAAA ACAACATTTC TGCGCTGCCT AAACGCGTTG 

151 GAAATGCCCG AAGACGGACA AATCGAGTTC GACAACGCGC GGCCGTTACG 

201 CATTGATTTT TCCAAAAAAA CAAGCAAACA CGATATTTTG GCACTGCGCC 

251 GCAAGTCCGG AATGGTATTC CAACAATACA ACCTCTTCCC GCATAAAACC 

301 GTGTTGGAAA ACGTGATGGA AGGGCCGGTT GCCGTACAGG GCAAGCCTGC 

351 CGCCCAAGCG CGCGAAGAGG CTTTGAAACT GCTGGAAAAA GTCGGCTTGG 

401 GCGATAAAGT GGATTTGTAT CCCTACCAGC TTTCCGGCGG TCAGCAGCAG 

4 51 CGTGTCGGTA TCGCCCGCGC ACTGGCGATT CAGCCTGAAT TGATGCTGTT 

501 TGACGAACCC ACTTCCGCGC TGGACCCCGA GTTGGTGCAA GACGTGTTGG 

551 ACGCCATGAA GGAATTGGCG CGGGAAGGTT GGACGATGGT CGTCGTTACC 

601 CACGAAATCA AGTTCACGCT GGAAGTTGCC ACGAACGTCG TCGTGATGGA 

651 CGGCGGCGTT ATCGTAGAGC AGGGCAGCCC GAAAGAGTTG TTCGACCACC 

701 TCAAACACGA ACGGACGCGG AGATTTTTAA GCCAAATCCA ATCTGCCAAG 

751 ATTTGA 

This corresponds to the amino acid sequence <SEQ ID 452; ORF 122-1 .ng>: 

gl22-l.pep 

1 MIKIRNIHKT FGENTILRGI DLDVGKGQVV VILGPSGSGK TTFLRCLNAL 

51 EMPEDGQIEF DNARPLRIDF SKKTSKHDIL ALRRKSGMVF QQYNLFPHKT 

101 VLENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 

151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLDAMKELA REGWTMVWT 

201 HEIKFTLEVA TNVVVMDGGV IVEQGSPKEL FDHLKHERTR RFLSQIQSAK 

251 I * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 453>: 

ml22-l .seq 

1 ATGATTAAAA TCCGCAATAT 

51 GCGCGGCATC GATTTGGATG 

101 GGCCTTCCGG CTCAGGCAAA 

151 GAAATGCCCG AAGACGGACA 

201 AATCGATTTT TCTAAAAAAC 

2 51 GCAAATCAGG CATGGTGTTT 

301 GCCTTGGAAA ACGTAATGGA 

351 CGCCCAAGCG CGCGAAGAGG 

401 GCGACAAAGT GGATTTGTAT 

451 CGCGTCGGCA TTGCCCGCGC 

501 TGACGAACCG ACTTCCGCGC 

551 ATACCATGAA GGAATTGGCG 

601 CATGAAATCA AGTTCGCCTT 

651 CGGCGGCGTT ATTGTCGAAC 

701 CCAAACACGA ACGGACGCGG 

7 51 ATTTGA 

This corresponds to the amino acid sequence <SEQ ID 454; ORF 122-1>: 

ml22-l -pep 

1 MIKIRNIHKT FGENTILRGI DLDVCKGQVV VILGPSGSGK TTFLRCLNAL 
51 EMPEDGQIEF DNERPLKIDF SKKPSKHDIL ALRRKSGMVF QQYNLFPHKT 
101 ALENVMEGPV AVQGKPAAQA REEALKLLEK VGLGDKVDLY PYQLSGGQQQ 
151 RVGIARALAI QPELMLFDEP TSALDPELVQ DVLDTMKELA QEGWTMVVVT 
201 HEIKFALEVA TTVVVMDGGV I VEQGSPQDL FDHPKHERTR RFLSQIQSTK 
251 I* 

m!22-l/gl22-l 94.3% identity in 251 <aa overlap 

10 20 30 40 50 60 

ml 22-1 .pep MI K I RMIHKTFGENT I LRG I DLDVCKGQVVVI LGPSGSGKTTFLRCLNALEMPEDGQI EF 
i I ! I I i I I I ! I U I I I i I i II 1! I | | i f | I I I | I | M I I I I I I I I I I i I I j j ! I I I I I I 
g 122-1 MIKIRNIHKTFGENTILRGIDLDVGKGQVVVILGPSGSGKTTFLRCLNALEMPEDGQIEF 



CCATAAGACC TTTGGCGAAA ACACTATTTT 
TGTGCAAAGG GCAGGTGGTC GTCATCCTCG 
ACGACGTTTC TGCGATGCCT AAACGCGTTG 
AATCGAGTTC GACAACGAGC GACCGCTGAA 
CAAGCAAACA CGATATTTTG GCACTGCGCC 
CAACAATACA ACCTCTTTCC GCACAAAACC 
AGGACCGGTT GCCGTACAGG GCAAGCCTGC 
CTCTGAAACT GCTGGAAAAA GTCGGCTTGG 
CCCTACCAGC TTTCCGGCGG TCAGCAGCAG 
ATTGGCGATT CAGCCTGAAC TGATGCTGTT 
TCGATCCTGA ATTGGTGCAA GATGTTTTGG 
CAAGAAGGCT GGACCATGGT TGTCGTTACG 
AGAAGTGGCA ACCACCGTCG TCGTGATGGA 
AAGGCAGCCC GCAAGATTTG TTCGACCACC 
AGATTTTTAA GCCAAATCCA ATCTACCAAG 
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130 140 ISO 160 170 180 

ISO 200 210 220 230 240 

a 122- 1 .pep DVLNAMKELAREGWTMVWTHEIKFALEVATTVVVMDGGVI veqgspkelfdhpkhertr 

I I | : : I I I I I : I I I I I I t I I I ! I I I I I I I ! I I II I I I I i I J i f 1 I I I : : I I 1 I I I I i i \ I 
ml 22-1 DVLDTMKELA0EGWTMVVVTHEIKFALEVATTVVVMDGGVI7EQGSP0DLFDKPKHERTR 

190 200 210 220 230 240 

250 

al22-l.pep RFLSQIQSTKIX 

II I M I I I I f I I 
ml22-l RFLSQIQSTKIX 

250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 457>: 

g!25 . seq 

1 ATGTCGGGCA ATGCCTCCTC TCCTTCATCT TCCGCCGCCA TCGGGCTGGT 

51 TTGGTTCGGC GCGGCGGTAT CGATTGCCGA AATCAGCACG GGTACGCTGC 

101 TCGCCCCCTT GGGCTGGCAG CGCCJTCTGG CGGCCCTGCT TTTGGGTCAT 

151 GCCGTCGGCG GCGCGCTGTT TTTTGCGGCG GCGTATATCG GCGCACTGAC 

201 CGGACGCAGC TCGATGGAAA GTGTGCGCCT _:rOTTCOGC AAATGCGGTT 

2 51 CAGTGCTGTT TTCCGTGGCG AATATGCTGC AACTGGCCGG CTGGACGGCG 

301 GTGATGATTT ACGTCGGCGC AacggTCAGC TCCGCTTTGG GCAAAGTGTT 

351 GTGGGACggc gaATCCTTTG TCTGGTGGGC ATTGGCAAAC GGCGCACTGA 

401 TCGTGCTGTG GCTGGTTTTC GGCGCACGCA GAACGGGCGG GCTGAAAACC 

4 51 GTTTCGATGC TGCTGATGCT GCTTGCCGTG TTGTGGTTGA GCGTCGAAGT 

501 GTTCGCTTCG TCCGGCACAA ACGCCGCGCC CGCCGTTTCA GACGGCATGA 

551 CCTTCGGAAC GGCAGTCGAA CTGTCCGCCG TCATGCCGCT TTCCTGGCTG 

601 CCGCTGGCCG CCGACTACAC GCGCCAAGCA CGCCGCCCGT TTGCGGCAAC 

6 51 CCTGACGGCA ACGCTCGCCT ATACGCTGAC GGGCTGCTGG ATGTATGCCT 

701 TGGGTTTGGC GGCGGCTCTG TTTACCGGAG AAACCGACGT GGCGAAAATC 

751 CTGTTGGGCG CGGGCTTGGG CATAACGGGC ATTCTGGCAG TCGTCCTCTC 

801 CACCGTTACC ACAACGTTTC TCGATACCTA TTCCGCCGGC GCGAGTGCGA 

851 ACAACATTTC CGCGCGTTTT GCGGAAATAC CCGTCGCTGT CGGCGTTACC 

901 CTGatccgca ccgtgcttgc cgtcatgctg cccgttaccg aatataaaaa 

951 cttcctgctg cttatccgct cggtatttgg gccgatggcg ggtggttctg 

1001 attgccgaCT TTTttgtctt AAAACGGCGT GA 

This corresponds to the amino acid sequence <SEQ ID 458; ORF 125.ng>: 



The following partial DNA sequence was identified in M meningitidis <SEQ ID 459>: 

ml25 . seq 

1 ATGTCGGGCA ATGCCTCCTC TCCTTCATCT TCCTCCGCCA TCGGGCTGAT 

51 TTGGTTCGGC GCGGCGGTAT CGATTGCCGA AATCAGCACG GGTACGCTGC 

101 TTGCGCCTTT GGGCTGGCAG CGCGGTCTGG CGGCTCTACT TTTGGGTCAT 

151 GCCGTCGGCG GCGCGCTGTT TTTTGCGGCG GCGTATATCG GCGCACTGAC 

201 CGGACGCAGC TCGATGGAAA GCGTGCGCCT GTCGTTCGGC AAACGCGGTT 

251 CAGTGCTGTT TTCCGTGGCG AATATGCTGC AACTGGCCGG CTGGACGGCG 

301 GTGATGATTT ACGCCGGCGC AACGGTCAGC TCCGCTTTGG GCAAAGTGTT 

3 51 GTGGGACGGC GAATCTTTTG TCTGGTGGGC ATTGGCAAAC GGCGCGCTGA 
401 TTGTGCTGTG GCTGGTTTTC GGCGCACGCA AAACAGGCGG GCTGAAAACC 

4 51 GTTTCGATGC TGCTGATGCT GTTGGCGGTT CTGTGGCTGA GTGCCGAAGT 
501 CTTTTCCACG GCAGGCAGCA CCGCCGCACA GGTTTCAGAC GGCATGAGTT 
551 TCGGAACGGC AGTCGAGCTG TCCGCCGTGA TGCCGCTTTC CTGGCTGCCG 
6 01 CTTGCCGCCG ACTACACGCG CCACGCGCGC CGCCCGTTTG CGGCAACCCT 
651 GACGGCAACG CTCGCCTACA CGCTGACCGG CTGCTGGATG TATGCCTTGG 
701 GTTTGGCAGC GGCGTTGTTC ACCGGAGAAA CCGACGTGGC AAAAATCCTG 
751 CTGGGCGCAr GTTTGgGTGC GGCAGGCATT TTGGCGGTCG TCCTCTCCAC 
8 01 CGTTACCACA ACGTTTCTCG ATGCCTATTC CGCCGGCGCG AGTGCGAACA 
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151 GCCGTCGGCG GCGCGCTGTT TTTTGCGGCG GCGTATATCG GCGCACTGAC 

201 CGGACGCAGC TCGATGGAAA GCGTGCGCCT GTCGTTCGGC AAACGCGGTT 

251 CAGTGCTGTT TTCCGTGGCG AATATGCTGC AACTGGCCGG CTGGACGGCG 

301 GTGATGATTT ACGCCGGCGC AACGGTCAGC TCCGCTTTGG GCAAAGTGTT 

351 GTGGGACGGC GAATCTTTTG TCTGGTGGGC ATTGGCAAAC GGCGCGCTGA 

401 TTGTGCTGTG GCTGGTTTTC GGCGCACGCA AAACAGGCGG GCTGAAAACC 

4 51 GTTTCGATGC TGCTGATGCT GTTGGCGGTT CTGTGGCTGA GTGCCGAAGT 

501 CTTTTCCACG GCAGGCAGGA CCGCCGCACA GGTTTCAGAC GGCATGAGTT 

551 TCGGAACGGC AGTCGAGCTG TCCGCCGTGA TGCCGCTTTC TTGGCTGCCG 

601 CTGGCCGCCG ACTACACGCG CCACGCGCGC CGCCCGTTTG CGGCAACCCT 

651 GACGGCAACG CTCGCCTACA CGCTGACCGG CTGCTGGATG TATGCCTTGG 

701 GTTTGGCAGC GGCGTTGTTC ACCGGAGAAA CCGACGTGGC AAAAATCCTG 

7 51 CTGGGCGCAG GTTTGGGTGC GGCAGGCATT TTGGCGGTCG TCCTGTCGAC 

801 CGTTACCACC ACTTTTCTCG ATGCCTACTC CGCCGGCGTA AGTGCCAACA 

851 ATATTTCCGC CAAACTTTCG GAAATACCCA TCGCCGTTGC CGTCGCCGTT 

901 GTCGGCACAC TGCTTGCCGT CCTCCTGCCC GTTACCGAAT ATGAAAACTT 

951 CCTGCTGCTT ATCGGCTCGG TATTTGCGCC GATGGCG. GC GGTTTTGATT 

1001 GCCGACTTTT TCGTCTTGAA ACGGCGTGA 

corresponds to the amino acid sequence <SEQ ID 462; ORF 125. a>: 

al25.pep 

1 MSGNASSPSS SAAIGLIWFG AAVSIAEIST GTLLAFLGWQ RGLAALLLGH 

51 AVGG ALFFAA AYIGALTGRS SMESVRLSFG KRGSVLFSVA NMLQLAGWTA 

101 VMIYAGATVS SALGKVLWDG ES FVWWALAN GALIVLWLV F GARKTGGLKT 

151 VSMLLMLLAV LWLSAEV FST AGSTAAQVSD GMSFGTAVEL SAVMPLSWLP 

201 LAADYTRHAR RPFAATLTAT LAYTLTGCWM YALGLAAALF TGETDVAKIL 

251 LGAGLGAAGI LAWL STVTT TFLDAYSAGV SANNISAKLS E IPIAVAVAV 

301 VGTLLAVLLP VTEYEN FLLL IGSVFAPMAX GrD CRLFRLE TA* 



125/al25 95.6% identity in 342 aa overlap 

10 20 30 40 50 60 

ml2 5, pep MSGNASSPSSSSAIGLIWFGAAVSIAEISTGTLLAPLGWQRGLAALLLGHAVGGALFFAA 
I I if I I i ( i I I : { I I I I I ! 1 ! I I I I I I i I I I i i I It I I I I I I ( I I I I I I I I II ! I II I I i 
al25 MSGNASSPSSSAAIGLIWFGAAVSIAEISTGTLLAPLGWQRGLAALLLGHAVGGALFFAA 
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m!25 .pep AYIGALTGRSSMESVRLSFGKRGSVLFSVANMLQLAGWTAVMIYAGATVSSALGKVLWDG 
I I i M I I ! I I I I I I I I I ( i I I I I I 1 I i I i I ! 1 I I i M I I 1 I i I I I II 1 I ! I I i I M { i 1 I 
al25 AYIGALTGRS3MESVRLSFGKRGSVLFSVANML0LAGWTAVMIYAGATV5SALGKVLWDG 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml25.pep ESFWWALANGALIVLWLVFGARKTGGLKTVSMLLMLLAVLWLSAEVFSTAGSTAAQVSD 
t i I M I II 1 i I I I M ! 1 i 11 II I It ! I M I I I I I I I t I I 11 t S II I I I I It 1 II I t I I ! I 
a 1 2 5 ES FWWALANGALIVLWLVFGARKTGGLKTV SMLLMLLAVLWLSAE VFSTAGSTAAQVS D 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml25 . pep GMSFGTAVEL5AVMPLSWLPLAADYTRHARRPFAAT LTATLAYTLTGCWMYALGLAAALF 

I I I I I I I I I I I M I I I I I ! I i I I I I I I 1 I I I I I I I I M ) I I I I I I I M 1 I I I I I I I I M I 
al2 5 GMSFGTAVELSAVMPLSWLPLAADYTRHARRPFAATLTATLAYTLTGCWMYALGLAAALF 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml25.pep TG E T DVAK I L LGAX LG AAG I LAVVLSTVTTT FL DAY SAGAS ANN I S AR FAE T PV A VX V T L 

I I I I I I I I I I I M I I I I ! I I I I I I I I I I I I i I I I I f I I : I I I I I I I : : : I i : I I I : : 
al25 TGETDVAK1LLGAGLGAAGILAVVLSTVTTTFLDAYSAGVSANNISAKLSEIPIAVAVAV 

250 260 270 280 290 300 

310 320 330 340 

ml25 . pep IGTVLAVMLPVTEYENFLLLIGSVFAFMAGGFDCRLFRLETAX 
: 1 I : I i I : I ! I I I I I I I M I I t I f I I I I ! ! I I i ! I I ! ! ! i M 
al2 5 VGTLLAVLLPVTEYENFLLLIGSVFAPMAXGFDCRLFRLETAX 
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101 DWIKLELIGD DDTLQPDVFQ LVEAAEILIK DGFJCVLPYCT EDLIACRRLL 

151 DAGCQALMPW AAPIGTGLGA VHAYALNVLR ERLPDTPLI I DAGLGLPSQA 

2 01 AQVMEWGFDG VLLNTAVSRS GDPVNMARAF ALAVESGRLA FEAGPVEARD 

251 KAQASTPTVG QPFWHSAEY* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N, gonorrhoeae 

ORF 126 shows 95.9% identity over a 269 aa overlap with a predicted ORF (ORF 126.ng) 
from N. gonorrhoeae: 

ml26/gl26 

10 20 30 40 

HYTKEPIMLTLYGETFPSRLLLGTAAYPTPEILKQSIQTAQ 

-III I t I I I I ! I i t I I ! I I I I I I I I i I i i i i I I I : : | I : 
MPSETPKARRRLSDGI ASDNHTKESIMLTLYGETFPSRLLLGTAAYPTPEILKOSVRTAR 
10 20 30 40 50 60 



m!26 -pep 
g!26 



50 60 70 80 90 100 

ml26 . pep PAMITVSLRRAGSGGEAHGQGFWSLLQETGVPVLPNTAGCQSVQEAVTTAQMAREVFETD 

liMllillh! llllllllllMIIIMIIiillMllilMlllllillNIIIlM 

g 1 2 6 P AMI TVSLRRTGCGGEAHGQGFWSLLQETGVP VLPNTAGCQSVQE AVTTAQMAREVFETD 

70 80 90 100 110 120 



110 120 130 140 150 160 

ml26 . pep WIKLELIGDDDTLQPDVFQLVEAAEILIKDGFKVLPYCTEDL1ACRRLLDAGCQALMPWA 

llllllllllllllMIMIIIlllMIIIIIIIIIMIIillllllllllllillllM 

g 1 2 6 WI KLELI GDDDTLQPDVFQLVEAAEI LI KDGFKVLPYCTEDLI ACRRLLDAGCQALMPWA 

130 140 150 160 170 180 



170 180 190 200 210 220 

ml26 .pep APIGTGLGAVHAYALNVLRERLPDTPLIIDAGLGLPSQAAQVMEWGFDGVLLNTAVSRSG 

MIIHIIlllllll-lilllillllllllllllllMlllilMIIMIiliilllll 

g 1 2 6 AP IGTGLGAVHAYALKI LRERLPDTPL 1 1 DAGLGLPSQAAQVMEWGFDGVLLNTAVSRSG 

190 200 210 220 230 240 



230 240 250 260 270 

ml 2 6 . pep DPVNMARAFALAVESGRLAFEAGPVEARDKAQASTPTVGQPFWHSAEYX 

IIIIIIIIIIIIMIIIIIIIIIMIII lllllllllllllliIMM 

gl 2 6 DPVNMARAFALAVESGRLAFEAGPVEARTKAQASTPTVGQPFWHSAEYX 
250 260 270 280 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 467>; 

al26 . seq 

1 TTGTTAATCC ACTATACAAA GGAACCCATT ATGCTCACCC TGTACAGCGA 

51 AACTTTCCCT TCGCGGCTGC TGCTCGGCAC AGCCGCCTAC CCGACCCCTG 

101 AAATCCTCAA ACAATCCGTC CGAACCGCCC GGCCCGCGAT GATTACCGTC 

151 TCGCTGCGCC GCGCGGGATG CGGCGGCGAG GCGCACGGTC AGGGGTTTTG 

201 GTCGCTGCTT CAAGAAACCG GCGTTCCCGT CCTGCCGAAC ACGGCAGGCT 

251 GCCAAAGCGT GCAGGAAGCG GTAACGACGG CGCAAATGGC GCGCGAAGTG 

301 TTTGAAACCG ATTGGATTAA ACTCGAACTC ATCGGCGACG ACGACACCTT 

351 GCAGCCGGAT GTGTTCCAAC TTGTCGAAGC GGCGGAAATC CTGATTAAAG 

401 ACGGCTTCAA AGTGCTGCCT TATTGCACCG AAGACCTGAT TGCCTGCCGC 

4 51 CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG ATGCCGTGGG CGGCCCCGAT 

501 ' CGGCACGGGT TTGGGCGCGG TTCACGCCTA CGCGTTGAAC GTCCTGCGCG 

551 AACGCCTGCC CGACACGCCG C™ 3ATTATCG ACGCGGGCTT GGGTTTGCCC 

601 TCACAGGCGG CACAAGTGAT GGAATGGGGC TTTGACGGCG TGCTTTTGAA 

651 TACTGCCGTT TCCCGCAGCG GCGATCCGGT CAATATGGCA CGCGCCTTCG 

701 CACTCGCCGT CGAATCCGGA CGGCTGGCAT TTGAAGCCGG ACCGGTCGAA 

7 51 GCACGCGACA AAGCGCAAGC CAGCACGCCG ACAGTCGGAC AACCGTTTTG 

801 GCATTCGGCG GAATATTGA 
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51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAFEV FETDWIKLEL 

101 IGDODTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAi 

151 MPWAAPIGTG LGAVHAYALK ILRERLPDTP LIIDAGLGLP SOAAQVMEWG 

201 F DGVLLNTAV SRSGDPVNMA RAF ALAVESG RLAFEAGPVE ARTKAQASTP 

251 TVGQPFWHSA EY* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 47 1>: 

ml 2 6 - 1 . seq 

1 ATGCTCACCC TATACGGCGA AACTTTCCCC TCGCGGCTGC TGCTCGGCAC 

SI GGCTGCCTAC CCGACCCCCG AAATCCTCAA ACAATCCATC CAAACCGCCC 

101 AGCCTGCGAT GATTACCGTC TCGCTGCGCC GCGCGGGAAG CGGCGGCGAG 

151 GCGCACGGTC AGGGGTTTTG GTCGCTGCTT CAAGAAACCG GCGTTCCCGT 

201 CCTGCCGAAC ACGGCAGGCT GCCAAAGCGT GCAGGAAGCG GTAACGACGG 

251 CGCAAATGGC GCGCGAAGTG TTTGAAACCG ATTGGATAAA ATTGGAACTC 

301 ATCGGAGATG ACGACACCTT GCAGCCGGAT GTGTTCCAGC TTGTCGAAGC 

351 GGCGGAAATC CTGATTAAAG ACGGCTTCAA AGTGCTGCCT TATTGCACCG 

401 AAGACCTGAT TGCCTGCCGC CGCCTGCTCG ACGCGGGCTG TCAGGCGTTG 

4 51 ATGCCGTGGG CGGCCCCGAT CGGCACGGGT TTGGGCGCGG TTCACGCCTA 

501 CGCGTTGAAC GTCCTGCGCG AACGCCTGCC CGACACGCCG CTGATTATCG 

551 ACGCGGGCTT GGGTTTGCCC TCACAGGCGG CACAAGTGAT GGAATGGGGC 

601 TTTGACGGCG TGCTTTTGAA TACTGCCGTT TCCCGCAGCG GCGATCCGGT 

651 CAATATGGCA CGCGCCTTCG CACTCGCCGT CGAATCCGGA CGGCTGGCAT 

701 TTGAAGCCGG ACCGGTCGAA GCACGCGACA AAGCGCAAGC CAGCACGCCG 

7 51 ACAGTCGGAC AACCGTTTTG GCATTCGGCG GAATATTGA 



This corresponds to the amino acid sequence <SEQ ID 5; ORF 126-1>: 

ml26-l .pep 

1 MLTLYGETFP SRLLLGTAAY PTPEILKQSI QTAQPAMITV SLRRAGSGGE 

51 AHGQGFWSLL QETGVPVLPN TAGCQSVQEA VTTAQMAREV FETDWIKLEL 

101 IGDDDTLQPD VFQLVEAAEI LIKDGFKVLP YCTEDLIACR RLLDAGCQAL 

151 MPWAAPIGTG LGAVHAYALN VLRERLFDTP LIIDAGLGLP SQAAQVMEWG 

201 F DGVLLNTAV SRSGDPVNMA RAFALAVESG RLAFEAGPVE ARDKAGASTP 

251 TVGQPFWHSA EY* 



ml2€-l/gl26-l 96.9% identity in 262 aa overlap 

10 20 30 40 50 60 

ml 2 6-1 .pep MLTLYGETFPSRLLLGTAAYPTPE1LKQSIQTAQPAMITVSLRRAGSGGEAHGOGFWSLL 
I I! t I i I I I I I I I I I ! I I I I I I I I I I I I I : : I I : I f I I [ I i I I i : I ! I II (I I I I t ! I t 
gl2 6-l MLTLYGETFPSRLLLGTAAYPTPEILKQSVRTARPAMITVSLRRTGCGGEAHGQGFWSLL 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 2 6-1 .pep QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI 
I I I I I I I i i I I I I I I I I I i I I I I I I I I I M If I I i 1 I I i I I I I ! I I I I i I I I I I II I I I I 
gl2 6-l QETGVPVLPNTAGCQSVQEAVTTAQMAREVFETDWIKLELIGDDDTLQPDVFQLVEAAEI 
70 80 90 100 110 120 



130 140 150 160 170 180 

ml 2 6-1 .pep LIKDGFKVLPYCTEDLIACRRLLDAGCQALMPWAAPIGTGLGAVHAYALNVLRERLPDTP 
I I I I I I t I I [ I I I I M I I II I t M I III I I I I I I I I I I I I I I I I I I I I I : : ( i I I I I M 1 
g 1 2 6- 1 LIKDG FKVLP YCTEDLI ACRRLLDAGCQALMPWAAP IGTGLGAVKAYALKI LRERLPDTP 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 2 6- 1 . pep LI I DAGLGLPSQAAQVMEWGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 
I I i I I i i I M I I II I I I ! I I I 1 I I II I I I I I I I I I I i I i I I I I I I I I i I I i I i I I M I I I 
gl 2 6 - 1 LI I DAGLGLPSQAAQVMEVJGFDGVLLNTAVSRSGDPVNMARAFALAVESGRLAFEAGPVE 

190 200 210 220 230 240 



250 260 
ml 2 6-1. pep ARDKAQASTPTVGQPFWHSAEYX 

ii n 1 1 1 1 ii 1 1 1 1 1 1 m 1 1 1 1 

g 126-1 ARTKAQAST PTVGQP FWHS AEYX 

250 260 



The following partial DNA sequence was identified in /V. meningitidis <SEQ ID 473>: 

al26-l.seq 

1 ATGCTCACCC TGTACAGCGA AACTTTCCCT TCGCGGCTGC TGCTCGGCAC 

51 AGCCGCCTAC CCGACCCCTG AAATCCTCAA ACAATCCGTC CGAACCGCCC 

101 GGCCCGCGAT GATTACCGTC TCGCTGCGCC GCGCGGGATG CGGCGGCGAG 
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E51 CGGTCGAAAT CCCCGTTCCC ATCCATTTGG ATTCGGATGA AGCCGTATGC 

6 01 CGTCTGAAAG CCGTACTCGA GCCCTTGTGC GCGCCCTACA TCCCCGCCAT 

651 TCAGCGGTAT TTGGAAAACG TGCAGGCGGA AAAACTGTTT ATCACGCCCG 

701 CCGCCAGGCC GCGCGTTACC CGCGTACCGT ACGACGACAA GGCATACCGC 

751 ATCATCGTCC GCTTCGCCTC CCCCGTTTCA AAGCGGCTGG AAATCCAACA 

801 GGCGGTTATG GACGAATTTT TGCGCGTACA ATACCGCCTG TTAAATCATC 

851 CCGCCGgczc cgAAACACTT TAA 



This corresponds to the amino acid sequence <SEQ ID 476; ORF 127.ng>: 

g!27 .pep 



1 MEIWNMLNTW PDAVPIRAEA AESVAAVAAL LLARALLLNI HFRRHPDFGI 

51 ESKRRFLVAS RNITLLLVLF SLAFIWSA QI Q TLALSMFAV AAAWVAT KE 

101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RWDINLLNT LMMQVGPNPL 

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IKLDSDEAVC 

201 RLKAVLEPLC APYIPAIQRY LENVQAEKLF ITPAARPRVT RVPYDDKAYR 

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNHPAGSETL * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 477>: 



1 ATGGAAATAT GGAATATGTT GGACACTTGG CTCGGTGCCG TCCCGATACG 

51 TGCGGAGGCG GTCGAATCCG TGGCGGCGGT TGCGGCTTTG CTGCTGGCGC 

101 GCGCCCTTCT GTTGAATATC CACTTCAAAC GGCATCCGGA TTTCGGCATC 

151 GAAAGCAAGC GGCGGTTTTT GGTTGCCAGC CGCAATATAA CGCTGCTTTT 

201 GGTGCTGTTT TCGCTGGCAT TTATCTGGTC GGCGCAAATC CAAACGCTGG 

251 CTTTGTCGAT GTTTGCGGTG GCGGCGGCGG TCGTCGTGGC GACGAAGGAA 

3 01 CTGATTATGT GTCTGTCGGG CAGTATTTTA AGGTCTGCCA CCCAGCAATA 
351 CTCGGTCGGC GACTATATCG AAATCAACGG CCTGCGCGGG CGCGTGGTCG 

4 01 ACATCAACCT GTTGAACACG CTGATGATGC AGGTCGGTCC GAACCCCTTG 
451 GTCGGACAGC TTGCGGGAAC CACCGTTTCT TTCCCCAACA GCCTGTTGTT 
501 GAGCCACCCC GTGCGCCGCG ACAATATTTT GGGCGACTAT GTCATCCATA 
551 CGGTCGAAAT CCCCGTTCCC ATCCATTTGG ATTCGGATGA AGCCGTATGC 
601 CGTCTGAAAG CCGTACTCGA GCCCTTGTGC GCGCCCTACA TCCCCGCCAT 
651 CCAACGGsAT TTGGAAAACG TGCAGGCGGA AAAACTGTTT ATCACGCCCG 
701 CCGCCAGACC GCGCGTTACC CGCGTGCCGT ACGATGACAA GGCATACCGC 
751 ATCATCGTCC GCTTCGCTTC CCCCGTTTCA AAGCGGCTGG AAATCCAACA 
801 GGCGGTTATG GACGAATTTT TGCGCGTACA ATACCGCCTG TTAAATCACC 
851 CCGCCGGCTC CGAAACACTT TAA 



This corresponds to the amino acid sequence <SEQ ID 478; ORF 127>: 

ml 2 7 -pep 

1 MEIWNMLDTW LGAVPIRAEA VE5VAAVAAL LLARALLLNI HFKRHPDFGI 



51 ESKRRFLVAS R NITLLLVLF SLAFIWSA QI Q TLALSMFAV AAAWVAT KE 

101 LIMCLSGSIL RSATQQYSVG DYIEINGLRG RWDINLLNT LMMQVGPNPL 

151 VGQLAGTTVS FPNSLLLSHP VRRDNILGDY VIHTVEIPVP IHLDSDEAVC 

201 RLKAVLEPLC APYIPAIQRX LENVQAEKLF ITPAARPRVT RVPYDDKAYR 

251 IIVRFASPVS KRLEIQQAVM DEFLRVQYRL LNHPAGSETL * 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 127 shows 97.9% identity over a 290 aa overlap with a predicted ORF (ORF 127.ng) 
from N. gonorrhoeae: 

ml27/gl27 



m!27 . seq 



m!27.pep 



10 20 30 40 50 60 

ME I WNMLDTWLGAVP I PJIEAVESVAAVAALLLARALLLNIHFKRHPDFG IESKRRFLVAS 



gl27 




10 20 30 40 50 60 



ml 2 7 .pep 



70 80 90 100 110 120 

RNITLLLVLFSLAFIWSAQIQTIiALSKFAVAAAVWATKELIMCLSGSILRSATQQYSVG 



g!27 



RNITLLLVLFSLAFIWSAQIQTLALSMFAVAAAVWATKELIMCLSGSILRSATQQYSVG 
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130 140 150 160 170 ISO 

ml27 . pep DYISINGLRGRVVDINLLNTLMMQVGPNPLVGQLAGTTVSFPNSLLLSHPVRRDNILGDY 
I I I ! I i I I I I I I ! I I I ! 1 I I ! I M I i I i I I I I ! I II I I I i I I t I I I M I | I | i i | j M ( | 
al27 DY 1 E INGLRGR VV DI N LLNT LMMQVG PN PLVGQLAGTTV S FPN S LLLSHPVRP, DN I LG D Y 

130 140 150 160 170 ISO 

190 200 210 220 230 240 

ml27 .pep VIHTVEIPVPIHLDSDEAVCRLKAVLEPLCAPYIPAIQRXLENVQAEKLFITPAARPRVT 
1 I I ) ! i I I I I I ! M II I I I 1 1 I I I i I i 1 I ! II I ! t t 1 II I II I i 1 M I M I I I I : i I I I 
al27 VIHTVEIPVPIHLDSDEAVCRLKAVLEPLCAPYIPAIQRHLENVQAEKLFITPAAKPRVT 

190 200 210 220 230 , 240 

250 260 270 280 290 

m!27 . pep RVPYDDKAYRIIVRFASPVSKRLEIQQAVMDEFLRVQYRLLNHPAGSETLX 
i I I I I I I I I I i I I I I ! I I I I I I I ! (I M I I I I I I I I M I I j i : I ! I I I H I 
a 12 7 RVPYDDKAYRI IVRFASPVSKRLEIQOAVMDEFLRVQYRLLNYPAGSETLX 

250 2C0 270 280 290 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 48 1>: 

gl28 . seq 

1 atgattgaca acgCActgct ccacttgggc gaagaaccCC GTTTTaatca 

51 aatccaaacc gaagACAtca AACCCGCCGT CCAAACCGCC ATCGCCGAAG 

101 CGCGCGGACA AATCGCCGCC GTCAAAGCGC AAACGCACAC CGGCTGGGCG 

151 AACACCGTCG AGCGTCTGAC CGGCATCACC GAACGCGTCG GCAGGATTTG 

201 GGGCGTCGTG TCCCATCTCA ACTCCGTCGT CGACACGCCC GAACTGCGCG 

251 CCGTCTATAA CGAACTGATG CCTGAAATCA CCGTCTTCTT CACCGAAATC 

3 01 GGACAAGACA TCGAACTGTA CAACCGCTTC AAAACCATCA AAAATTCCCC 

3 51 CGAATTTGCA ACGCTTTCCC CCGCACAAAA AACCAAGCTC GATCACGACC 

4 01 TGCGCGATTT CGTATTGAGC GOCGCGGAAC TGCCGCCCGA ACGGCAGGCA 
451 GAACTGGCAA AACTGCAAAC CGAAGGCGCG CAACTTTCCG CCAAATTCTC 
501 CCAAAACGTC CTAGACGCGA CCGACGCGTT CGGCATTTAC TTTGACGATG 
551 CCGCACCGCT TGCCGGCATT CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 
601 GCCGCGCAAA GCGAAGGCAA AACAGGTTAC AAAATCGGCT TGCAGATTCC 
6 51 GCACTACCTT GCCGTTATCC AATACGCCGG CAACCGCGAA CTGCGCGAAC 
701 AAATCTACCG CGCCTACGTT ACCCGTGCCA GCGAACTTTC AAACGACGGC 
751 AAATTCGACA ACACCGCCAA CATCGACCGC ACGCTCGAAA ACGCATTGAA 
801 AACCGccaaa cTGCTCGGCT TTAAAAATTA CGCCGAATTG TCGCTGGCAA 
851 CCAAAATGGC GGACACGCCC GAACAGGTTT TAAACTTCCT GCACGACCTC 
901 GCCCGCCGCG CCAAACCCTA CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
951 CTTCGCCCGC GAACACCTCG GTCTCGCCGA CCCGCAGCCG TGGGACTTGA 

1001 GCTACGCCGG CGAAAAACTG CGCGAAGCCA AATACGCATT CAGCGAAACC 

1051 GAAGTCAAAA AATACTTCCC CGTCGGCAAA GTTCTGGCAG GCCTGTTCGC 

1101 CCAAATCAAA AAACTCTACG GCATCGGATT CGCCGAAAAA ACCGTTCCCG 

1151 TCTGGCACAA AGACGTGCGC TATTTTGAAT TGCAACAAAA CGGCAAAACC 

12 01 ATCGGCGGCG TTTATATGGA TTTGTACGCA CGCGAAGGCA AACGCGGCGG 

12 51 CGCGTGGATG AACGACtaca AAGGCCGCCG CCGCTTTGCC GACGgcacGC 

13 01 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCGCCCC GCCCGTCGGC 
1351 GGCAAAGAAG CGCGTTTAAG CCACGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AacCGGCCAC GGACTGCACC ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 

14 51 TGTCCGGCAT CAAcggcgtA GAATGGGACG CGGTCGAACT GCCCAGCCAG 
1501 TTTATGGAAA ACTTCGTTTG GGAATACAAT GTATTGGCAC AAATGTCCGC 

15 51 CCACGAAGAA AccgGCGAGC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

16 01 TcgcCGCCAA AAACTTCCAG CGCGGTATGT TCCTCGTCCG GCAAATGGAG 
16 51 TTCGCCCTCT TCGATATGAT GATTTACAGT GAAAGCGACG AATGCCGTCT 
1701 GAAAAACTGG CAGCAGGTTT TAGACAGCGT GCGCAAAGAA GTcGCCGTCA 
1751 TCCAACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCacatctTC 
1801 GCcggcGGCT A7TCCGCAGG CTATTACAGC TACGCATGGG CCGAAGTCCt 
1851 cAGCACCGAT GCCTACGCCG CCTTTGAAGA AAGcGACGac gtcGCCGCCA 
1901 CAGGCAAACG CTTCTGGCAA GAAAtccttg ccgtcggcgg ctCCCGCAGC 
1951 gcgGCGGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 
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301 GKRFWQ/EILA VGXSRSGAES FKAFRGREPS IDALLRHSGF DNAV* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 128 shows 91.7% identity over a 475 aa overlap with a predicted ORF (ORF 128.ng) 
from /V. gonorrhoeae: 

mI28/g!28 

10 20 30 40 50 60 

g 1 2 8 . pep MI DNALLHLGE E PR FNQ I QTEDI KPAVQTA I AEARGQ I AAV KAQTHTGWANTVERLTG I T 

t ii MMiniiihii •■in iiimi 1 1 mi minimum m urn 

m 1 2 8 MTDN ALLHLGEE PRFDQ 1 KTED I KP ALQTAI AEAREQ I AAI KAOTHTGW ANTVE P LTG I T 

10 20 30 40 50 60 



70 80 90 100 110 120 

gl28 .pep ERVGRIWGWSHLNSWDTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 in i mi mini i! in mmi i m iii I! i iii mi 

ml 2 8 ERVGR I WGWSHLNCVADTPELRAVYNELMPE I TVFFTEIGQD I ELYNRFKT I KNSPEFD 

70 80 90 100 110 120 



130 140 150 160 170 180 

gl2 8 . pep TXiSPAOKTKLDHDLRDFVLSGAELPPERQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 

niimiihi 

ml 2 8 TLSPAQKTKLNH 
130 

// 

340 350 360 

g 1 2 8 . pep Y AGE KLR EAK YA FS E T E VKK Y FP VG KVLAG 

n mi mm m i mum n i 

ml 2 8 Y ASEKLREAKYAFSETXVKKYFPVGXVLNG 

10 20 30 



370 380 390 400 410 420 

gl2 8 . pep L FAQ I KKL Y G I G F AE KTV P VWHKDVR Y F E LOQN G KT I GGV YMDL Y ARE G KRGGAWMND Y K 

mi m it 1 1 1 1 : ii 1 1 1 1 1 1 1 1 N i immnimmimiiimimii 

ml2 8 LFAQXKKLYGIGFTEKTVPVWHKDVRYXELQQNGEXIGGVYMDLYAREGKRGGAWMNDYK 
40 50 60 70 80 90 



430 440 450 460 470 480 

gl2 8 . pep GRRRF ADGTLQLP TAYLVCNF AP PVGGKE ARLSKDE I LTLFHETGHGLKHLLTQVDELGV 

mmmmimmmimhimiimi nmmmmimm 

ml 2 8 GRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDE I LI LFHETGHGLHHLLTQVDELGV 

100 110 120 130 140 150 



490 500 510 520 530 540 

gl2 8 . pep SGINGVEWDAVELPSQFMENFWEY^LAQMSAHEETGEPLPKELFDKMLAAKNFQRGMF 

mm mmummmmm miiii mm n mmi m 

ml 2 8 SGINGVXVJDAVELPSOFMENF^EYNVl^OXSAHEETGVPLPKEI^DKXLAAKNFQXGMF 
160 170 180 190 200 210 



550 560 570 580 590 600 

g!28 .pep LVRQMEFALFDMMIYSESDECRLKNWOQVLDSVRKEVAVIQPPEYNRFANSFGHIFAGGY 

iii mmmiimi imimmnmimmimi immiii 

ml 2 8 XVRQXEFALFDMMI YSEDDEGRLKNWQOVLDSVRKKVAVIQPPEYNRFALSFGHI FAGGY 

220 230 240 250 260 270 



610 620 630 640 650 660 

gl28 - pep SAGYYSYAWAEVLSTDAYAAFEESDDVAATGKRFWQEILAVGGSRSAAES FKAFRGREPS 

m mmmmimmmimmmmm mnminmm 

ml 2 8 S AAX YS YAWAEVLSADAYAAFEESDDVAATGKRFWQEILAVGXSRSGAES FKAFRGREPS 

280 290 300 310 320 330 
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551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAVVRPPEYN RFANSFGHI F 
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
651 AAESFKAFRG REPSIDALLR HSGFDNAA* 

128/al28 66.0% identity in 677 aa overlap 

10 20 30 40 50 60 

m!2 3.pep MT DHALLH LGEE PR FDQI KTED I K PALQTA I AEAREQ I AAI KAQTHTGWANTVE PLTG I T 

I I ! I I I [ i I I II I ! I It I I I i ! I I I I I I I II I I I I f I I I I I I I I I 1 I M ! I I I I I I ! I ( I 
al2S MTDNALLHLGEEPRFDQ I KTEDIKPALQTAI AEAREQ I AAI KAQTHTGWANTVE PLTG IT 

10 20 30 40 50 60 

70 80 90 100 110 120 

inl28.pep ERVGRIWGVVSHLNCVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
li II 1 I I I I 1 I 1 I i I : I M M I i : I I I II I 1 M 11 M II i M ! 1 I I ! I I I I I ! i I M I 1 
a 128 ERVGRIWGVVSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 

70 80 90 100 110 120 

130 

ml 2 8 . pep TLSPAQKTKLNH 

I I I I I I I I I I I 

a 128 TLSHAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
130 140 150 160 170 180 



ml28.pep 

al28 FDDAAPLAGIPEDALAMFAAAAQSEGKTGYKIGLQ1PHYLAVIQYADNRKLREQIYRAYV 
190 200 210 220 230 240 



ml28.pep 

al2 8 TRASELSDDGKFDNTANIDRTLENALQTAKLLGFKNYAELSLATKMADTPEQVLNFLHDL 

250 260 270 280 290 300 

140 150 

ml28 .pep YASEKLREAKYAFSETXVKKYFPVGX 

i I : I M 1 I I I I 1 ! I ! I I ! I ( i ! I I 
al28 ARRAKPYAEKDLAEVKAFARESLGLADLQPWDLGYAGEKLREAKYAFSETEVKKYFPVGK 
310 320 330 340 350 360 

160 170 180 190 200 210 

ml 2 8 . pep VLNG LFAQXKKLYG I G FTEKTV PVW HKDVRYXE LQQNGEX I GGV YMDL Y AREGKRGGAWM 

I I I f I I 1 I M I II I 11 1 I I I I I I I I M I I I i i i I 1 1 i : M 1 I I II 1 ) I I 1 M 1 I I I 11 
al28 VLNGLFAQIKKLYGIGFTEKTVPVWHKDVRYFELQQNGETIGGVYMDLYAREGKRGGAWM 
370 380 390 400 410 420 

220 230 240 250 260 270 

ml28 . pep NDYKGRRRFSDGTLQLPTAYLVCNFAPPVGGREARLSHDEILILFHETGHGLHHLLTQVD 
I I I I I I I 1 I M I I I I I I I I I I I I I 1 : I I I I I : I I I I! I I I I I I II I I I I I I I I I M I I I 
al28 NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEILTLFHETGHGLHHLLTQVD 
430 440 450 460 470 480 

280 290 300 310 320 330 

ml 2 8 .pep ELGVSGINGVXWDAVELPSQFMENFVWEYNVLAQXSAHEETGVPLPKELXDKXLAAKNFQ 
I I I I II I I I I i ! M t I I I i I I I I I I I 11 1 1 I I I I I I I M it I I II II II M 5 I 11 I 
a 12 8 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMLAAKNFQ 
490 500 510 520 530 540 

340 350 360 370 380 390 

ml 28 .pep XGMFXVRQXEFALFDMMIYSEDDEGRLKNWQQVLDSVRKKVAVIQPPEYNRFALSFGHIF 
ill IN I I I I I I I I I I 1 I I 1 I I I I I I I I ! I I j I I I I : I I I I I I I 1 I I I I ; I II I 
a 128 RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAVVRPPEYNRFANSFGHIF 

550 560 570 5S0 590 600 
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201 


GGGCGTGGTG 


TCGCACCTCA 


251 


CCGTCTATAA 


CGAACTGATG 


301 


GGACAAGACA 


TCGAGCTGTA 


351 


CGAATTCGAC 


ACCCTCTCec 


401 


TGCGCGATTT 


CGTCCTCAGC 


451 


GAACTGGCAA 


AACTGCAAAC 


501 


CCAAAACGTC 


CTAGACGCGA 


551 


CCGCACCGCT 


TGCCGGCATT 


601 


GCCGCGCAAA 


GCGAAAGCAA 


651 


ACACTACCTC 


GCCGTCATCC 


701 


AAATCTACCG 


CGCCTACGTT 


751 


AAATTCGACA 


ACACCGCCAA 


801 


AACCGCCAAA 


CTGCTCGGCT 


651 


CCAAAATGGC 


GGACACGCCC 


901 


GCCCGCCGCG 


CCAAACCCTA 


951 


CTTCGCCCGC 


GAAAGCCTGA 


1001 


GCTACGCCAG 


CGAAAAACTG 


1051 


GAAGTCAAAA 


AATACTTCCC 


1101 


CCAAATCAAA 


AAACTCTACG 


1151 


TCTGGCACAA 


AGACGTGCGC 


1201 


ATAGGCGGCG 


TTTATATGGA 


1251 


CGCGTGGATG 


AACGACTACA 


1301 


TGCAACTGCC 


CACCGCCTAC 


1351 


GGCAGGGAAG 


CCCGCCTGAG 


1401 


AACCGGACAC 


GGGCTGCACC 


1451 


TATCCGGCAT 


CAACGGCGTA 


1501 


TTTATGGAAA 


ATTTCGTTTG 


1551 


CCACGAAGAA 


ACCGGCGTTC 


1601 


TCGCCGCCAA 


AAACTTCCAA 


1651 


TTCGCCCTCT 


TTGATATGAT 


1701 


GAAAAACTGG 


CAACAGGTTT 


1751 


TCCAGCCGCC 


CGAATACAAC 


1801 


GCAGGCGGCT 


ATTCCGCAGG 


1851 


GAGCGCGGAC 


GCATACGCCG 


1901 


CAGGCAAACG 


CTTTTGGCAG 


1951 


GCGGCAGAAT 


CCTTCAAAGC 


2001 


ACTCTTGCGC 


CACAGCGGTT 



ACTCCGTCGC CGACACGCCC GAACTGCGCG 
CCCGAAATCA CCGTCTTCTT CACCGAAATC 
CAACCGCTTC AAAACCATCA AAAATTCCCC 
CCGCACAAAA AACCAAACTC AACCACGATC 
GGCGCGGAAC TGCCGCCCGA ACAGCAGGCA 
CGAAGGCGCG CAACTTTCCG CCAAATTCTC 
CCGACGCGTT CGGCATTTAC TTTGACGATG 
CCCGAAGACG CGCTCGCCAT GTTTGCCGCC 
AACAGGCTAC AAAAT CGGCT TGCAGATTCC 
AATACGCCGA CAACCGCGAA CTGCGCGAAC 
ACCCGCGCCA GCGAACTTTC AGACGACGGC 
CATCGACCGC ACGCTCGCAA ACGCCCTGCA 
TCAAAAACTA CGCCGAATTG TCGCTGGCAA 
GAACAAGTTT TAAACTTCCT GCACGACCTC 
CGCCGAAAAA GACCTCGCCG AAGTCAAAGC 
ACCTCGCCGA TTTGCAACCG TGGGACTTGG 
CGCGAAGCCA AATACGCGTT CAGCGAAACC 
CGTCGGCAAA GTATTAAACG GACTGTTCGC 
GCATCGGATT TACCGAAAAA ACCGTCCCCG 
TATTTTGAAT TGCAACAAAA CGGCGAAACC 
TTTGTACGCA CGCC^AGGCA AACGCGGCGG 
AAGGCCGCCG CCGTTTTTCA GACGGCACGC 
CTCGTCTGCA ACTTCGCCCC ^■'"'"CGTCGGC 
CCACGACGAA ATCCTCATCC TTTCCACGA 
ACCTGCTTAC CCAAGTGGAC GAACTGGGCG 
GAATGGGACG CGGTCGAACT GCCCAGCCAG 
GGAATACAAT GTCTTGGCAC AAATGTCAGC 
CCCTGCCGAA AGAACTCTTC GACAAAATGC 
CGCGGCATGT TCCTCGTCCG GCAAATGGAG 
GATTTACAGC GAAGACGACG AAGGCCGTCT 
TAGACAGCGT GCGCAAAAAA GTCGCCGTCA 
CGCTTCGCCT TGAGCTTCGG CCACATCTTC 
CTATTACAGC TACGCGTGGG CGGAAGTATT 
CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 
GAAATCCTCG CCGTCGGCGG ATCGCGCAGC 
CTTCCGCGGC CGCGAACCGA GCATAGACGC 
TCGACAACGC GGTCTGA 



This corresponds to the amino acid sequence <SEQ ID 490; ORF 128-1>; 

ml28-l.pep. 

1 MTDNALLHLG EEPRFDQIKT EDIKPALQTA IAEAREQIAA IKAQTHTGWA 

51 NTVEPLTGIT ERVGRIWGW SHLNSVADTP ELRAVYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLS PAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

2 01 AAQSESKTGY KIGLQIPHYL AVIQYADNRE LREQIYRAYV TRASELSDDG 

2 51 KFDNTANIDR TLANALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 
301 ARRAKPYAEK DLAEVKAFAR ESLNIjADLQP WDLGYASEKL REAKYAFSET 

3 51 EVKKYFPVGK VLNGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

4 01 I GG VYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFAPPVG 
4 51 GREARLSHDE ILILFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSO 
501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 
551 FALFDMMIYS EDDEGRLKNVJ QQVLDSVRKK VAVIQPPEYN RFALSFGHIF 
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ EILAVGGSRS 
6 51 AAESFKAFRG REPSIDALLR HSGFDNAV* 



ml28-l/gl28-l 94.5% identity in 491 aa overlap 



10 20 30 40 50 60 

gl28 - 1 . pep MIDNALLHLGEEPRFNQIKTEDIKPAVOTAIAEARGQIAAVKAQTHTGWANTVERLTGIT 
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ml2B- 1 MTDNALLHLGEEPRFDQIKTEDIKPALQTAIAEAREQIAAIKAQTHTGWANTVEPLTGIT 
10 20 30 40 50 60 
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12 51 CGCGTGGATG AACGACTACA AAGGCCGCCG CCGTTTTTCA GACGGCACGC 

13 01 TGCAACTGCC CACCGCCTAC CTCGTCTGCA ACTTCACCCC GCCCGTCGGC 

13 51 GGCAAAGAAG CCCGCTTGAG CCATGACGAA ATCCTCACCC TCTTCCACGA 

14 01 AACCGGACAC GGCCTGCACC ACCTGCTTAC CCAAGTCGAC GAACTGGGCG 

14 51 TATCCGGCAT CAACGGCGTA GAATGGGACG CAGTCGAACT GCCCAGTCAG 

15 01 TTTATGGAAA ATTTCGTTTG GGAATACAAT GTCTTGG CGC AAA TGTCCGC 

15 51 CCACGAAGAA ACCGGCGTTC CCCTGCCGAA AGAACTCTTC GACAAAATGC 

16 01 TCGCCGCCAA AAACTTCCAA CGCGG AATGT TCCTCGTCCG CCAAATGGAG 
16 51 TTCGCCCTCT TTGATATGAT GATTTACAGC GAAGACGACG AAGGCCGTCT 
1701 GAAAAACTGG CAACAGGTTT TAGACAGCGT GCGCAAAGAA GTCGCCGTCG 
1751 TCCGACCGCC CGAATACAAC CGCTTCGCCA ACAGCTTCGG CCACATCTTC 
1801 GCAGGCGGCT ATTCCGCAGG CTATTACAGC TACGCGTGGG CGGAAGTATT 
18 51 GAGCGCGGAC GCATACGCCG CCTTTGAAGA AAGCGACGAT GTCGCCGCCA 
1901 CAGGCAAACG CTTTTGGCAG GAAATCCTCG CCGTCGGCGG AT CGCGC AGC 
1951 GCGGCAGAAT CCTTCAAAGC CTTCCGCGGA CGCGAACCGA GCATAGACGC 
2001 ACTCTTGCGC CACAGCGGCT TCGACAACGC GGCTTGA 

This corresponds to the amino acid sequence <SEQ ID 492; ORF 128-1 .a>: 

al28 -1 .pep 

1 MTDNALLHLG EEPRFDQIKT ED I KPALQTA IAEAREQIAA IKAQTHTGWA 

SI NTVEPLTGIT ERVGRIWGW SHLNSVTDTP ELRAAYNELM PEITVFFTEI 

101 GQDIELYNRF KTIKNSPEFD TLSHAQKTKL NHDLRDFVLS GAELPPEQQA 

151 ELAKLQTEGA QLSAKFSQNV LDATDAFGIY FDDAAPLAGI PEDALAMFAA 

2 01 AAQSEGKTGY KIGLQIPHVL AVIQYADNRK LREQIYRAYV TRASELSDDG 

2 51 KFDNTANIDR TLENALQTAK LLGFKNYAEL SLATKMADTP EQVLNFLHDL 
301 ARRAKPYAEK DLAEVKAFAR ESLGLADLQP WDLGYAGEKL REAKYAFSET 

3 51 EVKKYFPVGK VUJGLFAQIK KLYGIGFTEK TVPVWHKDVR YFELQQNGET 

4 01 IGGVYMDLYA REGKRGGAWM NDYKGRRRFS DGTLQLPTAY LVCNFTPPVG 
4 51 GKEARLSHDE ILTLFHETGH GLHHLLTQVD ELGVSGINGV EWDAVELPSQ 
501 FMENFVWEYN VLAQMSAHEE TGVPLPKELF DKMLAAKNFQ RGMFLVRQME 
551 FALFDMMIYS EDDEGRLKNW QQVLDSVRKE VAWRPPEYN RFANSFGHIF 
601 AGGYSAGYYS YAWAEVLSAD AYAAFEESDD VAATGKRFWQ 31 LA VGGSRS 
651 AAESFKAFRG REPSIDALLR HSGFDNAA* 

ml28-l/al28-l 97.8% identity in 677 aa overlap 

10 20 30 40 50 60 

al28- 1 .pep MTDNALLHLGEEPRFDQI KTED I KPALQTAI AEAREQ I AAIKAQTHTGWANTVE PLTG I T 

{UililllillllKIIIIIKMIIlilMUIIIIllif MIMIIMIIIUIIfi 

ml 2 B - 1 MTDNALLHLGEEPRFDQ I KTED I KPALQTAI AEAREQIAAI KAQTKTGWANT VE PLTG I T 

10 20 30 40 50 60 

70 80 90 100 110 120 

al28 -1 ■ pep ERVGRIWGWSHLNSVTDTPELRAAYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
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ral2B-l ERVGRIWGWSHLNSVADTPELRAVYNELMPEITVFFTEIGQDIELYNRFKTIKNSPEFD 
70 80 90 100 110 120 
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al28-l .pep T LS HAQKTKLNHD LRD FVLSGAELPP EQQAE LAKLQTEGAQ LS A KF S QNVLDATDA FG I Y 
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ml28 - 1 TLSPAQKTKLNHDLRDFVLSGAELPPEQQAELAKLQTEGAQLSAKFSQNVLDATDAFGIY 
130 140 150 160 170 180 
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al28-l .pep FDDAAPLAGI PEDALAMFAAAAQSEGKTGYKIGLQIPHYLAVIQYADNRKLREQIYRAYV 
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ml 2 8 - 1 FDDAAPLAGI PEDALAMFAAAAQSESKTGYKIGLQI PHYLAVIQYADNRELREQIYRAYV 

190 200 210 220 230 240 
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al28 - 1 . pep TRASELS DDG KFDNTAN I DRT LEN ALQTAKLLGFKNYAELS LAT KMADT PEQVLNF LHD L 
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ml 2 8 - 1 TRASELSDDG KFDNTANIDRTLANALQTAKLI^FKNYAELSLATKMADTPEQVIJMFLiHDL 

250 260 270 280 290 300 

310 320 330 340 350 360 

al2 8 - 1 . pep ARRAKPYAEKDIAEVKAFARESLGLADLOPWDLGYAGEKLREAKYAFSETEVKKYFPVGK 

iMilllMllillllllllMMIIIimiMHIimilMUMUMMMI 
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Query: 363 NGLFAQIKKLYGIGFTE-KTVPVWHKDVRYFEL-OQNGETrGGVYMDLYAREGKRGGAWM 420 

+GLF IK + + + I E K V WHKDVR*F+L +N t- G Y+DLYARE KRGGAWM 
Sbjcc: 365 SGLFE L I KR I FN I RAVERKG VDTWHKDVR FFD LI D ENDQLRGS FYLD LYAREHKRGGAWM 424 

Query- 421 NDYKGRRRFSDGTLQLPTAYLVCNFTPPVGGKEARLSHDEIXXXXXXXXXXXXXXXXQVD 4 80 

+D GR+R DG+++ P AYL CMF P+G K A +H+E+ Q+D 
Sbjct: 425 DDCIGRKRKLDGSIETPVAYLTCNFNAPIGNKPALFTHNEVTTLFHEFGHGIHHMLTQID 484 

Query: 4 81 ELGVSGINGVEWDAVELPSQFMENFVWEYNVLAQMSAHEETGVPLPKELFDKMIAAKNFQ 540 

V+GINGV WDAVELPSQFMEN+ WE LA +S H ETG PLPKE ++L AKNFQ 
Sbjct: 4 85 VSDVAGINGVPWDAVELPSQFMENWCWEEEALAFISGHYETGEPLPKEKLTQLLKAKNFQ 544 

Query. 541 RGMFLVRQMEFALFDMMIYSEDDEGRLKNWQQVLDSVRKEVAWRPPEYNRFANSFGHIF 6 00 

MF++RQ+EF +FD ++ D + L SV+ +VAV++ + + R +SF HIF 

Sbjct: 545 AAMFILRQLEFGIFDFRLHHTFDAEKTNQILDTLKSVKSQVAVIKGVDWARAPHSFSHIF 604 

Query: 601 XXXXXXXXXXXXWAEVLSADAYAAFEESDDV-AATGKRFWQEILAVGGSRSAAESFKAFR 659 

WAEVLSADAY+ FEE TGK F EIL GGS E FK FR 

Sbjct: 605 AGGYAAGYYSYLWAEVLSADAYSRFEEEGIFNPITGKSFLDEILTRGGSEEPMELFKRFR 664 

Query: 660 GREPSIDALLRKSGFDN 6 76 

GREP +DALLRH G N 
Sbjct: 665 GREPQLDALLRHKGIMN 631 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 493>: 



gl29 . seq 












1 


ATGCTTTCAC 


CTCCTCGGCG 


TAAAACGGCG 


GCACATCAAT 


CAAGCCGTCT 


51 


TTCATTTGCG 


TGCGGAAAAA 


ATGCGGCGTG 


TTGCCGTGAT 


CAAAATCAAT 


101 


ATCGTGCAGC 


ATCCAGCCCA 


AATCGCGGTT 


TGCCTCGCTT 


TCCGATAACG 


151 


CCGACGGCGG 


CAGCGGTTCA 


CCCTTATCCG 


CGCTTTCGCC 


ATTTGCCCTT 


201 


TCAGGCTGCG 


GGCATAGGGG 


CGGAACAGGC 


GGCGGTCGAA 


TCCTGTTTCA 


251 


TCCGGACAAA 


CGCGTTGGCA 


GTCGGAAAAT 


CCGGCCGGCC 


GTGTCAAATA 


301 


ATGCGTTACT 


TTGGCCGGGT 


CTTGTCCTTT 


GTAAG CGGCG 


GTCTTTTTTT 


351 


GCGCGCCATC 


CGCATCTGTT 


TGGGCGCATG 


GCAAACGGCG 


GCTGCCGTAC 


401 


AATCAAAATG 


TTTGGCGATT 


TCATGCAGAC 


AGGCATCCGG 


ATGCCGCCCG 


451 


ACATATCGAG 


CCGGTTTTTG 


CCTATCCGAT 


TTGGCGGCAT 


TTAGG CCGGT 


501 


AACTTGA 











This corresponds to the amino acid sequence <SEQ ID 494; ORF 129.ng>: 

gl29 . pep 

1 MLSPPRRKTA AKQSSRLSFA CGKNAACCRD QNQYRAASSP NRGLPRFPIT 

51 PTAAAVHPYP RFRHLPFQAA GIGAEQAAVE SCFIRTNALA VGKSGRPCQI 

101 MRYFGR VL5F VSGGLFLRAI RIC LGAWQTA AAVQSKCLAI SCRQASGCRP 

151 TYRAGFCLSD LAAFRPVT* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 495>: 

ml29.seq (partial) 

1 . . TATCTGCGCT TTCACTATTT GCCCTTTCAG GCTGCGGGCA TAGGGACGGA 

51 ACAGGTAGCG GTCAAATCCT GTTTCATCCA AATAAACACG TTGGTAGTCG 

101 GAAAATTCGG CCGGCTGTGT CAAATAATGC GTTACTTTGG CCGGGTCTTG 

151 TTCTTTGTAA GTGGTGGTCT TTTTTTGCGC GTTATCCCCA TCTGTTTGAG 

201 TGCATAGCAA ATGGTGGCTG CCGTACAATC AAAATGTTTG GCGATTTCAT 

251 GCAGATAGGC ATCCGGGTGT TGCCCAACAT ATTGAGCCGG TTTTTGCCTA 

301 TCCGATTTGA CGGCATTTAG ACCGGTAACT TGA 

This corresponds to the amino acid sequence <SEQ ID 496; ORF 129>: 
ml29.pep (partial} 

1 . . YLRFHYL-PFQ AAGIGTEQVA VKSCFIQINT LWGKFGRLC QIMRYFGRVL 
51 FFVSGGLFLR VIPIC LSAXQ MVAAVQSKCL AISCRXASGC CPTYXAGFCL 
101 SDLTAFRPVT * 

Computer analysis of this amino acid sequence gave the following results: 
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251 AATGCCACGC GGCGGACAGC AATGTGCCGA ACGCTCCGAA ACTGGAACAC 

301 AACGGCGACT GGGCGCCGCG TATCGCGCAA GGCTTCGATA CCTTGTTCCA 

3 51 ACACGCGCTG AACGGCTTTA ACGCCATGCC TGCCAAAGGC GGTGCGGCAG 

4 01 ACCTGACCGA TCAGGAACTC AAACGGGCGA TTACCTACAT GGCGAATAAA 
4 51 AGCGGCGGTT CTTTCCCGAA TCCTGATGAG GCTGCGCCTG CCGACAATGC 
501 CGCTTCAGGA ACAGCTTCTG CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG 
551 CGAAGGCAGA AGACAAGGGT GCGGCAGCCC CTGCGGTCGG CGTTGACGGT 
601 AAAAAAGTCT TCGAAGCAAC CTGTCAGGTG TGCCACGGCG GTTCGATTCC 
651 CGGTATTCCC GGCATAGGCA AAAAAGACGA TTGGGCACCG CGTATCAAAA 
701 AAGGCAAAGA AACCTTGCAC AAACATGCCC TTGAAGGCTT TAACGCGATG 
751 CCGGCCAAAG GCGGCAATGC AGGTTTGAGC GATGACGAAG TCAAAGCGGC 
801 TGTTGACTAT ATGGCAAACC AATCCGGTGC AAAATTCTAA 



This corresponds to the amino acid sequence <SEQ ID 500; ORF 130.ng>: 

gl30 .pep 



1 MKQLRDNKAQ GSALF TLVSG IVIVIAVLYF LI KLAGSGSF GDVDATTEAA 

51 TQTRIQPVGO LTMGDGIPVG ERQGEQIFGK ICIQCKAADS NVPNAPKLEH 

101 NGDWAPRIAQ GFDTLFQHAL NGFNAMPAKG GAADLTDQEL KRAITYMANK 

151 SGGSFPNPDE AAPADNAASG TASAPADSAA PAEAKAEDKG AAAPAVGVDG 

2 01 KKVFEATCQV CHGGSIPGIP GIGKKDDWAP RIKKGKETLH KHALEGFNAM 

251 PAKGGNAGLS DDEVKAAVDY MANQSGAKF* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 501>: 

ml30.seq (partial) 



1 . . GGCGAACAGA TTTTCGGCAA AATCTGTATC CAATGCCACG CGGCGGACAG 

51 CAATGTGCCG AACGCTCCGA AACTGGAACA CAACGGCGAT TrGGCACCGC 

101 GTATCGgCAA GGCTTCGATA CCTTGTTCCA ACACGCGCTG AACGGCTTTA 

151 ACGCCATGCC TGCAAAAGGC GGTGCGGCAG ACCTGACCGA TCAGGAACTT 

201 AAACGGGCGA TTACTTACAT GGCGAACAAA AGCGGCGGTT CTTTCCCGAA 

251 TCCTGATGAG GCTGCGCCTG CCGACAATGC CGCTTCAGGA ACAGCTTCTG 

301 CTCCTGCCGA TAGTGCAGCT CCGGCAGAAG CGAAGGCAGA AGACAAGGGT 

3 51 GCGGCAcCCC TGCGGTCGGC GTTGACGGTA AAAAAGTCTT CGAAGCAACC 

401 TGTCAGGTGT GCCACGGCGG TTCGATTCCC GGTATTCCCG GCATAGGCAA 

451 AAAAGACGAT TGGGCACCGC GTATCAAAAA AGGCAAAGAA ACCTTGCACA 

501 AACACGCCCT TGAAGGCTTT AACGCGATGC CTGCCAAArG CGgCAATGCA 

551 GGTTTGAGCG ATGACGAAgT CAAAGCGGCT GTTGACTATA TGGCAAACCA 

601 ATCCGGTGCA AAATTCTAA 



This corresponds to the amino acid sequence <SEQ ID 502; ORF 130>: 

ml30.pep (partial) 

1 . . GEQIFGKICI QCHAADSNVP NAPKLEHNGD XAPRIQGFDT LFQKALNGFN 
51 AMPAKGGAAD LTDQELKRAI TYMANKSGGS FPNPDEAAPA DNAASGTASA 
101 PADSAAPAEA KAEDKGAAPA VGVDGKKVFE ATCQVCHGGS IPGIPGIGKK 
151 DDWAPRIKKG KETLHKHALE GFNAMPAKXG NAGLSDDEVK AAVDYMANQS 
201 GAKF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 130 shows 98.1% identity over a 206 aa overlap with a predicted ORF (ORF 130.ng) 
from N. gonorrhoeae: 



m!30/gl30 



ml 3 0 . pep 



10 20 30 

GEQI FGKI CI QCHAADSNVPNAPKLEHNGD 



gl30 



DATTEAATQTRIQPVGQLTMGDGI PVGERQGEQI FGKICIQCHAADSNVPNAPKLEH1 
50 60 70 80 90 100 




m!30 -pep 



40 50 60 70 80 89 

XAPRI-OGFDTLFQHAIJ^GF^AMPAKGGAADLTDQELKRAITYMANKSGGSFPNPDEAAP 



gl30 




110 120 130 140 150 160 
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I l I ( I l ! I I i I I I I ! I I I ! i i I i M I ! i f ( i i I I ! I I ! I I (I M M I ! ! ! I II f I I 
a 130 KKDDWAPRIKKGKETLHKHALEGFNAMPAKGGKAGLSDDEVKAAVDYMANQSGAKFX 
230 240 250 260 270 2S0 



The following partial DNA sequence was identified in /V. gonorrhoeae <SEQ ID 505>: 

gl32 . seq 

1 ATGGAAGCCT TCAAAACCCT AATTTGGATT ATTAATATTA TTTCCGCTTT 

51 GGCCGTCATC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACCTT CGGATCGGGA AGCGGCAGCG CGCAAGGCGT ATTCGGCTCT 

151 GCCGGCAACG CCAACTTCCt CAgccGCTCG AccGccGTTG CAGCAACAtt 

201 tttcttTGca acctgcAtgg gctatggTgt arattcacac CCACACGACA 

251 AAACACGGTT TGGACTtcag caacataCGA CAGACTCAGC AagcACCCAA 

301 ACCcgtAAGC AATACCGAAC CTTCTGCCCC TGTTCCTCAG CAGCAGAAAT 

351 AACagtTTTT CAAATgccga caTGgtga 

This corresponds to the amino acid sequence <SEQ ID 506; ORF 132.ng>: 

g!32 .pep 

1 MEAFKTLIWI INIISALA VI VLVLLQHGKG ADAGATFGSG SGSAQGVFGS 

51 AGNANFLSRS TAVAATFFFA TCMGYGVYSH PHDKTRFGLQ QHTTDSASTQ 

101 TRKQYRTFCP CSSAAEITVF QMPTW* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 507>: 

ml32.seq (partial) 

1 ATGGAACCCT TCAAAACCTT AATTTGGATT GTTAATTTAA TTTCCGCTTT 

51 GGCCGTCTTC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACTTT CGGA . . . 

This corresponds to the amino acid sequence <SEQ ID 508; ORF 132>: 

ml32,pep (partial} 

1 MEPFKTLIWI VNLISALA VF VLVLLQHGKG ADAGATFG - . . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 132 shows 89.5% identity over a 38 aa overlap with a predicted ORF (ORF 132.ng) 
from N. gonorrhoeae: 

ml32/g!32 



10 20 30 

ml32 . pep MEPFKTLIWI VNL I S ALAV FV LVL LQHG KGAD AG AT FG 

II MIIMhhllllihlllllllllllillllll 
gl32 MEAFKTLIWI INI I SALAVI VLVLLQHGKGADAGATFGSGSGSAQGVFGSAGNANFLSRS 

10 20 30 40 50 60 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 509>: 

al32 . seq 

1 ATGGAAGCCT TCAAAACCCT AATTTGGATT GTTAATATAA TTTCCGCTTT 

51 GGCCGTCATC GTGTTAGTAT TGCTCCAACA CGGCAAAGGC GCGGATGCCG 

101 GCGCGACTTT CGGATCGGGA AGCGGCAGCG CGCAAGGCGT ATTCGGCTCT 

151 GCCGGCAACG CTAACTTCCT CAGCCGCTCG ACCGCCGTTG CAGCAACATT 

201 TTTCTTTGCA ACCTGCATGg GCTATGGTGT ATATTCACAC CCACACGACA 

2 51 AAACACGGTT TGGACTTCAG CAACGTACAA CAAACTCAGC AAGCACCCAA 

301 ACCCGTAAGC AATACCGAAC CTTCTGCCCC TGTTCCTCAG CAGCAGAAAT 

351 AACAGTTTTT CAAATGCCGA CATGGTGA 

This corresponds to the amino acid sequence <SEQ ID 510; ORF 132.a>: 

a!32 . pep 

1 MEAFKTLIWI VNII SALA VI VLVLLQHGKG ADAGATFGSG SGSAQGVFGS 

51 AGNANFLSRS TAVAATFFFA TCMGYGVYSH PHDKTRFGLQ QRTTNSASTQ 

101 TRKQYRTFCP CSSAAEITVF QMPTW* 



m!32/al32 92.1% identity in 38 aa overlap 
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201 TTCCGTGGCA TCAAGTGTGA TGCAGTTCGA TTACAAAGAC CACACCGTCA 

251 ACCTCTTGGA CACGCCGGGA CACCAAGACT TCTCCGAAGA CACCTACCGC 

3 01 GTTTTAACCG CCGTGGACAG CGCATTAATG GTCATCGACG CGGCAAAAGG 

3 51 CGTGGAAGCG CAAACCATCA AGCTCTTAAA CGTCTGCCGC CTGCGCGATA 
401 CACCGATTGT TACGTTTATG AACAAATACG ACCGCGAAGT GCGCGATTCC 

4 51 CTGGAACTTT TGGACGAAGT GGAAAACATT TTAAAAATCC GCTGCGCGCC 
501 CGTTACCTGG CCGATCGGTA TGGGCAAAAA CTTCAAGGGC GTGTACCACA 
551 TCCTGAACGA TGAAATTTAT CTCTTTGAAG CTGGCGGCGA ACGCCTGCCG 
601 CACGAGTTCG ACATCATCAA AGGCATCGAT AATCCTGAAT TGGAACAACG 
651 CTTTCCGTTG GAAATCCAGC AGTTGCGCGA CGAAATCGAA TTGGTGCAGG 
701 CGGCTTCCAA CGAGTTTAAT CTCGACGAAT TCCTCGCCGG CGAACTCACG 
751 CCCGTATTCT TCGGCTCTGC GATTAACAAC TT^GGTATTC AGGAAATCCT 
801 CAATTCATTG ATTGACTGGG CGCCCGCGCC GAAACCGCGC GACGCGACCG 
851 TACGTATGGT CGAGCCGGAC GAGCCGAAGT TTTCCGGATT TATCTTCAAA 
901 ATCCAAGCCA ATATGGACCC GAAACACCGC GACCGTATTG CCTTCTTGCG 
951 CGTCTGCTCC GGCAAATTCG M3CGCGGCAT GAAGATGAAA CACCTGCGTA 

1001 TCAACCGCGA AATCGCCGCC TCCAGCGTGG TTACCTTCAT GTCGCACGAC 

1051 CGCGAGCTGG TTGAAGAAGC CTACGCCGGC GACATTATCG GCATCCCGAA 

1101 CCACGGCAAC ATCCAAATCG GCGACAGCTT CTCCGAAGGC GAACAACTGG 

1151 CGTTCACCGG CATCCCATTC TTCGCACCCG AACTGTTCCG CAGCGTACGC 

12 01 ATCAAAAACC CGCTGAAAAT CAAACAACTG CAAAAAGGCT TGCAACAGCT 
1251 CGGCGAAGAA GGCGCGGTGC AGGTGTTCAA ACCGATGAGC GGCGCGGATT 

13 01 TGATTTTGGG CGCGGTCGGC GTGTTGCAGT TTGAAGTCGT TACCTCGCGC 

13 51 CTCGCCAACG AATACGGCGT AGAAGCCGTG TTCGACAGCG CATCCATCTG 

14 01 GTCGGCGCGC TGGGTATCGT GCGACGACAA GAAAAAACTG GCTGAATTTG 
14 51 AAAAAGCCAA CGCGGGCAAC CTCGCCATCG ACGCAGGCGG CAACCTCGCC 
1501 TACCTCGCCC CCAACCGCGT GAATTTGGGA CTCACGCAAG AACGTTGGCC 
1551 GGACATCGTG TTCCACGAAA CACGCGAACA TTCGGTCAAA CTGTAA 

This corresponds to the amino acid sequence <SEQ ID 514; ORF 134>: 

m!34 .pep 

1 MSQEILDQVR RRRTFAIISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWME IEKQRGISVA SSVMQFDYKD HTVNLLDTPG HQDFSEDTYR 

101 VLTAVDSALM VIDAAKGVEA QTIKLLNVCR LRDTPIVTFM NKYDREVRDS 

151 LELLDEVENI LKIRCAPVTW PIGMGKNFKG VYHILNDEIY LFEAGGERLP 

201 HEFDIIKGID NPELEQRFPL EIQQLRDEIE LVQAASNEFN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL IDWAPAPKPR DATVRMVEPD EPKFSGFIFK 

301 IQANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSWTFMSHD 

351 RELVEEAYAG DIIGIPNHGN IQIGDSFSEG EQLAFTGIPF FAPELFRSVR 

401 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADLILGAVG VLQFEW TSR 

451 LANEYGVEAV FDSASIWSAR WVSCDDKKKL AEFEKANAGN LAIDAGGNLA 

501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from N. gonorrhoeae 

ORF 134 shows 98.7% identity over a 531 aa overlap with a predicted ORF (ORF 134.ng) 
from N. gonorrhoeae: 

ml34/gl34 



10 20 30 40 50 60 

ml 3 4 . pep MSQEILDQVRRRRTFAI ISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWME 

IIIIMIIIIIIIIIIIIIlllllllllllllliMIIIINIIIMMIIIIIIMIh 

gl34 MSQEILDQVRRRRTFAIISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWMD 
10 20 30 40 50 60 



70 80 90 100 110 120 

ml34 .pep I EKQRG I S VASS VMQFDYKDHTVNLLDTPGHQDFS EDTYRVLTAVDSALMV IDAAKGVEA 

IIIIMllMIIIIMIIlllllUIIIIIIIIINIIllMIIIMIilllllillUI 

g 1 3 4 IE KQRG I SVASSVMQ FD YKDHTVNLLDTPGHQD FS EDTYRVLTAVDSALMV I D AAKGVE A 

70 80 90 100 110 120 

130 140 150 160 170 180 
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951 CGTCTGCTCC GGCAAATTCG 

1001 TCAACCGCGA AATCGCCGCC 

1051 CGCGAGCTGG TTGAAGAAGC 

1101 CCACGGCAAC ATCCAAATCG 

1151 CGTTTACCGG CATCCCATTC 

12 01 ATCAAAAACC CGCTGAAAAT 

12 51 TGGCGAAGAA GGTGCGGTGC 
1301 TGATTTTGGG CGCGGTCGGC 

13 51 CTTGCCAACG AATACGGCGT 

14 01 GTCGGCGCGC TGGGTATCGT 
14 51 AAAAAGCCAA CGCGGGCAAC 
1501 TACCTCGCCC CTAACCGCGT 
1551 GGACATCGTG TTCCACGAAA 



AGCGCGGCAT GAAAATGAAA CACCTGCGTA 
TCCAGCGTGG TAACCTTCAT GTCCCACGAC 
CTACGCCGGC GACATTATCG GTATCCCAAA 
GCGACAGCTT CTCCGAAGGC GAACAACTGA 
TTCGCGCCCG AACTGTTCCG CAGCGTTCGC 
CAAGCAACTG CAAAAAGGTT TGCAACAGCT 
AGGTGTTCAA ACCAATGAGC GGCGCGGATT 
GTGTTGCAGT TTGAAGTCGT TACCTCGCGC 
GGAAGCCGTG TTCGACAACG CATC CAT CTG 
GCGACGACAA GAAAAAACTG GCGGAATTTG 
CTCGCCATCG ACGCGGGCGG CAACCTCGCC 
GAATCTGGGA CTCACGCAAG AACGCTGGCC 
CGCGCGAGCA TTCGGTCAAA CTTTAA 



is corresponds to the ammo acid sequence <SEQ ID 516; ORF 134.a>: 

a 134 . pep 

1 MSQEILDQVR RRRTFAI ISH PDAGKTTLTE KLLLFSGAIQ SAGTVKGKKT 

51 GKFATSDWMD IEKQRGISVA SSVMQFDYXD HTVNLLDTPG HQDFSEDTYR 

101 VLTAVDSALM VXDAAKGVEA QTIKLLNVCR LRNTPIVTFM NKYDREVRDS 

151 LELLDEVENI LQIRCAPVTW PIGMGKNFKG VYHILNDEIY LFEAGGERLP 

201 HEFDIIKGID NPELEQRFPL EIQQLRDEIE LVQAASNEFN LDEFLAGELT 

251 PVFFGSAINN FGIQEILNSL IEWAPAPKPR DATVRKVEPD EPKFSGFIFK 

301 1QANMDPKHR DRIAFLRVCS GKFERGMKMK HLRINREIAA SSWTFMSHD 

351 RELVEEAYAG DIIGIPNHGN IQIGDSFSEG EQLTFTGIPF FAPELFRSVR 

4 01 IKNPLKIKQL QKGLQQLGEE GAVQVFKPMS GADLILGAVG VLQFEWT5R 

4 51 LANEYGVEAV FDNASIWSAR WVSCDDKKKL AEFEKANAGN LAIDAGGNLA 

501 YLAPNRVNLG LTQERWPDIV FHETREHSVK L* 



134/al34 98.9% identity in 531 aa overlap 

10 20 30 40 50 60 

ml 34 .pep MSQEILDQVRRRRTFAIISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWME 
M I } I i t i M 1 1 I i i I I i I I 1 I M I I I I I i I i I i i i I ! I I 1 t I I I i I I i I I I i t I It i I : 
a!34 MSQEILDQVRRRRTFAIISHPDAGKTTLTEKLLLFSGAIQSAGTVKGKKTGKFATSDWMD 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 3 4 . pep IEKQRGISVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVIDAAKGVEA 
M I i I I I I ! I I i I i I I I I M M I I I I I t t [ I I I I I ! I I I I I I I ! I I I I I i I I ! I 1 I I i I I 
al34 IEKQRGISVASSVMQFDYKDHTVNLLDTPGHQDFSEDTYRVLTAVDSALMVIDAAKGVEA 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml34 .pep QTIKLLNVCRLRDTPIVTFMNKYDREVRDSLELLDEVENILKIRCAPVTWPIGMGKNFKG 
M I I I I i I I I I I : I I I I I I I I i ! I I I I M I I I I I II I I I I I : M I I I M I I M I I I I I I I 
a!34 QTIKLLNVCRLRNTPIVTFMNKYDREVRDSLELLDEVENILQIRCAPVTWPIGMGKNFKG 

130 140 150 160 170 180 

190 200 210 220 230 240 

mi34 .pep vyhilndeiylfeaggerl?hefdiikgidnpeleqrfpleiqqlrdeielvqaasne™ 
I I I I I ! M I ! I I I I I I I i I I II II I I I I M I I i I I M ! I i I I I I I I I I I I I II I I I I M I 
al34 vyhilndeiylfeaggerlphefdiikgidnpeleqrfpleiqqlrdeielvqaasnefn 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml34 .pep LDEFLAGELTPVFFGSAINNFGIQEILNSLIDWAPAPKPRDATVRMVEPDEPKFSGFIFK 
1 I I I I ! I I I) I 1 I I I I 1 I I I II M I I I I I I I : I I i ! ! II I I II I I II I I I I II 111 M I I 

ai3 4 ldeflageltpvffgsainnfgiqeilnsliewapapkprdatvrmvepdepkfsgfifk 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml 3 4 . pep IQANMDPKHRDRIAFLRVCSGKFERGMECMKHLRINREIAASSWTFMSHDRELVEEAYAG 
II I I II I I I I It I i I I i I I I II I I I M I I I I I I I I i I I I I I ! I I I I I I I I I I II M II I I 
a!3 4 IQANMDPKHRDIvIAFLRVCSGKFERGMKMKHLRINREIAASSVVTFMSHDRELVEEAYAG 

310 320 330 340 350 360 
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501 GACCGACATA GACGGTCTTT ACACGGGCAA CCCGAACAGC AATCCCGATG 

551 CCGTACGGCT GGACAAAATC GAACACATCA ACCATGAAAT CATCGAAATG 

6 01 GCGGGCGGCT CGGGTTCGGC AAACGGCACG GGCGGTATGC TGACCAAAAT 

651 CAAAGCGGCA ACCATCGCCG CCGAATCCGG CGTACCGGTG TATATCTGTT 

701 CCTCGCTCAA ACCCGATGCA CTTGCCGAAG CTGCCGAACA TCAGGCGGAC 

751 GGCTCGTTTT TCGTCCCCCG TGCCAAAGGT TTGCGGACGC AGAAGCAATG 

801 GCTGGCGTTC TATTCCGAAA GCCGGGGCAG CGTTTATGTG GACGAAGGTG 

651 CGGAACACGC TTTGTCCGAA CAGGGGAAAA GCCTGCTGAT GTCGGGCATT 

901 GCCGGAATCG AAGGGCATTT TTCCCGTATG GACACCGTAA CCGTGTACAG 

951 CAAGGCAACC AAACAGCCCC TGGGCAAAGG GCGCGTCCTG TTCGGCTCTG 

1001 CCGCCGCCGA AGACCTGCTC AAATCGCGTA AGGCGAAAGG CGTGTTCATC 

10 51 CATCGGGACG ACTGGATTTC CATCACGCCC G;\AATACGCC TGCTTCTGAC 

1101 CGAATTTTAG 

This corresponds to the amino acid sequence <SEQ ID 520; ORF 135>: 

ml35 .pep 

1 MKYKRIVFKV GTSSITHSDG SLSRGKIQTI TCQLAALHKA GHE LVLVSSG 

51 AVAAGFGALG FKKRPVKIAD KQASAAVGQG LLMEEYTANL SSDGIVSAQI 

101 LLSRADFADK RRYQNAGGAL SVLLQRRAVP IINENDTVSV EELKIGDNDT 

151 LSAQVAAMIQ ADLLVLL TDI DGLYTGNPNS NPDAVRLDKI EHINHEIIEM 

201 AGGSGSANGT GGMLTKIKAA TIAAESGVPV YICSSLKPDA LAEAAEHQAD 

251 GSFFVPRAKG LRTQKQWLAF YSESRGSVYV DEGAEHALSE QGKSLLMSGI 

301 AGIEGHFSRM DTVTVYSKAT KQPLGKGRVL FGSAAAEDLL KSRKAKGVFI 

3 51 HRDDWISITP EIRLLLTEF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology rath a predicted ORF from N. gonorrhoeae 

ORF 135 shows 97.6% identity over a 294 aa overlap with a predicted ORF (ORF 135.ng) 
from N. gonorrhoeae: 

ml35/g!35 

10 20 30 40 50 60 

ml3 5 .pep 
g!35 



ml35 .pep 



ml35 .pep 
gl35 



ml 3 5 .pep 
gl35 



m!35 .pep 
g!35 

250 260 270 280 290 

310 320 330 340 350 360 



MKYKRIVFKVGTSSITHSDGSLSRGKIQTITCQLAALHHAGHELVLVSSGAVAAGFGALG 

(IIIIKIIIIIIilhlMIIIIIIIIIM IIIIMIIIIIIINIIilllMIIIII 

MKYKRIVFKVGTSSITRSDGSLSRGKIQTITRQLAALHHAGHELVLVSSGAVAAGFGALG 
10 20 30 40 50 60 

70 80 90 100 110 120 

FKKRPVKIADKQASAAVGQGLLMEEYTAKLSSDGIVSAOILLSRADFADKRRYONAGGAL 

IIIII!I!III!II1!IIIII[I!III!!!III[I!IIIIIII!M11!I!!II1!IIII 

FKKRPVKIADKOASAAVGQGLLMEEYTANLSSDGIVSAQILLSRADFADKRRYQNAGGAL 
70 80 90 100 110 120 

130 140 150 160 170 180 

SVLLQRRAVPIINENDTVSVEELKIGDNDTLSAQVAAMIQADLLVLLTDIDGLYTGNPNS 

lllllllhlllMMIIIMIIIllIIMMIMMIlNMIIIIMMIMIMII! 

SVLLQRRAIPIINENDTVSVEELKIGDNDTLSAQVAAMIQADLLVLLTDIDGLYTGNPNS 
130 140 150 160 170 130 

190 200 210 220 230 240 

NPDAVRLDKI EHINHEI I EMAGGSGSANGTGGMLTKI KAAT I AAESGVPVYI CSSLKPDA 

lllllllllllMIIIIIIIIIIiilllllllllMllMlllllllllMlinilil: 

NPDAVRLDKI EHINHEI I EMAGGSGSANGTGGMLTKI KAAT I AAESGVPVYI CSS LKPDS 
190 200 210 220 230 240 

250 260 270 280 290 300 

U^AAEHQADGSFFVPRAKGLRTOKQWIAFYSESRGSVYVDEGAEHALSEQGKSLLMSGI 

iiiiiMiiiiiiiiiiiiiiiiiiMiiiiiii nmiMiimni!: 

LAEAAEHQADGSFF^PRAKGLRTOKQWLAFYSESGGSVYVDESAEHALSEOGKACX 
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250 260 270 280 290 300 

ml 3 5. pep LAEAAEHQADGSFFVPRAKGLRTQKOWLAFYSESRGSVYVDEGAEHALSEQGKSLLMSGI 
t i It I : : 1 I i 1 ! ! I 1 t ) ! I ) M M I ! I 1 It II ! ! i t : I It ! I ! 1 I I t i ! ! 1 I M I t It I ! 
a 135 LAEAADNQADGSFFVPRAKGLRTQKQWLAFYSESRGGVYVDEGAEHALSEQGKSLLMSGi 

250 260 270 290 290 300 

310 320 330 3*50 350 360 

ml 35. pep AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKSRKAKGVFIHRDDWISITP 
I I t I ! II i I I 1 I I I i If I t I 1 M i 1 i I I I I ! M I I i I i I I I M II I I I \ II i 1 I I I I I I 
al35 AGIEGHFSRMDTVTVYSKATKQPLGKGRVLFGSAAAEDLLKLRKAKGVFIHRDDWISITP 

310 320 330 340 350 360 

370 

ml35.pep EIRLLLTE FX 

I M I I 1 I I I I 
al35 EIRLLLTEFX 
370 



The following partial DNA sequence was identified in A', gonorrhoeae <SEQ ID 523>: 

g!36 . seq 

1 ATGGAAATCC GGTTTCAGAC AGCATTTTTA CGTTTGGTTC AGatgaAAAC 

51 AAACGCTtca aTTCTtaccg caACACGCCT TGTATTTCCT GccgCTGCCG 

101 CACGGACAGG GATCGTTCCT GCCGgtTTTT TCCCCTTCCC TGCGGACGGT 

151 TTGCGGTTTG TTGATGACCG CCTGCCAGTA GCGGTAGATG TCtgccagcg 

201 cgTAAGGCag tTCGGAcgca agttccgcca gctcgccttc ggTGAATTGC 

251 AGgcggataa cgccgtttTC CTCTTCGTCg taaatgccgc ccactgccat 

301 cacgGGGTAA AACAGCTCTT CAAACGCTTC ATCATCGGCG GCTTCAAACC 

351 AATCGGTCGG CACAATGTCC AAACCGTAAA GATAGGCGTT GCACCAAGTG 

401 TAAAAATCGC TGCCGCCCTC GCCGTCGTCG TAGAGCCACA AATCGGGCAG 

4 51 CTTTTTATCC GACATCGCGG CGGTTGTTTC CATCGCCATT GCCAAAACCA 

501 GCCGTTCGAT TTCGGAACGT TCGGCGGCGG TAAATTGCGA TTCGTCGCCC 

551 AACACTTCGG GCAGCCAGTC GAGCGGTGCC AATTTGTCCG GCCCGCTCAA 

601 CAGCGCCGTC ATAAAACCTT GAACCTCGTC GCAACGCATC GTGTTGCCTT 

651 GTTCGCTTTT GGCATCCAAT AA 

This corresponds to the amino acid sequence <SEQ ID 524; ORF 136.ng>: 

gl36 . pep 

1 MEIRFQTAFL RLVQMKTNAS ILTATRLVFP AAAARTGIVP AGFFPFPADG 

51 LRFVDDRLPV AVDVCQRVRQ FGRKFRQLAF GELQADNAVF LFWNAAHCH 

101 HGVKOLFKRF IIGGFKPIGR HNVQTVK IGV APSVKIAAAL AVW EPQIGQ 

151 LFIRHRGGCF HRHCQNQPFD FGTFGGGKLR FVAQHFGQPV ERCQFVRPAQ 

201 QRRHKTLNLV ATHRVALFAF GIQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 525>: 



ml36 .seq 












1 


ATGGAAACAA 


ACGCTTCAAT 


TCTTACCGCA 


ACACGCCTTG 


TATTTTCTGC 


51 


CGCTGCCGCA 


CGGACAGGGA 


TCGTTCCTGC 


CTGTTTTTTC 


GCCTTCCCTG 


101 


CGGACGGTTT 


GCGGTTTGTT 


GATGACTGCC 


TGCCAGTAGC 


GGTAGATATC 


151 


CGCCAATGCA 


TAAGGCAACT 


CGGATTCCAG 


TTCCGCCAGC 


TCGCCTTCTG 


201 


TGAATTGCAG 


ACGGATAGCG 


CCGTTTTCCT 


CTTCGTCGTA 


AATACCGCCC 


251 


AATGCCATGA 


TGGGATAAAA 


CAACTCTTCA 


AACGCTTCAT 


CATCGACGGC 


301 


TTCAAACCAA 


TCGGTCGGCA 


CAATATCCAA 


ACCGTAAAGA 


TAAGCATTGC 


351 


ACCATGTGTA 


AAAATCGCTG 


CCGCCGTCTT 


CGTTTTCATA 


CAGCCACAAA 


401 


TCGGGCAGTT 


TTTTATCCGA 


CATCGCGGCG 


GTTGTTTCCA 


TCGCCATTGC 


451 


CAAAACCAGC 


CGTTCGATTT 


CGGAACGTTC 


GGCGGCGGTA 


AATTGCGATT 


501 


CGTCGCCCAA 


CACTTCGGGC 


AGCCAGTCGA 


GCGGTGTCAA 


TTTGTCCGGC 


551 


CCGCTCAACA 


GCGCCGTCAT 


AAAACCTTGA 


ACCTCGTCGC 


AACGCATCGT 


601 


GTTGCCTTGT 


TCGCTTTTGG 


CATCCAACAA 


TTCGCTCAAC 


CGCCGTTTGG 


651 


ATGCTTCGGT 


AAATTTTCGG 


GAATCCATCA 


TTTTCCTTTT 


CAAATGGGTT 


701 


TTGCGCCCTA 


TTATCGCCGC 


AATGCCGTCT 


GA 





This corresponds to the amino acid sequence <SEQ ID 526; ORF 136>: 
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1 METNASILTA TRLVFSAAAA RTGIVPACFF AFPADGLRLV DDRLPVAVDI 

51 R QC I RQLGFQ FRQ LAFCELQ TDSAWLFVV HTAQCHDGIK QLFKRFIIDG 

101 FKPIGRHNIQ TVK ISIAPCV KI AAAVFVFI QPQIGQFFIR HRGGCFHRHC 

151 QNQPFDFGTF GGGKLRFVAQ HFGQPVERCQ FVRPAQQRRH KTLNLVATHR 

201 VALFAFGIQQ FAQPPFGCFG KFSGIHHFPF PMGFAPYYSG LNLNQDKATK 

2 51 PQTVQIVRQG EATPYWFKFN PLYRRNAV* 



m!36/al36 98.3% identity in 238 aa overlap 

10 20 30 40 50 60 

ml 3 6. pep METNAS I LTATRLVFSAAAARTG I VPACFFAFPADGLRFVDDCLPVAVDI RQCIRQLGFQ 

H ! 1 [ I t I I I ! I [ M ! ! I I I I M I I I I I I M I I ! I I I I : t i ! I ! I 1 I ( I ! I I 1 I I f [ I t 
a!3& METNASILTATRLVFSAAAARTG IVPACFFAFPADGLRLVDDRLPVAVDIRQCIRQLGFQ 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 3 6. pep FRQLAFCELQTDSAVFLFVVNTAQCHDGIKQLFKRFI IDGFKPIGRHNIQTVKISIAPCV 

I I f I M I i I I I I I M I If I I I 1 I I i 1 f I M I M I ! I f I i I I I I I II I i I I I II I I I I I I 
al3 6 FRQLAFCELQTDSAVVLFV VNTAQCHDG I KQLFKRFI IDGFKPIGRHNIQTVKISIAPCV 

"0 80 90 100 110 120 



130 140 150 160 170 180 

ml36.pep KIAAAVFVFIQPQIGQFFIRHRGGCFHRHCQNQPFDFGTFGGGKLRFVAQHFGQPVERCQ 
j I I I I M I I I I M M I I I M I I II 1 I M I i I 1 I I 1 1 I I I M 1 I I I M I I I! I I I M I I I I 
al36 KIAAAVFVFIQPQIGQFFIRHRGGCFHRHCQNQPFDFGTFGGGKLRFVAQHFGQPVERCQ 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 3 6. pep FVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIHKFPFQMGFAPYYRR 

I I | j I 1 i I i II M I I I II i ! I I I I ) II I I II I IS 11 I 1 I I I \ \ I i I 1 I M I 11 I I I 1 
a l36 FVRPAQQRRHKTLNLVATHRVALFAFGIQQFAQPPFGCFGKFSGIHHFPFPMGFAPYYSG 

190 200 210 220 230 240 



ml 3 6. pep NAVX 

al3 6 LMLNQDKATKPQTVQIVRQGEATPYWFKFKPLYRRNAVX 
250 260 270 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 529>: 



gl37. seq 












1 


ATGATTATCC 


ATCACcaaTT 


CGATCCCGTC 


CTCATCAGTA 


TCGGCCCGCT 


51 


TGCCGTCCGC 


TGGTATGCCT 


TAAGCTACAT 


CCTCGGATTT 


ATTCTTTTTA 


101 


CCTTTCTCGG 


CAGAAGGCGC 


ATCGCGCAAG 


GCTTGTCCGT 


TTTTACCAAA 


151 


GAATCGCTCG 


ACGACTTCCT 


GACATGGGGC 


ATTTTGGGCG 


TGATTTTGGG 


201 


CGGACGCTTG 


GGCTATGTCC 


TGTTTTACAA 


ATTCTCCGAC 


TACCTCGCCC 


251 


ATCCGCTTGA 


TATTTTCAAG 


GTATGGGAAG 


GCGGAATGTC 


GTTCCACGGC 


301 


GGCTTTTTGG 


GTGTAGTTAT 


TGCCATATGG 


TTGTTCAGCC 


GCAAGCACGG 


351 


CATCGGCTTC 


CTCAAACTGA 


TGGACACGGT 


CGCGCCGCTC 


GTTCCGCTGG 


401 


GTCTCGCTTC 


GGGACGTATC 


GGCAACTTTA 


TCAACGGCGA 


ACTTTGGGGA 


451 


CGCATTACCG 


ACATTAACGC 


ATTTTGGGCA 


ATGGGCTTCC 


CGCAAGCGCA 


501 


TTACGAAGAT 


GCCGAAGCCG 


CCGCGCACAA 


TCCGCTTTGG 


GCAGAATGGC 


551 


TGCAACAATA 


CGGTATGCTG 


CCGCGTCATC 


CCTCGCAGCT 


TTATCAGTTT 


601 


GCCCTTGAAG 


GCATCTGCCT 


GTTCGCCGTC 


GTTTGGCTGT 


TTTCCAAAAA 


651 


ACCGCGCCCG 


ACCGGGCAGA 


CTGCCGCGCT 


TTTTCTCGGC 


GGCTACGGCG 


701 


TGTTCCGCTT 


TATTGCCGAA 


TTTGCGCGCC 


AACCCGACGA 


CTATCTCGGG 


751 


CTGCTGACCT 


TGGGGCTGTC 


GATGGGGCAA 


TGGTTGAGCG 


TCCCGATGAT 


801 


TGTTTTGGGT 


ATCGTCGGCT 


TTGTCCGGTT 


CGGCATGAAA 


AAACAGCACT 


851 


GA 











This corresponds to the amino acid sequence <SEQ ID 530; ORF 137.ng>: 

g!3 7 . pep 
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lillllliKlllllllllilflllllhhIMII! I l!h|:||liii|:jl!lll 

g 1 3 7 AEWLOOYGMLPRHPSOLYQFALEGI CLFAVWLFS KKPRPTGQTAALFLGGYGVFRFI AE 

190 200 210 220 230 240 



250 260 270 280 

ml 3 7 . pep FARQPDDYLGLLTLGLSMGQWLSVPMIVLGIVGFVRFGMKKQHX 

! [ I ! I ! I I I i I II I I I ! I I I I I i i i I I I II I I I I I I I ( I i I I I I 
g 1 3 7 FARQPDDYLGLLTLGLSMGQWLSVPMI VLGI VGFVRFGMKKQHX 

250 260 270 280 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 533>: 

al37 . seq 

1 ATGATTACCC ATCCCCAATT CGACCCCGTC CTTATCAGTA TCGGCCCGCT 

51 TGCCGTCCGC TGGTATGCCC TAAGCTACAT CCTCGGATTT ATTCTTTTTA 

101 CCTTTCTCGG CAGAAGGCGC ATCGCGCAAG GCTTGTCCGT TTTTACCAAA 

151 GAATCGCTCG ACGACTTCCT GACATGGGGC ATTTTGGGCG TAATTTTGGG 

201 CGGGCGTTTG GGTTACGTCC TGTTTTACAA GTTTTCCGAC TACCTCGCCC 

251 ATCCGCTTGA TATTTTCAAG GTATGGGAAG GCGGAATGTC GTTCCACGGC 

301 GGCTTTTTGG GTGTAGTTAT TGCCATATGG TTGTTCGGTC GCAAACACGG 

351 CATCGGCTTC CTCAAACTGA TGGACACGGT CGCACCGCTC GTTCCACTGG 

401 GTCTCGCTTC GGGACGTATC GGCAACTTCA TCAACGGCGA ACTTTGGGGA 

4 51 CGCGTTACCG ACATCAACGC ATTTTGGGCA ATGGGCTTCC CGCAGGCGCG 

501 TTACGAAGAC CTCGAAGCCG CCGCGCACAA TCCGCTTTGG GCAGAATGGC 

551 TGCAACAATA CGGTATGCTG CCGCGTCATC CCTCGCAGCT TTATCAGTTT 

601 GCACTTGAAG GCATCTGCCT GTTCGCCGTC GTTTGGCTGT TCTCTAAAAA 

651 ACAGCGGCCG ACCGGACAAG TCGCCTCACT CTTCCTCGGC GGCTACGGCA 

701 TATTCCGCTT CATTGCCGAA TTTGCACGCC AACCCGACGA CTATCTCGGG 

751 CTGCTGACCT TGGGGCTGTC GATGGGGCAA TGGTTGAGCG TCCCGATGAT 

801 TGTTTTGGGT ATCGTCGGCT TTGTCCGGTT CGGCATGAAA AAACAGCACT 

851 GA 

This corresponds to the amino acid sequence <SEQ ID 534; ORF 137.a>: 

al37 . pep 

1 MITHPQFDPV LISIGPLAVR WYALSYILGF ILFTFLG RRR IAQGLSVFTK 

51 ESLDDFLTWG ILGVILGGRL GYVLFYKFSD YLAHPLDIFK VWEGGMSFHG 

101 GFLGWIAIW LF GRKHGIGF LKLMDTVAPL VPLGLASGRI GNFINGELWG 

151 RVTDIMAFWA MGFPQARYED LEAAAHNPLW AEWLQQYGML PRHPSQLYQF 

201 ALEGICLFAV VWL F5KKQRP TGQVASLFLG GYGIFRFIAE FARQPDDYLG 

251 LLTLGLSMGQ W LSVPMTVLG IVGFVRFG MK KQH* 

m!37/al37 98.2% identity in 283 aa overlap 

10 20 30 40 50 60 

m!37 .pep MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 
I I I I I M I I i ! 1 I 1 II t i II I I 1 11 I 1 I I II 1 I 1 i i II i i 1 ! } I 1 t I 1 1 I I M I I t 1 t I I 
al37 MITHPQFDPVLISIGPLAVRWYALSYILGFILFTFLGRRRIAQGLSVFTKESLDDFLTWG 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml37.pep ILGVILGGRLGYVLFYKFSDYLAHPLDIFKVWEGGMSFHGGFLGWI AIRLFGRKHGIGF 

I II I M I I I I I I I I I ( I I M I II I I I i I I I I I I I I ! ( I I I I I M M i I I ( I I t I I 1 I ( I 
a 1 37 I LGVI LGGRLGYVLFYKFSDYLAHPLDI FKVWEGGMS FHGG FLGWIAIWLFGRKHGIGF 

70 30 90 100 110 120 



130 140 150 160 170 180 

ml 37 .pep LKLMDTVAPLVPLGLASGRIGNFINGELWGRVTDINAFWAMGFPQARYEDAEAAAHNPLW 
I II I 1 I I M i I I I I II I ! M I I I I M I I II I I II I I I I I I II I I I 1 I I II I I I I 1 i I I I 
al37 LKLMDTVAPLVPLGLASGRIGNFINGELWGRVTDINAFWAMGFPQARYEDLEAAAHNPLW 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml37 . pep AEWLQQYGMLPRHPSQLYQFALEGICLFTVIWLFSKKQRSTGQVASLFLGGYGI FRFIAE 

I I I I I I II I I I I I I I I I I I I I I i I I I I I : I : I I I II I M M I If I I I I I M I I I I I I | j 
al37 AEWLQQYGML PRHPSQLYQFALEG I CL FA WWLFSKKQRPTGQVASLFLGGYG I FRFIAE 
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51 RD WLLKLVG IHPVIVHGG G PQINAMLEKV GKKGEFVQGM RVTDKEAMDI 

1G1 VEMVLGGHVN KEIVSMINTY GGHAVGVSGR DDHFIKAKKL LIDTPEQNGV 

151 DIGQVGTVES IDTGLVKGLI ERGCIPWAP VGVGEKGEAF NINADLVAGK 

201 LAEELNAEKL LMMTNIAGVM DKTGNLLTKL TPKRIDELIA DGTLYGGMLP 

2 51 KIASAVEAAV NGVKATHIID GRLPN ALLLE I FTDAGIGSM IL GGGEPA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted QRP from N. gonorrhoeae 

ORF 138 shows 98.0% identity over a 298 aa overlap with a predicted ORF (ORF 138.ng) 

from N. gonorrhoeae: 
ml38/gl38 



10 20 30 40 50 60 

ml 3 8 . pep MESENI I SAADKARI LAEALPYI RRFSGSVAVI KYGGNAMTEPALKEGFARDWLLKLVG 

II IMIIIIIMIIIMNIIIIIIIIIIMIIINMIIIIMIIIIIIIIIIIIIII 

gl3 8 MEFENIISAADKARIIAEALPYIRRFSGSVAVIKYGGNAMTEPAliKEGFARDVVLLKLVG 
10 20 30 40 50 60 



70 80 90 100 110 120 

ml38 . pep IHPVIVHGGGPQINAMLEKVGKKGEFVCOmVTO 

1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 

gl 3 8 I H P V I VHGGG PQ I N AMLE KVGKKGE FV QGMRVTDKETMD I VEMVLGGHVNKE I VSM I NT Y 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml38 .pep GGHAVGVSGRDDHFIKAKKLLIDTPEONGVDIGQVGTVES IDTGLVKGLI ERGCI PWAP 

lllllllllllllllllIllhlllllhlllMIIMINIIIMIIIIIIIIIIIIIl 

gl3 8 GGHAVGVSGRDDHF I KAKKLLVDT P EQNS VD I GQVGTVE S I DTGLV KGL I E RGC I PWAP 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 3 8 . pep VG VG EKG E AFN I NADLVAG KLAEE LN AE KLLMMTN I AG VMD KTGN LLT KLT PKRIDELIA 

IIIIIIIIIIIIIMIIIIIIIIIIMIilllllllllllllllllllllllllll III 

g 1 3 8 VGVGEKGEAFN I NADLVAG KLAEE LNAE KLLMMTN I AGVMD KTGNLLTKLT PKRIDGLIA 

190 200 210 220 230 240 



250 260 270 280 290 299 

ml 3 8 . pep DGTL YGGMLPKI AS AVEAAVNGVKATH 1 1 DGRLPNALLLE I FTDAG I GSMI LGGGEDAX 

MIIIIIIIIMIIIIIIIIIIIIIIMIIIIIMIIMIMIIIMIIIIil lllll 

gl3 8 DGTLYGGMLPKIASAVEAAVNGVKATHI IDGRLPNALLLEI FTDAGIGSMI LGRGEDAX 

250 260 270 280 290 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 539>: 

al38 . seq 

1 ATGGAGTCTG AAAACATTAT TTCCGCCGCC GACAAGGCGC GTATCCTTGC 

51 CGAAGCGCTG CCTTACATCC GCCGGTTTTC CGGTTCGGTC GCCGTCATCA 

101 AATACGGCGG CAACGCGATG ACCGAACCTG CCTTGAAAGA AGGGTTTGCC 

151 CGCGATGTCG TGCTGCTGAA GCTGGTCGGC ATTCATCCCG TCATCGTTCA 

201 CGGCGGCGGG CCGCAGATCA ATGCGATGCT TGAAAAAGTC GGCAAAAAGG 

251 GTGAGTTTGT CCAAGGAATG CGCGTTACCG ACAAAGAGGC GAT GG AT ATT 

301 GTCGAAATGG TGTTGGGCGG GCATGTCAAT AAAGAAATCG TGTCGATGAT 

351 TAACACATAT GGCGGACACG CGGTCGGCGT AAGCGGACGC GACGACCATT 

4 01 TCATTAAGGC GAAGAAACTT TTGATCGATA CGCCCGAACA GAATGGCGTG 

4 51 GACATCGGAC AGGTCGGTAC GGTGGAAAGC ATCGATACCG GTTTGGTTAA 

501 AGGGCTGATA GAACGTGGCT GCATTCCCGT CGTCGCCCCC GTCGGCGTAG 

551 GTGAAAAAGG CGAAGCGTTC AACATCAACG CCGATTTGGT AGCAGGCAAA 

601 TTGGCGGAAG AATTGAACGC CGAAAAACTC TTGATGATGA CGAAT AT CGC 

651 CGGTGTGATG GACAAAACGG GCAATCTGCT GACCAAACTC ACGCCGAAAC 

7 01 GGATTGATGA ACTGATTGCC GACGGCACGC TGTATGGCGG TATGCTGCCG 

751 AAAATCGCTT CTGCGGTCGA AGCCGCCGTC AACGGCGTGA AAGCCACGCA 

801 TATCATCGAC GGCAGGGTGC CCAACGCGCT TTTGCTGGAA ATCTTTACCG 

851 ATGCCGGTAT CGGTTCGATG ATTTTGGGCG GTGGGGAAGA TGCCTGA 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 543>: 



ml3 9 . seq 












1 


ATGCGAACGA 


CCCCAACCTT 


CCCTACAAAA 


ACTTTCAAAC 


CGACTGCCAT 


51 


GGCGTTAGCT 


GTTGCAACAA 


CACTTTCTGC 


CTGCTTAGGC 


GGCGGCGGAG 


101 


GCGGCACTTC 


TGCGCCCGAC 


TTCAATGCAG 


GCGGTACCGG 


TATCGGCAGC 


151 


AACAGCAGAG 


CAACAACAGC 


GAAATCAGCA 


GCAGTATCTT 


ACGCCGGTAT 


201 


CAAGAACGAA 


ATGTGCAAAG 


ACAGAAGCAT 


GCTCTGTGCC 


GGTCGGGATG 


251 


ACGTTGCGGT 


TACAGACAGG 


GATGC CAAAA 


TCAATGCCCC 


CCCCCGAATC 


301 


TGCATACCGG 


AGACTTTCCA 


AACCCAAATG 


ACGCATtACA 


AGAATTTGAT 


351 


CAACCTCAAA 


CCTGCAATTG 


AAGCAGGCTA 


TACAGGACGC 


GGGGTAGAGG 


401 


TAGGTATCGT 


CGACACAGGC 


GAATC CGTCG 


GCAGC AT AT C 


CTTTCCCGAA 


451 


CTGTATGGCA 


GAAAAGAACA 


CGGCTATAAC 


GAAAATTACG 


AAAAACTATA 


501 


CGGCGTATAT 


GCGGAAGGAA 


GCGCCTGA 







This corresponds to the amino acid sequence <SEQ ID 544; ORF 138>: 

ml 3 9 . pep 

1 MRTTPTFPTK TFKPTAMA LA VATTLSACLG GGGGGTSAPD FNAGGTGIGS 

51 NSRATTAKSA AVSYAGIKNE MCKDRSMLCA GRDDVAVTDR DAKINAPPRI 

101 CIPETFQTQM THYKNLINLK PAIEAGYTGR GVEVGIVDTG ESVGSISFPE 

151 LYGRKEHGYN ENYEKLYGVY AEGSA* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 138 shows 92.2% identity over a 179 aa overlap with a predicted ORF (ORF 138.ng) 
from N. gonorrhoeae; 

ml39/gl39 

10 20 30 40 50 60 

MRTTPTFPTKTFKPTAMALAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATTAKSA 

III! IMI!!l!l:!IIIIIMIII!MIM!!l!ilil!lillllll!lll!i Ml 

MRTTSTFPTKTFKPAAMAIiAVATTLSACLGGGGGGTSAPDFNAGGTGIGSNSRATIAESA 
10 20 30 40 50 60 

70 80 90 100 110 120 

AVSYAGIKNEMCKDRSMLCAGRDDVAVTDRDAKINAPPRICIPETFQTQMTHYKNLINLK 

1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 ! I i I I I I I I I I I I ! I i I I : 1 1 1 1 1 1 1 i I I I I I I I : I I : 1 I! I 
AVSYAG I KNEMCKDRSMLCAGRDDVAVTDRDAKI KAP -RICI PETFQTQMTNI KNMINLK 
70 80 90 100 110 

130 140 150 160 170 
P AI EAG YTGRGVEVG I VDTGES VGS I S FPELYGRKEHGYNENY EKLYGVYAEGSAX 

MimmiiiimiiiimimiiiiimmiiiM himmiiiiii 

PAIEAGYTGRGVEVGIVDTGESVGSISFPELYGRKEHGYNENYKNKLQKLYGVYAEGSAX 
120 130 140 150 160 170 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 545>: 

al39.seq 

1 ATGCGAACGA CCCCAACCTT CCCTACAAAA ACTTTCAAAC CGGCTGCCAT 

51 GGCGTTAGCT GTTGCAACAA CACTTTCTGC CTGCTTAGGC GGCGGCGGAG 

101 GCGGCACTTC TGCGCCCGAC TTCAATGCAG GCGGCACCGG TATCGGCAGC 

151 AACAGCAGGG CAACAACAGC GAAATCAGCA GCAATATCTT ACGCCGGTAT 

201 CAAGAACGAA ATGTGCAAAG ACAGAAGCAT GCTCTGTGCC GGTCGGGATG 

251 ACGTTGCGGT TACAGACAGG GATGCCAAAA TCAATGCCCC CCCCCGAATC 

301 TGCATACCGG AGACTTTACA AACCCAAATG ACGCAT.ACA AGAATTTGAT 

351 CAACCTCAAA CCTGCAATTG AAGCAGGCTA TACAGGACGC GGGGTAGAGG 

401 TAGGTATCGT CGACACAGGC GAATCCGTCG GCAGCATATC CTTTCCCGAA 

451 CTGTATGGCA GAAAAGAACA CGGCTATAAC GAAAATTAC . AAAAACTATA 

501 CGGCGTATAT GCGGAAGGAA GCGCCTGA 

This corresponds to the amino acid sequence <SEQ ID 546; ORF 139.a>: 

al39.pep 

1 MRTTPTFPTK TFKPAAMALA VATTLSA CLG GGGGGTSAPD FNAGGTGIGS 



ml 3 9 -pep 
g!39 

ml 3 9 . pep 
gl39 

ml39 .pep 
gl39 
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2 51 DSISLFAGIR HDVGDIGYLK GLFSYGRYKN SISRSTGADE YAEGSVNGTL 
301 MQLGALGGVN VPFAATGDLT VEGGLRHDLL KQDAFAEKGS ALGWSGNSLT 
351 EGTLVGLAGL KLSQPLSDKA VLSATAGVER DLNGRDYAVT GGFTGAAAAT 
401 GKTGARNMPH TRRVAGLGVD VEFGNGWNGL ARYSYTGSKQ YGNKSGQIGV 
451 GYRF* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 549>; 

ml40 . seq 

1 ATGTCGGCAC GCGGCAAGGG GGCAGGCTAT CTCAACAGTA CCGGACGACG 

51 TGTTCCCTTC CTGAGTGCCG CCAAAATCGG GCAGGATTAT TCTTTCTTCA 

101 CAAACATCGA AACCGACGGC GGCCTGCTGG CTTCCCTCGA CAGCGTCGAA 

151 AAAACAGCGG GCAGTGAAGG CGACACGCTG TCCTATTATG TCCGTCGCGG 

201 CAATGCGGCA CGGACTGCTT CGGCAGCGGC ACATTCCGCG CCCGCCGGTC 

251 TGAAACACGC CGTAGAACAG GGCGGCAGCA ATCTGGAAAA CCTGATGGTC 

301 GAACTGGATG CCTCCGAATC ATCCGCAACA CCCGAGACGG TTGAAACTGC 

3 51 GGCAGCCGAC CGCACAGATA TGCCGGGCAT CCGCCCCTAC GGCGCAACTT 
401 TCCGCGCAGC GGCAGCCGTA CAGCATGCGA ATGCCGCCGA CGGTGTACGC 
451 ATCTTCAACA GTCTCGCCGC TACCGTCTAT GCCGACAGTA CCGCCGCCCA 
501 TGCCGATATG CAGGGACGCC GCCTGAAAGC CGTATCGGAC GGGTTGGACC 
551 ACAACGGCAC GGGTCTGCGC GTCATCGCGC AAACCCAACA GGACGGTGGA 
601 ACGTGGGAAC AGGGCGGTGT TGAAGGCAAA ATGCGCGGCA GTACCCAAAC 
651 CGTCGGCATT GCCGCGAAAA CCGGCGAAAA TACGACAGCA GCCGCCACAC 
701 TGGGCATGGG ACGCAGCACA TGGAGCGAAA ACAGTGCAAA TGCAAAAACC 
751 GACAGCATTA GTCTGTTTGC AGGCATACGG CACGATGCGG GCGATATCGG 
801 CTATCTCAAA GGCCTGTTCT CCTACGGACG CTACAAAAAC AGCATCAGCC 
851 GCAGCACCGG TGCGGACGAA CATGCGGAAG GCAGCGTCAA CGGCACGCTG 
901 ATGCAGCTGG GCGCACTGGG CGGTGTCAAC GTTCCGTTTG CCGCAACGGG 
951 AGATTTGACG GTCGAAGGCG GTCTGCGCTA CGACCTGCTC AAACAGGATG 

1001 CATTCGCCGA AAAAGGCAGT GCTTTGGGCT GGAGCGGCAA CAGCCTCACT 

10 51 GAAGGCA CGC TGGTCGGACT CGCGGGTCTG AAGCTGTCGC AACCCTTGAG 

1101 CGATAAAGCC GTCCTGTTTG CAACGGCGGG CGTGGAACGC GACCTGAACG 

1151 GACGCGACTA CACGGTAACG uGCGG CTTTA CCGGCGCGhC TGCAGCAACC 

1201 GGCAAGACGG GGGCACGCAA TATGCCGCAC ACCCGTCTGG TTGCCGGCCT 

12 51 GGGCGCGGAT GTCGAATTCG GCAACGGCTG GAACGGCTTG GCACGTTACA 
1301 GCTACGCCGG TTCCAAACAG TACGGCAACC ACAGCGGACG AGTCGGCGTA 

13 51 GGCTACCGGT TCTGA 



This corresponds to the amino acid sequence <SEQ ED 550; ORF 


140>: 


ml40 -pep 












1 


MSARGKGAGY 


LNSTGRRVPF 


LSAAKIGQDY 


SFFTNIETDG 


GLLASLDSVE 


51 


KTAGSEGDTL 


SYYVRRGNAA 


RTASAAAHSA 


PAGLKHAVEQ 


GGSNLENLMV 


101 


ELDASESSAT 


PETVETAAAD 


RTDMPGIRPY 


GATFRAAAAV 


QHANAADGVR 


151 


I FNSLAATVY 


ADSTAAHADM 


QGRRLKAVSD 


GLDHNGTGLR 


VIAQTQQDGG 


201 


TWEQGGVEGK 


MRGSTQTVGI 


AAKTGENTTA 


AATLGMGRST 


WSENSANAKT 


251 


DSISLFAGIR 


HDAGDIGYLK 


GLFSYGRYKN 


SISRSTGADE 


HAEGSVNGTL 


301 


MQLGALGGVN 


VPFAATGDLT 


VEGGLRYDLL 


KQDAFAEKGS 


ALGWSGNSLT 


351 


EGTLVGLAGL 


KLSQPLSDKA 


VLFATAGVER 


DLNGRDYTVT 


GGFTGATAAT 


401 


GKTGARNMPH 


TRLVAGLGAD 


VEFGNGWNGL 


ARYSYAGSKQ 


YGNHSGRVGV 


451 


GYRF* 











Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 140 shows 94.5% identity over a 454 aa overlap with a predicted ORF (ORF 140.ng) 
from N. gonorrhoeae: 

ml40/gl40 

10 20 30 40 50 60 

ml 4 0 . pep MSARGKGAGYLNSTGRRVPFLSAAKIGQDYSFFTNIETDGGLLASLDSVEKTAGSEGDTL 

I I I I i I I ! I I I I I I I I : I i I I I I I I I I I II I I I I I - I I I I I I I I I I I I I I I I I I I I I I 
gl4 0 MSARGKGAGYLNSTGRHVPFLSAAKIGQDYSFFKNIKTDGGLLASLDSVEKTAGSEGDTP 
10 20 30 40 50 60 

70 80 90 100 110 120 
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951 AGATTTGACG GTCGAAGGCG GTCTGCGCTA CGACCTGCTC AAACAGGATG 

1001 CATTCGCCGA AAAAGGCAGT GCTTTGGGCT GGAGCGGCAA CAGCATCACT 

1051 GAAGGCACAC TGGTCGGACT CGCGGGTCTG AAGCTGTCGC AACCCTTGAG 

1101 CGATAAAGCC GTCCTGTTTG CAACGGCGGG CGTGGAACGC GACCTGAACG 

1151 GACGCGACTA CACGGTAACG GGCGGCTTTA CCGGCGCGAC TGCAGCAACC 

12 01 GGCAAGACGG GGGCACGCAA TATGCCGCAC ACCCGCCTGG TTGCCGGTCT 

12 51 GGGCGCGGAT GTCGAATTCG GCAACGGCTG GAACGGCTTG GCACGTTACA 

1301 GCTACGCCGG TTCCAAACAG TACGGCAACC ACAGCGGACG AGTCGGCGTA 

1351 GGCTACCGGT TCTGA 



This corresponds to the amino acid sequence <SEQ ID 552; ORF 


al 4 0 . pep 










1 


MSAGGKGAGY 


LNRTGQRVPF 


LSAAKIGRDY 


SFFTNIETDG 


51 


KTAGSEGDTL 


SYYVRRGNAA 


RTASAAAHSA 


PAGLKHAVEQ 


101 


ELDASESSAT 


PETVETAAAD 


RTDMPGIRPY 


GAT FRAAAAV 


151 


I FNNLAATVY 


ADST^AHADM 


QGRRLKAVSD 


GLDHNATGLR 


201 


TWEQGGVEGK 


MRGSTQTVGI 


AAKTGENTTA 


AATLGMGHST 


251 


DSISLFAGIR 


HDAGDIGYLK 


GLFSYGRYKN 


SISRSTGADE 


301 


MQLGALGGVN 


VPFAATGDLT 


VEGGLRYDLL 


KQDAFAEKGS 


351 


EGTLVGLAGL 


KLSQPLSDKA 


VLFATAGVER 


DLNGRDYTVT 


401 


GKTGARNMPH 


TRLVAGLGAD 


VEFGNGWNGL 


ARYSYAGSKQ 


451 


GYRF* 









VIAQTQQDGG 
WSENSANAKT 
HAEGSVNGTL 
ALGWSGNSIT 
GGFTGATAAT 



ml40/al40 98.2% identity in 454 aa overlap 

10 20 30 40 50 60 

ml 4 0. pep MSARGKGAGYLNSTGRRVPFLSAAKIGQDY5FFTNIETDGGLLASLDSVEKTAGSEGDTL 
111 I I I I i I I I I I : I I I I II II I I I : I I I I i II I I I I I I I I I I M I I i I i i I ( I I I M 
al4 0 MSAGGKGAGYLNRTGQRVPFLSAAKIGRDYSFFTNIETDGGLLASLDSVEKTAGSEGDTL 

10 20 30 40 50 60 



70 80 90 100 110 120 

n.x40.pep S YY VRRGNAART AS AAAH S APAGLKHAVEQGGSN LEN LMVELDASE S SAT PETVETAAAD 

i i ! 11 I It I I I t ! II li I M I M I I I I I M 1 ! I I I III I I II I I i I I II I t i I t I II I I I 
al4 0 SYYVRRGNAAKTASAAAHSAPAGLKHAVEQGGSNLENLMVELDASES SAT PETVETAAAD 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 4 0 . pep RTDMPG I RPYGAT FRAAAAV QHANAADGVRI FN S LAATVY ADSTAAHADMQGRRLKAV S D 

I I I I I I I I I M I I I I I i I I j I I I I I I I I I I I M : I I I I I I i ! I I I M I I II I I I M I M ! 
al4 0 RT DM PG I RPYGAT FRAAAAVQHAN AADG VR I FNNLAATVYADST AAHADMQGRRLKAVS D 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 4 0 . pep GLDHNGTGLRVIAQTQQDGGTWEQGGVEGPCMRGSTQTVGIAAKTGENTTAAATLGMGRST 
I I I I ! : I I I I M I I II I I I I I I I I I If I I I I I f I I I If J I I I I I I I II I I I I M j I f : I I 
al4 0 GLDHNATGLRVIAQTQQDGGTWEQGGVEGKMRGSTQTVGIAAKTGENTTAAATLGMGHST 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 4 0 .pep WSENSANAKTDSISLFAGIRHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTL 
I I! ! I \ I I I II M I I i I U I ! I I 11 I II I I I I I I I I I I t I I I ! I I I I t I I I I I I S I M M 
al40 WSENSANAKTDSISLFAGIRHDAGDIGYLKGLFSYGRYKNSISRSTGADEHAEGSVNGTL 
250 260 270 280 290 300 



310 320 330 340 350 360 

ml 40 . pep MQLGALGGVNVPFAATGDLTVEGGLRYDLLKODAFAEKGSALGWSGNSLTEGTLVGLAGL 
I i II I II I I I M I I I M I I I 1 I I I I I I I II I I I 1 I I I I M I f II I I I I : I I I II I I 1 i M 
a 1 4 0 MQLGALGGVNVPFAATGDLTVEGGLRYDLLKQDAFAEKGSALGWSGNS ITEGTLVGLAGL 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 4 0 . pep KLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHTRLVAGLGAD 
I I I I I M I 1 i I ' I I II I I I 1 I I II I 1 I I I I t I I II I II I M I I I I I I M I I I I I 1 I 1 I | I 
al4 0 KLSQPLSDKAVLFATAGVERDLNGRDYTVTGGFTGATAATGKTGARNMPHTRLVAGLGAD 
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ml41 . seq 

1 ATGAGCTTCA AAACCGATGC CGAAATCGCC CAATCCTCCA CCATGCGCCC 

51 GATTGGCGAA ATTGCCGCCA AGCTTGGTCT GAA7GCCGAC AACATTGAGC 

101 CTTACGGTCA TTACAAGGCG AAAATCAATC CTGCCGAAGC GTTCAAACTG 

151 CCGCAAAAAC AGGGCAGGCT GATTTTGGTT ACCGCCATCA ACCCGACTCC 

201 GGCGGGCGAA GGCAAAACCA CCGTAACCAT CGGTTTGGCG GACGCGTTGC 

251 GCCACATCGG CAAAGATGCC GTGATTGCCC TGCGCGAACC TTCTCTGGGG 

3 01 CCGGTGTTCG GCGTGAAAGG CGGCGCGGCA GGCGGCGGCT ATGCCCAAGT 

351 TTTGCCGATG GAAGACATCA ACCTGCACTT CACCGGAGAT TTTCACGCCA 

401 TCGGTGCGGC AAATAATCTG CTTGCCGCGA TGCTCGACAA CCATATCTAC 

451 CAAGGCAACG AGTTGAACAT CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 

501 GGTCGATATG AACGACCGCC AGTTGCGCAA CAT CATC 3 AC GGCATGGGTA 

551 AACCCGTTGA CGGCGTGATG CGTCCTGACG GTTTCGATAT TACCGTTGCT 

601 TCCGAAGTGA TGGCGGTATT CTGTCTTGCC AAAGACATCA GCGATTTGAA 

651 AGAGCGTTTG GGCAACATCC TTGTCGCCTA CGCCAAAGAC GGCAGCCCCG 

701 TTTACGCCAA AG ATT TG AAA GCGAATGGCG CGATGGCGGC ATTGCTTAAA 

751 GATGCGATTA AGCCCAACTT GGTGCAAACC ATCGAAGGCA CGCCCGCCTT 

801 CGTACACGGC GGCCCGTTCG CCAACATCGC CCACGGCTGC AACTCCGTAA 

851 CCGCAACCCG TCTGGCGAAA CACCTTGCCG ATTACGCCGT AACCGAAGCA 

901 GGCTTCGGCG CGGACTTGGG CGCGGAAAAA TTCTGCGACA TCAAATGCCG 

951 CCTTGCCGGT TTGAAACCTG ATGCGGCTGT TGTCGTGGCG ACTGTCCGCG 

1001 CGTTGAAATA TAACGGCGGC GTGGAACGCG CCAACCTCGG CGAAGAAAAT 

1051 TTAGACGCTT TGGAAAAAGG TTTGCCCAAC CTGCTGAAAC ACATTTCCAA 

1101 CCTGAAAAAC GTATTCGGAC TGCCCGTCGT CGTTGCGCTC AACCGCTTCG 

1151 TGTCCGACGC CGATGCCGAG TTGGCGATGA TTGAAAAAGC CTGTGCCGAA 

1201 CT.CGGCGTTG AAGTTTCCCT GACCGAAGTG TGGGGCAAAG GTGGTGCGGG 

1251 CGGCGCGGAT TTGGCGCGCA AAGTCGTCAA CGCCATTGAA AGTCAAACCA 

1301 ATAACTTCGG TTTCGCCTAC GATGTCGAGT TGGGCATCAA AGACAAAATC 

13 51 CGTGCGATTG CCCAAAAAGT GTACGGCGCG GAAGATGTTG ATTTCAGCGC 

14 01 GGAAGCGTCT GCCGAAATCG CTTCACTGGA AAAACTGGGC TTGGACAAAA 
14 51 TGCCGATCTG CATGGCGAAA ACCCAATACT CTTTGAGCGA CAACGCCAAA 
1501 CTGTTGGGCT GCCCCGAAGA CTTCCGCATC GCCGTGCGCG GCATCACCGT 
1551 TTCCGCAGGC GCAGGTTTCA TCGTCGCCCT GTGCGGCAAC ATGATGAAAA 
16 01 TGCCCGGCCT GCCCAAAGTT CCGGCTGCCG AGAAAATCGA TGTGGACGCA 
1651 GAAGGCGTGA TTCACGGCTT GTTCTGA 

This corresponds to the amino acid sequence <SEQ ID 556; ORF 141>: 

ml41 -pep 

1 MSFKTDAEIA QSSTMRPIGE IAAKLGLNAD NIEPYGHYKA KINPAEAFKL 

51 PQKQGRLILV TAINPTPAGE GKTTVTIGLA DALRHIGKDA VIALREPSLG 

101 PVFGVKGGAA GGGYAQVLPM EDINLHFTGD FHAIGAANNL LAAMLDNHIY 

151 QGNELNIDPK RVLWRRWDM NDRQLRNIID GMGKPVDGVM RPDGFDITVA 

2 01 SEVMAVFCLA KDISDLKERL GNILVAYAKD GSPVYAKDLK ANGAMAALLK 
251 DAIKPNLVQT I EGTPAFVHG GPFANIAHGC NSVTATRLAK HLADYAVTEA 
301 GFGADLGAEK FCDIKCRLAG LKPDAAVWA TVRALKYNGG VERANLGEEN 

3 51 LDALEKGLPN LLKHISNLKN VFGLPVWAL NRFVSDADAE LAMIEKACAE 

4 01 HGVEVSLTEV WGKGGAGGAD LARKWNAIE SQTNNFGFAY DVELGIKDKI 

4 51 RAIAQKVYGA EDVDFSAEAS AEIA3LEKLG LDKMPICMAK TQYSLSDNAK 

5 01 LLGCPEDFR I AVRGITVSAG AGFIVA LCGN MMKMPGLPKV PAAEKIDVDA 
551 EGVIHGLF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 141 shows 97.5% identity over a 558 aa overlap with a predicted ORF (ORF 141.ng) 

from N. gonorrhoeae: 
ml41/gl4l 

10 20 30 40 50 60 

ml41 . pep MSFKTDAEIAQSSTMRPIGEIAAKLGLNADNIEPYGHYKAKINPAEAFKLPQKQGRLILV 

IIIIIIM INIIIIHIIIinillhllllltlllUllllillllMIIIIMIII 

gl41 MSFKTDAETAQSSTMRPIGEIAAKI^LN^NIEPYGKYKAKINPAEAFKLPQKOGRLILV 
10 20 30 40 50 60 
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401 



201 


GGCGGGCGAA 


GGTAAAACCA 


251 


GCCATATCGG 


CAAAGACTCT 


301 


CCGGTGTTCG 


GCGTGAAAGG 


351 


TTTGCCGATG 


GAAGACATCA 


4 01 


TCGGTGCGGC 


AAATAATCTG 


451 


CAAGGCAACG 


AGTTGAACAT 


501 


GGTCGATATG 


AACGACCGCC 


551 


AGCCTGTTGA 


CGGCGTGATG 


601 


TCCGAAGTGA 


TGGCGGTATT 


651 


AGAGCGTTTG 


GGCAACATCC 


701 


TTTACGCCAA 


AGATTTGAAA 


751 


GATGCGATTA 


AGCCCAACTT 


801 


CGTACACGGC 


GGCCCGTTCG 


851 


CCGCAACCCG 


TCTGGCGAAA 


901 


GGCTTCGGCG 


CGGACTTGGG 


951 


CCTTGCCGGT 


TTGAAACCTG 


1001 


CGTTGAAATA 


TAACGGCGGC 


1051 


TTAGACGCTT 


TGGAAAAAGG 


1101 


CCTGAAAAAC 


GTATTCGGAC 


1151 


TGTCCGACTC 


CGATGCCGAG 


1201 


CACGGCGTTG 


AAGTTTCCCT 


1251 


CGGCGCGGAT 


TTGGCGCGCA 


1301 


ATAACTTCGG 


TTTCGCCTAC 


1351 


CGTGCGATTG 


CCCAAAAAGT 


1401 


GGAAGCGTCT 


GCCGAAATCG 


1451 


TGCCGATCTG 


CATGGCGAAA 


1501 


CTGTTGGGCT 


GCCCCGAAGA 


1551 


TTCCGCAGGC 


GCAGGTTTCA 


1601 


TGCCCGGCCT 


GCCCAAAGTT 


1651 


GAAGGCGTGA 


TTCACGGCTT 



CCGTAACCAT CGGTTTGGCG GACGCATTGC 
GTGATTGCTT TGCGCGAGCC TTCTTTGGGT 
CGGCGCGGCA GGCGGCGGCT ATGCCCAAGT 
ACCTGCACTT CACCGGAGAT TTTCACGCCA 
CTTGCCGCGA TGCTCGACAA CCATATCTAC 
CGACCCCAAA CGCGTGCTGT GGCGGCGCGT 
AGTTGCGCAA CATCATCGAC GGCATGGGCA 
CGTCCTGACG GTTTCGATAT TACCGTTGCT 
CTGTCTTGCC AAAGACATCA GCGATTTGAA 
TTGTCGCCTA CGCCAAAGAC GGCAGCCCCG 
GCGAATGGCG CGATGGCGGC ATTGCTTAAA 
GGTGCAAACC ATCGAAGGCA CGCCCGCCTT 
CCAACATCGC CCACGGCTGC AACTCCGTAA 
CACCTTGCCG ATTACGCCGT AACCGAAGCA 
CGCGGAAAAA TTCTGCGACA TCAAATGCCG 
ATGCGGCTGT TGTCGTGGCG ACTGTCCGCG 
GTGGAACGCG CCAACCTCGG CGAAGAAAAT 
TTTGCCCAAC CTGCTGAAAC ACATTTCCAA 
TGCCCGTCGT CGTTGCGCTC AACCGCTTCG 
TTGG^GATGA TTGAAAAAGC CTGTGCCGAA 
GACCGAAGTG TGGGGCAAAG GTGGTGCGGG 
AAGTCGTCAA . XATTGAA AGTCAAACCA 
GATGTCGAGT i'GGGCATCAA AGACAAAATC 
GTACGGCGCG GAAGATGTTG ATTTCAGCGC 
CTTCACTGGA AAAACTGGGC TTGGACAAAA 
ACCCAATACT CTTTGAGCGA CAACGCCAAA 
CTTCCGCATC GCCGTGCGCG GCATCACCGT 
TCGTCGCCCT GTGCGGCAAC ATGATGAAAA 
CCGGCTGCCG AGAAAATCGA TGTGGACGCA 
GTTCTGA 



This corresponds to the amino acid sequence <SEQ ID 558; ORF 141. a>: 

al41 .pep 

1 MSFKTDAEIA 

51 PQKQGRLILV 

101 PVFGVKGGAA 

151 QGNELNIDPK 

201 5EVMAVFCLA 

2 51 DAIKPNLVQT 

301 GFGADLGAEK 

351 LDALEKGLPN 

401 HGVEVSLTEV 

4 51 RAIAQKVYGA 

501 LLGCPEDFRI_ 

551 EGVIHGLF* 

ml41/al41 99.5% identity in 558 aa overlap 

10 20 30 40 50 60 

ml 41. pep MSFKTDAEIAQS3TMRPIGEIAAKLGLNADNIEPYGHYKAKINPAEAFKLPQKQGRLILV 
1 I i I 1 I I I I I I I i i I ! I i t It ! I 1 I II i : I I I I i i t I I i I I i t i I i i M I I I M I I I I li 
a!41 MSFKTDAEIAQSSTMRPIGEIAAKLGLNVDNIEPYGHYKAKINPAEAFKLPQKCGRLILV 

10 20 30 40 50 60 



OSSTMRPIGE IAAKLGLNVD 
TAINPTPAGE GKTTVTIGLA 
GGGYAQVLPM EDINLHFTGD 
RVLWRRVVDM NDRQLRNIID 
KDISDLKERL GNILVAYAKD 
IEGTPAFVHG GPFANIAHGC 
FCDIKCRLAG LKPDAAVWA 
LLKHISNLKN VFGLPWVAL 
WGKGGAGGAD LARKWKAIE 
EDVDFSAEAS AEIASLEKLG 
AVRGITVSAG AGFIVALCGN 



NIEPYGHYKA KINPAEAFKL 
DALRHIGKDS VIALREPSLG 
FHAIGAANNL LAAMLDNHXY 
GMGKPVDGVM RPD GFDITVA 
GSPVYAKDLK ANGAMAALLK 
NSVTATRLAK HLADYAVTEA 
TVRALKYNGG VERANLGEEN 
NRFVSDSDAE LAMIEKACAE 
SOTNNFGFAY DVELGIKDKI 
LDKMPICMAK TQYSLSDNAK 
MMKMPGLPKV PAAEKIDVDA 



70 80 90 100 110 120 

ml41.pep TAINPTPAGEGKTTVTIGLADALRHIGKDAVIALREPSLGPVFGVKGGAAGGGYAQVLPM 
i I ! I I I 1 I I I II I I i I I II ! I 1 I I I I I t I : I I I i I I i I I I I I I I I I I i S I I I I i t f I M I 
a 1 4 1 TAINPT PAGE GKTTVT I GLAD ALRH I GKDSVIALREP S LG P VFG VKGG AAGGG Y AQ VL PM 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 4 1 . pep EDINLHFTGDFHAIGAANNLLAAMLDNHI YQGNELNI DPKRVLWRRVVDMNDRQLRNI I D 

M i M I I I I f I I I i M I I I 1 I It I I I 1 1 || I I M M i I II ! I i i i H I i I I I II I II 1 i I 
al41 EDINLHFTGDFHAIGAANNLLAAMLDNHI YQGNELNI DPKRVLWRRVVDMNDRQLRNI ID 

130 140 150 160 170 180 
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The following partial DNA sequence was identified in N. meningitidis <SEQ ID 561 >: 



ml42 . seq 












1 


ATGCGTGCCG 


ATTTCATGTT 


TGCCGACAAT 


ATGCCCGTGC 


AGGTGCGCCA 


51 


ACGCGCCCTC 


TATTTCAAGT 


TGTCCCGTTT 


TGCCGCGATG 


CCAGATGTGG 


101 


TAGGCAAACC 


GCTCTTCGGG 


CGACAGGCCG 


GTCAGCCCGG 


CAAAATGTTC 


151 


GGCAACATCC 


TGATGTTCGT 


CCGCCAGCGT 


ATTGATGCAG 


AGGCTGCCGT 


201 


TTTCCGACAG 


GATCGGAATG 


ATTCGCGCAC 


TCCGGTTGAT 


GCACAGCATC 


251 


ACGGTCGGCG 


GCTCGTCGGT 


AACCGGCGCG 


ACCGCCGTCA 


TTGTAATGCC 


301 


GTAACGCCCT 


GCCGCACCGT 


CTGTCGTGAT 


GACATGAACG 


CCTGCCGCGC 


351 


AAGATGCCAT 


CGCATCACGG 


AACGAAGTTT 


GAAAATTTTT 


CTGCAAATCC 


401 


GCCATTTTTC 


CCCTTTAAAC 


TGTCCCCTAT 


ATAAGAATGC 


TGCACACAAG 


451 


GCATCCCCcC 


ATGTGCAGCA 


GTTTTGA 







This corresponds to the amino acid sequence <SEQ ID 562; ORF 142>: 

m!4 2 . pep 

1 MRADFMFADN MPVQVRQRAL YFKLSRFAAM PDWGKPLFG RQAGQPGKMF 
51 GNILMFVRQR IDAEAAVFRQ DRNDSRTPVD AQHHGRRLVG NRRDRRHCNA 
101 VTPCRTVCRD DMNACRARCH RITERSLKIF LQIRHFSPLN CPLYKNAAHK 
151 ASPHVQQF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 142 shows 93.7% identity over a 158 aa overlap with a predicted ORF (ORF 142.ng) 
from N. gonorrhoeae: 

m!42/gl42 



10 20 30 40 50 60 

ml 4 2 . pep MRADFMFADNMPVQVRORALYFKLSRFAAMPDVVGKPLFGRQAGQPGKMFGNILMFVRQR 

lllllll!lllllllllli:|||||lili!!^ll!i!llllllillM[illlMlll: 

g 1 4 2 MRADFMFADNMPVQVRORAFYFKLSRFAAMPNMVGKPLFGROAGQPGKMFGNI LMFVRQH 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 4 2 . pep I DAEAAVFRQDRJTOSRTPVBAQHHGRRLVGNRRDRRHCNAVTPCRTVCRDDMNACRARCH 

miimiiimiiiii immiimi:iMi;mimmmiiii: n 

gl4 2 IDAEAAVFRQDRNDSRTPVYAQHHGRRLVGNRRURRHCNAVTPCRTVCRDDMNACRTGCH 
70 80 90 100 110 120 



130 140 150 159 

ml42 .pep RITERSLKIFLQIRHFSPLNCPLYKNAAHKASPHVOQFX 

(11(1111 lllilllllil illllllllllllMIII 

gl4 2 RITERSLKSFLQIRHFSPLNRPLYKNAAHKASPHVQQFX 
130 140 150 



The following partial DNA sequence 



al42 . seq 






1 


ATGCGTGCCG 


ATTTCATGTT 


51 


ACGCGCCCTC 


TATTTCAAGT 


101 


TAGGCAAACC 


GCTCTTCGGG 


151 


GGCAACATCC 


TGATGTTCGT 


201 


TTTCCGACAG 


GATCGGAATG 


251 


ACGGTCGGCG 


GCTCGTCCGT 


301 


GTAACGCCCT 


GCCGCACCGT 


351 


AGGATGCCAT 


CGCATCACGG 


401 


GCCATTTTTC 


CCCTTTAAAC 


451 


GCACCCCCCA 


TGTGCAGCAG 


501 


TTCCGCGCGT 


TACGGCGTAT 


551 


AGTTCTGCCA 


ACAGGTCTTT 


601 


TTCCTCGATA 


GCGTCGTAAC 


651 


ACAGCTTCTC 


GCCCTCGGCA 


701 


TTGGAAGGTT 


TCAGGCGGAC 


751 


GATACCGGTC 


AGGCTGGGGC 



identified in TV. meningitidis <SEQ ID 563>: 

TGCCGACAAT ATGCCCGTGC AGGTGCGCCA 
TGTCCCGTTT TGCCGCGATG CCAGATGTGG 
CGACAGGCCG GTCAGCCCGG CAAAATGTTC 
CCGCCAGCGT ATTGATGCAG AGGCTGCCGT 
ATTCGCGCAC TCCGGTTGAT GCACAGCATC 
AACCGGCGCA ACCGCCGTCA TTGTAATGCC 
CTGTCGTGAT GACATGAACG CCTGCCGCAC 
AACGAAGTTT GAAAAGTTTT CTGCAAATCC 
TGTCCCCTAT ATAAGAATGC TGCACACAAG 
TTCTGATTCA AAAAGCCGTC GGTCGGACAT 
TACGAGTTCA ACGCATCCTC GATTTTGGCA 
AAGCAGCAGC ATTTTCTCGC GGCCCAGCAC 
GCTCGTCCAC TTCTTCGCCG ATTTCCTCAT 
GTCAGCTTCA GAAAAACACG TCGTTGGTCG 
AACCAAACCC GCTTTTTCAA GGCGGGTCAG 
GCAAAATGCA CGCCTGATTC GCCAAATCTT 
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1201 CAGGCAACCA TGTTCTTGGT TGCAGGCGCA GTCTTGCTGC TGGGAGCCTT 

1251 CTCAGTCTGT CTGATTAAAG AGATCCACGG CGGGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 566; ORF 143.ng>: 

gl4 3 . pep 

1 MLSFGYLGVQ TAFTLQSSQM SRIFQTLGAD PHNLGW FFIL PPLAGMLVQP 

51 IVGYYSDRTVJ KPRLGGRR LP YLLYGTLIAV IVMIL MPNSG SFGFGYASLA 

101 ALSFGALMIA LLDV SSNMAM QPFKMMVGDM VNEEQKSYAY GIQSFLANTD 

151 AWAAILPFV FAYIGLA NTA EKGVVPQT VV VAFYVGAALL IITSA FTISK 

201 VKEYDPETYA RYHGIDVAAN QEKANWFELL KTAPKVFWTV TPVQFFCWFA 

251 FRYMWTYSAG AIAENVWHTT DASSVGHQEA GNRYG VLAAV *5VAAVICSF 

301 ILAKVPNKYH FAG YFGCLAL GALGFFSIFF IYNQ YALILS YILIGIAWAG 

351 IITYPLTIVA NALSGKHMDT YLGLFN GSVC MPQIVASLLS FVL FPMLGGH 

401 QATMFLVAGA VLLLGAFSVC LIKEIHGGV* 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 567>: 

ml43 . seq 

1 ATGCTCAGTT TCGGCTTTCT CGGCGTTCAG ACGGCCTTTA CCCTGCAAAG 

51 CTCGCAAATG AGCCGCATTT TTCAAACGCT AGGCGCAGAC CCGCACAATT 

101 TGGGCTGGTT TTTCATCCTG CCGCCGCTGG CGGGGATGCT GGTGCAGCCG 

151 ATTGTCGGCC ATTACTCCGA CCGCACTTGG AAGCCGCGTT TGGGCGGCCG 

201 CCGTCTGCCG TATCTGCTTT ATGGCACGCT GATTGCGGTT ATTGTGATGA 

251 TTTTGATGCC GAACTCGGGC AGCTTCGGTT TCGGCTATGC GTCGCTGGCG 

301 GCTTTGTCGT TCGGCGCGCT GATGATTGCG CTGTTAGACG TGTCGTCAAA 

351 TATGGCGATG CAGCCGTTTA AGATGATGGT CGGCGACATG GTCAACGAGG 

4 01 AGCAGAAAGG CTACGCCTAC GGGATTCAAA GTTTCTTAGC AAATACGGGC 

4 51 GCGGTCGTGG CGGCGATTCT GCCGTTTGTG TTTGCGTATA TCGGTTTGGC 

501 GAACACCGCC GAGAAAGGCG TTGTGCCGCA GACCGTGGTC GTGGCGTTTT 

551 ATGTGGGTGC GGCGTTGCTG GTGATTACCA GCGCGTTCAC GATTTTCAAA 

601 GTGAAGGAAT ACGATCCGGA AACCTACGCC CGTTACCACG GCATCGATGT 

651 CGCCGCGAAT CAGGAAAAAG CCAACTGGAT CGAACTCTTG AAAACCGCGC 

701 CTAAGGCGTT TTGGACGGTT ACTTTGGTGC AATTCTTCTG CTGGTTCGCC 

751 TTCCAATATA TGTGGACTTA CTCGGCAGGC GCGATTGCGG AAAACGTCTG 

801 GCACACCACC GATGCGTCTT CCGTAGGTTA TCAGGAGGCG GGTAACTGGT 

851 ACGGCGTTTT GGCGGCGGTG CAGTCGGTTG CGGCGGTGAT TTGTTCGTTT 

901 GTATTGGCGA AAGTGCCGAA TAAATACCAT AAGGCGGGTT ATTTCGGCTG 

951 TTTGGCTTTG GGCGCGCTCG GCTTTTTCTC CGTTTTCTTC ATCGGCAACC 

1001 AATACGCGCT GGTGTTGTCT TATACCTTAA TCGGCATCGC TTGGGCGGGC 

1051 ATTATCACTT ATCCGCTGAC GATTGTGACC AACGCCTTGT CGGGCAAGCA 

1101 TATGGGCACT TACTTGGGCT TGTTTAACGG CTCTATCTGT ATGCCTCAAA 

1151 TCGTCGCTTC GCTGTTGAGT TTCGTGCTTT TCCCTATGCT GGGCGGCTTG 

1201 CAGGCCACTA TGTTCTTGGT AGGGGGCGTC GTCCTGCTGC TGGGCGCGTT 

1251 TTCCGTGTTC CTGATTAAAG AAACACACGG CGGGGTTTGA 

This corresponds to the amino acid sequence <SEQ ID 568; ORF 143>: 

ml 4 3 . pep 

1 MLSFGFLGVQ TAFTLQSSQM SRIFQTLGAD PHNLGW FFIL PPLAGMLVQP 

51 IVGHYSDRTW KPRLGGRR LP YLLYGTLIAV IVMIL MPNSG SFGFGYASLA 

101 ALSFGALMIA LLDV SSNMAM QPFKMMVGDM VNEEQKGYAY GIQSFLANTG 

151 AWAAILPFV FAYIGLA NTA EKGVVPQT VV VAFYVGAALL VITSA FTIFK 

201 VKEYDPETYA RYHGIDVAAN QEKANWIELL KTAPKAFWTV TLVQFFCWFA 

251 FQYMWTYSAG AIAENVWHTT DASSVGYQEA GNWYG VLAAV QSVAAVICSF 

301 VLAKVPNKYH KAG YFGCLAL GALGFFSVFF IGHQ YALVLS YTLIGIAWAG 

351 IITYPLTIVT NALSGKHMGT YLGLFN GSIC MPQIVASLLS FVL FPMLGGL 

401 QATM FLVGGV VLLLGAFSVF LIKETHGGV* 

Computer analysis of this amino acid sequence gave the following results: 

Homology with a predicted ORF from N. gonorrhoeae 

ml43 / gl43 93.9% identity in 429 aa overlap 

10 20 30 40 50 60 
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7 51 TTCCAATATA TGTGGACTTA 

801 GCACACCACC GATGCGTCTT 

851 ACGGCGTTTT GGCGGCGGTG 

901 GTATTGGCGA AAGTGCCGAA 

951 TTTGGCTTTG GGCGCGCTCG 

1001 AATACGCGCT GGTGTTGTCT 

1051 ATTATCACTT ATCCGCTGAC 

1101 TATGGGCACT TACTTGGGCC 

1151 TCGTCGCTTC GCTGTTGAGT 

1201 CAGGCCACTA TGTTCTTGGT 

1251 TTCCGTGTTC CTGATTAAAG 



CTCGGCAGGC GCGATTGCGG AAAACGTCTG 
CCGTAGGTTA TCAGGAGGCG GGTAACTGGT 
CAGTCGGTTG CGGCGGTGAT TTGTTCGTTT 
TAAATACCAT AAGGCGGGTT ATTTCGGCTG 
GCTTTTTCTC CGTTTTCTTC ATCGGCAACC 
TA^ACCTTAA TCGGCATCGC TTGGGCGGGC 
GATTGTGACC AACGCCTTGT CGGGCAAGCA 
TGTTTAACGG CTCTATCTGT ATGCCGCAAA 
TTCGTGCTTT TCCCTATGCT GGGCGGCTTG 
AGGGGGCGTC GTCCTGCTGC TGGGCGCGTT 
AAACACACGG CGGGGTTTGA 



This corresponds to the amino acid sequence <SEQ ID 570; ORF 143.a>: 

al4 3 . pep 

1 MLSFGFLGVQ TAFTLQSSQM SRIFQTLGAD PH5LGW FFIL PPLAGMLVQP 

51 IVGHYSDRTW KPRLGGRR LP YLLYGTL1AV IVMIL MPNSG SFGFGYASLA 

101 ALSFGALMIA LLDV S5NMAM QPFKMMVGDM VNEEQKGYAY GIQSFLANTG 

151 AWAAILPFV FAYIGLA NTA EKGWPQT W VAFYVGAALL VITSA FTIFK 

201 VKEYNPETYA RYHGIDVAAN QEKANWIELL KTAPKAFWTV TLVQFFCWFA 

251 FQYMWTYSAG AIAENVWHTT DASSVGYQEA GNWYG VLAAV QSVAAVICSF 

301 VLAKV PNKYH KAGYFGCLAL GALGFFSVFF IGNQY ALVLS YTLIGIAWAG 

351 ITTYPLTIVT NALSGKHMGT YLGLFNGSIC MPQ IVASLLS FVLFPMLGGL 

4 01 QATMF LVGGV VLLLGAF5VF LI KETHGGV* 



ml43/al43 99.5% identity in 429 aa overlap 



10 20 30 40 50 60 

ml 4 3. pep MLSFGFLGVQTAFTLQSSQMSRIFQTLGADPHNLGWFFILPPLAGMLVQPIVGHYSDRTW 
I I I I I I M I t I I I M I I I I I I I t I J I ! I I I t I : I I II I i I M I I M I i t t ! i I I i I I I If 
al4 3 MLSFGFLGVQTAFTLQSSQMSRIFQTLGADPHSLGWFFILPPLAGMLVQPIVGHYSDRTW 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 4 3 .pep KPRLGGRRLPYLLYGTLIAVIVMILMPNSGSFGFGYASLAALSFGALMIALLDVSSNMAM 
[ I I I I t I I I I I 1 I I I i I I I I I I I I I I I I I I ! I [ ! ! I I [ I ! 1 M ! I ! I I I f ! [ M I II I II 
al 4 3 KPRLGGRRLPYLLYGTLIAVIVMILMPNSGSFGFGYASLAALSFGALMIALLDVSSNMAM 
70 80 90 100 110 120 



130 140 150 160 170 180 

ml 4 3 . pep QPFKMMVG DMVNEE QKG YA YG I Q S FLANTG AWAAI L P FVFA Y I GLANTAE KG W PQT W 

I I I I I i I I I I I I I I I I I I ( I I I i I I I I I I I I I M I I I II 1 I 1 I I i i II M I I I I I I \ I i I 
a!4 3 QPFKMMVGDMVMEEQKGYAYGIQSFLANTGAVVAAILPFVFAYIGLANTAEKGWPQTVV 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml4 3 .pep VAFYVGAALLVITSAFTIFKVKEYDPETYARYHGIDVAANQEKANWIELLKTAPKAFWTV 
I t I I i I I I I I I I 1 I I I I I I I ( M I : I II I I I I I I 1 I I 1 I I 1 I 1 I I I t I I I I I I i I i I i I t 
a 1 4 3 VAFYVGAALLVI TSAFTI FKVKEYN PET YAR YHG I D VAANQEKANW I EL LKT APKA FWTV 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 4 3 .pep TLVQFFCWFAFQYMWTYSAGAIAENVWHTTDASSVGYQEAGMWYGVLAAVQSVAAVICSF 
I I I I II I I I I ! I I I II I I I I I I I I I I I I I I I I I I I I I II I I ! I I I I I I I I I M I I I I I I ! 
al4 3 TLVQFFCWFAFQYMWTYSAGAIAENVWHTTDASSVGYQEAGNWYGVLAAVQSVAAVICSF 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml 4 3. pep VLAKVPNKYHKAGYFGCLALGALGFFSVFFIGNQYALVLSYTLIGIAWAGIITYPLTIVT 
I I ! II I I I I ! I II I I I I II I I I I I I I I II I I I i 1 II I I ! I I I I I I I I I I I I I ] I I I I I I i 
al4 3 VLAPCVPNKYHfCAG YFGCLALGALGFFSVFFIGNQYALVLS YTLIGIAWAG I ITYPLT I VT 

310 320 330 340 350 360 



ml 4 3 . pep 



370 380 390 400 410 420 

NALSGKHMGTYLGLFNGSICMPQIVASLLSFVLFPMLGGLQATMFLVGGWLLLGAFSVF 
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Homology with a predicted ORF from N. gonorrhoeae 

ml44 / gl44 91.3% identity in 218 aa overlap 

10 20 30 40 50 60 

ml44 . pep MSDTPATRDFGLIDGRAVTGYVLSNRRGTRVCVLDLGGIVQErSVLADGVRENLWSFDD 

i I i i i I i I I I i I I I I i I II i ! ) I I i I I I I I I II ! I II M I M I II I M I II MM! 
gl4 4 MSDTPATRDFGLIDGRAVTGYVLSNRRGTCVFVLDLGGIVQEFSVLADGVRENPWSFDD 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 4 . pep AASYADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNALHGGSHGLAVTRFNAV 
1 M I 11 M I I I I II I I M I M I I ! I I I I I I I I I ] II I II I II I II I I I I M I M I I I I I I 
gl4 4 AASYADNPFQINKQIGRVAGRIRGAAFDINGRTYRVEANEGRNAIHGGSHGIAVTRFNAV 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 4 .pep AADGRSWLRSRLATVGRRLSQRFGFC 'FLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 

III I I I II I I i I II I I I I II I I I M ! I I I I I I I I I I II I I I! II I 

gl4 4 AAD GRRLSQRFG--YFLPLGRGRPAYRYLSRHRARRHGVRPDAAHLL 

130 140 150 160 

190 200 210 219 

ml 4 4 .pep AAGRGPARCGSAYSAGRTYAGRCRKTARLNGFRRPRSIX 
I I I II I M I ( I I I I I i I I i : I I i I M I M i I I ! M M I I 
gl4 4 AAGRGPARCGSAYSAGRTYSGRCRKTARLNGFRRPRSIX 
170 180 190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 575>: 

al4 4 . seq 

1 ATGAGCGATA CCCCCGCTAC CCGCGATTTC GGCCTGATCG ACGGGCGTGC 

51 CGTAACCGGC TATGTGCTGT CCAACCGGCG TGGTACGCGT GTCTGCGTGC 

101 TGGACTTGGG CGGGATTGTG CAGGAATTTT CCGTTTTGGC AGACGGCGTG 

151 CGCGAAAACC TCGTGGTGTC GTTCGACGAT GCGGCTTCCT ATGCGGACAA 

201 TCCGTTTCAG ATTAACAAGC AGATAGGGCG CGTGGCCGGA CGCATCCGCG 

251 GTGCGGCGTT CGACATCAAC GGCAGGACTT ACCGCGTGGA GGCCAACGAA 

301 GGCAGGAACG CGCTGCACGG CGGTTCGCAC GGGCTGGCCG TTACCCGTTT 

351 CAACGCGGTG GCGGCAGACG GCCGTTCGGT GGTGCTGCGC AGCCGCCTG . 

401 CAACAGTCGG CCGACGGTTA TCCCAACGAT TTGGATTTGG ATATTTCCTA 

4 51 CCGCTTGGAC GAGGACGACC GGCTTACCGT TACCTATCGC GCCACCGCGC 

501 TCGGCGACAC GGTGTTCGAC CCGACGCTGC ACATTTACTG GCGGCTGGAC 

551 GCGGGCCTGC ACGATGCGGT TCTGCATATT CCGCAGGGCG GACATATTCC 

601 GGCCGATGCC GAAAAACTGC CCGTCTCAAC GGTTTCAGAC GACCTCGAAG 

651 TATTTGA 

This corresponds to the amino acid sequence <SEQ ID 576; ORF 144.a>: 

al4 4 . pep 

1 MSDTPATRDF GLIDGRAVTG YVLSNRRGTR VCVLDLGGIV QEFSVLA DGV 

51 RENLVVSFDD AASYADNPFQ INKQIGRVAG RIRGAAFDIN GRTYRVEANE 

101 GRNALHGGSH GLAVTRFNAV AADGRSWLR SRLXTVGRRL SQRFGFGYFL 

151 PLGRGRPAYR YLSRHRARRH GVRPDAAHLL AAGRGPARCG SAYSAGRTYS 

201 GRCRKTARLN GFRRPRSI * 

ml44/al44 99.1% identity in 218 aa overlap 

10 20 30 40 50 60 

ml 4 4 .pep MSDTPATRDFGLIDGRAVTGYVLSNRRGTRVCVLDLGGIVQEFSVLADGVRENLWSFDD 
M I I I I I I I 1 M M I I I I 1 I II 1 I M I I I 1 II I M 1 I I I i ! I M I I I M I I 11 I I I I I I I 
al4 4 MSDTPATRDFGLIDGRAVTGYVLSNRRGTRVCVLDLGGIVQEFSVLADGVRENLWSFDD 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 4 4 . pep AAS YADNPFQINKQIGRVAGRIRGAAFDIMGRTYRVEANEGRNALHGGSHGLAVTRFNAV 
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Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml46 / gl46 90.1% identity in 212 aa overlap 

10 20 30 40 50 60 

ml 4 6 . pep MAQILLRSRQVVIDHDKVKQYGLLDFMPCLRQPPLDNFPTVRPASVEARGKYVERRRQDK 
I II I I I I I M I ! I I : I I I I : I i I ( I I I 1 I I \ I I I I 11 M ! I I I I i : I i M I I I I 
gl4 6 MKQIPLRLLQWIDHDKVEQYGLFDFMPCLRQPPLDNFPTVRPAPFEARGKHVERRRQDK 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 4 6 . pep DADG FGQRV AN LRRALN VD FQNH V I ACRRQR I HT LRACAV I V AK Y VG V FQK S FLRDKRLK 

1:1:1 I I I I I I ) I I I I I I t I I I I I I I I I I I I I : I I I I I ! I I I : I I I I I I I : I I 1 I I : I 
gl4 6 DTDSFRQRVANLRRALMVDFQNHVIACRRQRIHALRACAVIVAEYVCVFQKSLLRDKRFK 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 4 6 .pep LFFGNKVIMYAVCFAFTRRARRVRHGNAQTVMVCQQPRHQRGFARAGSGRNDKDVAFSIS 

I I I I i ! U I ( I I I M f I I I I I I : I I M I I I III I I I I I ! I I I I t M ! M I ( I I I I I I I I I 
gl4 6 LFFGNK^/IMYAVCFAFTRRARRMRHGNAQTVMVCQQPRHORGFARAGSGRNDKDVAFSIS 

130 140 150 160 170 180 

190 200 210 

ml 4 6. pep GHIFYLYIFQPIVSQWTPSFLFADAHILPLLFX 
t 11 I 1 t 1 1 I M I I I i II 1 : I I 11 It 1! II M 
gl4 6 GHIFYLYIFQPIVSQRTPYFIFADAHILPLLFX 

190 200 210 

The following partial DNA sequence v/as identified in N. meningitidis <SEQ ID 58 : : 

al46 . seq 

1 ATGGCGCAAA TCCTCCTCCG CCCGCGCCAA GTCATCATTG ACCACGACAA 

51 AATCGAACAA TACGGACTGT TCGATTTCAT GCCTTGCCTT CGACAGCCTC 

101 CTTTGGATAA CTTCCCGACT GTCCGTCCCG CGTCCGTTGA GACGCGCAGC 

151 AAGCACATCG AAAGACGGCG GCAGGATAAA GATGCCGACG GCTTCGGGCA 

201 GCGCATCTCG AACCTGAGCC GCGCCCTGAA CGTCGATTTC CAAAATCACG 

251 TCATAACCTG CCGCCGCCAA CGCATTCACA CCCTCCGCGC TTGTGCCGTA 

301 ATAGTTGCCG AACACGTCCG CGTATTCCAA AAAAGCCTCC TGCGCGATAA 

351 GCGACTCAAA CTCTTCTTTG GAAACAAAGT GATAATGTAC GCCGTTTGCT 

401 TCGCCTTCAC GCGGCGGACG CGTCGTGTGC GACACGGAAA CGCGCAAACC 

4 51 GTTATGGTTT GCCAACAGCC GCGACACCAG CGTGGTTTTG CCCGTGCCGG 

501 AAGCGGCCGA AATGATAAAG ATGTTGCCTT TTCGATAAGC GGACATATTT 

551 TTTACCTGTA TATTTTCCAG CCGATTGTAT CACAACGGAC ACCCGGTTTC 

601 CTATTTGCCG ATGCCCATAT TTTGCCGCTA TTGTTTTGA 

This corresponds to the amino acid sequence <SEQ ID 582; ORF 146.a>: 

al4 6 . pep 

1 MAQILLRPRQ VIIDHDKIEQ YGLFDFMPCL RQPPLDNFPT VRPASVETRS 

51 KHIERRRQDK DADGFGQRIS NLSRALNVDF QNHVITCRRQ RIHTLRACAV 

101 IVAEHVRVFQ KSLLRDKRLK LFFGNKVIMY AVCFAFTRRT RRVRHGNAQT 

151 VMVCQQPRHQ RGFARAGSGR NDKDVAFSIS GHIFYLYIFQ PIVSQRTPGF 

201 LFADAHILPL LF* 

ml46/al46 90.6% identity in 212 aa overlap 

10 20 30 40 50 60 

ml 4 6 . pep MAQILLRSRQWIDHDKVKQYGLLDFMPCLRQPPLDNFPTVRPASVEARGKYVERRRQDK 

II I I I 1 I I i I : I I 1 I I : : i I I I : I! I I I f I I I I I I M I i I I II IK : I : I : : I I t I I I I 
al4 6 MAQILLRPRQVIIDHDKIEQYGLFDFMPCLRQPPLDNFPTVRPASVETRSKHIERRRQDK 

10 20 30 40 50 60 

70 80 90 100 110 120 
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1101 


AGGTCGTCTG 


AAAGGCAGCT 


1151 


GTGCTTTATC 


TGCCATATCC 


1201 


AACAAAGTGC 


AACATTACAG 


1251 


CAACTTCACG 


CTTGAAGGAG 


1301 


TTCAGTACGA 


CAAAGCATTG 


1351 


CTGCCCGhCC 


TCGGCGCGCA 


1401 


GGGCAACTGG 


TATTTCACGC 


14 51 


ATCAGGAACG 


CCTGCCGTCA 


1501 


GTCGCCACCA 


ACACCTTTGA 


1551 


TTCCAACAAT 


ATCGAACTCG 


1601 


ACAATCTGGC 


ACTCTACCGC 


1651 


ACCTTAAACG 


ACGGACGCGG 


1701 


GAAGCTCGTG 


CGCTACAACC 


1751 


GCGAAATCTA 


CTTCAAACCG 


1801 


GACTATGTAC 


GAGGCCGTCT 


1851 


AGATGCCTA r 


GGCAACCGTC 


1901 


CCCGTGTTCC 


GGCTGCGCGC 


1951 


GACCGTATCG 


ATGCCAATTT 


2001 


ACTCGCCCGC 


TACGAAACGC 


2051 


GCGCAAACTA 


CCGCCGCAAT 


2101 


AAAGCCGACA 


ACCTGCTCAA 


2151 


CTCTGATACG 


CCGCAAATGG 


2201 


AGTTTTAA 





GGGGCGTGCA ATATTTACAA CAAAAATCCA 
GAAGCGGTTA AACAACCGAT GCTGCTTGAC 
CTTTTTCGGT GTAGAACAGG CAAACTGGGA 
GCGTACGCGT GGAAAAACAA AAAGCCTCCA 
ATTGATCGGG AAAACTACTA CAACCACCCC 
CCGCCAAACC GCCCGCTCAT TCGCACTTTC 
CACAACACAA ACTCAGCCTG ACCGCCTCCC 
ACGCAAGAGC TGTACGCACA CGGCAAACAC 
AGTCGGCAAC AAACACCTCA ACAAAGAGCG 
CGCTGGGCTA CGAAGGCGAC CGCTGGCAAT 
AACCGCTTCG GTAACTACAT TTACGCCCAA 
CCCCAAATCC ATCGAAGACG ACAGCGAAAT 
AATCCGGCGC CGACTTCTAC GGCGCGGAAG 
ACACCGCGCT ACCGCATCGG CGTTTCCGGC 
GAAAAACCTG CCTTCCCTAC CCGGCAGAGA 
CTTTCATCGC ACAGGACGAC CAAAATGCCC 
CTCGGCTTCC ACCTGAAAGC CTCGCTGACC 
GGACTACTAC CGCGTGTTCG CCCAAAACAA 
GCACGCCCGG ACACCATATG CTCAACCTCG 
ACGCGCTATG GCGAGTGGAA TTGGTACGTC 
CCAATCCGTT TACGCCCACA GCAGCTTTCT 
GCCGCAGCTT TACCGGCGGC GTGAACGTGA 



This corresponds to the amino acid sequence <SEQ ID 586; ORF 147>: 



ml 4 7 .pep 


{partial ) 










1 


. . PHKTEQSVDL 


ETVSWGKSR 


PR=.TSGLLHT 


STASDKIISG 


DTLRQKAVNL 


51 


GDALDGVPGI 


HASQYGGGAS 


APVIRGQTGR 


RIKVLNHHGE 


TGDMADFSPD 


101 


RAIMVDTALS 


QQVEILRGPV 


TLLYSSGNVA 


GLVDVADGKI 


PEKMPENGV5 


151 


GELGLRLSSG 


NLEKLTSGGI 


NIGLGKNF^L 


HTEGLYRKSG 


DYAVPRYRNL 


201 


KRLPDSHADS 


QTGSIGLSWV 


GEKGFIGVAY 


SDRRDQYGLP 


AHSHEYDDCH 


25\ 


ADIIWQKSLI 


NKRYLQLYPH 


LLTEEDIDYD 


NPGLSCGFHD 


DDNAHAHTHS 


301 


GRPWIDLRNK 


RYELRAJEWKQ 


PFPGFEALRV 


HLNRNDYRHD 


EKAGDAVENF 


351 


FNNQTQNARI 


ELRHQPIGRL 


KGSWGVQYLQ 


OKSSALSAIS 


EAVKQPMLLD 


401 


NKVQHYSFFG 


VEQANWDNFT 


LEGGVRVEKQ 


KASIQYDKAL 


IDRENYYNHP 


451 


LPDLGAHRQT 


ARSFALSGNW 


YFTPQHKLSL 


TASHQERLPS 


TQELYAHGKH 


501 


VATNTFEVGN 


KHLNKERSNN 


IELALGYEGD 


RWQYNLALYR 


NRFGNYIYAQ 


551 


TLNDGRGPKS 


IEDDSEMKLV 


RYNQSGADFY 


GAEGEIYFKP 


TPRYRIGVSG 


601 


DYVRGRLKNL 


PSLPGREDAY 


GNRPFIAQDD 


ONAPRVPAAR 


LGFHLKASLT 


651 


DRIDANLDYY 


RVFAQNKLAR 


YETRTPGHHM 


LNLGANYRRN 


TRYGEWNWYV 


701 


KADNLLNQSV 


YAHSSFLSDT 


PQMGRSFTGG 


VNVKF* 





Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



ml47 / gl47 92.3% identity in 142 aa overlap 



10 20 30 

ml 4 7 . pep PHKTEQSVDLETVSWGKSRPRATSGLLHT S 

1:11111 1 M 1 ! 11 1 I i 1 1 11 t II I I I 1 1 
gl4 7 MRREAKMAQITLKPIVLSILLINTPLLAQAHETEQSVGLETVSVVGKSRPRATSGLLHTS 
10 20 30 40 50 60 



40 50 60 70 80 90 

ml 4 7 .pep TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 
t ! I U 1 I ! i M 1 I M I I f I I i li I 1 I 11 [ M I I I I I I I I 1 I 1 I I I I I I I I I I 1 ! I I I I I I 
gl47 TASDKII5GDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 
70 80 90 100 110 120 

100 110 120 130 140 150 

ml 4 7 .pep GDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG 

1 1 t 1 M 1 1 1 I I 1 1 I 1 I I 1 1 1 1 11 I | | ( I | M I 1 I I I f I I I : I I j 
gl 4 7 GDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGAGQCCRWKNPPKNA 
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601 YNQSGADFYG AEGEIYFKPT PRYRIGVSGD YVRGRLKNLP SLPGREDAYG 

651 NRPLIAQADQ NAPRVPAARL GVHLKASLTD RIDANLDYYR VFAQNKLARY 

7 01 ETRTPGHHML NLGANYRRNT RYGEWNWYVK ADNLLNQSVY AKSSFLSDTP 

751 QMGRSFTGGV NVKF* 

m!47/al47 98.1% identity m 734 aa overlap 

10 20 30 

ml 4 7 . pep PHKTEQSVDLETVSWGKSRPRATSGLLHTS 

I i 1 I M M I i I I t I I i I 1 I 1 i I I I I I I f 
al4 7 MRREAKMAQTTLKPIVLSILLINTPLLSQAHGTEQSVGLETVSWGKSRPR^TSGLLHTS 
10 20 30 40 50 60 

40 50 60 70 80 90 

ml4 7 .pep TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 
I I I I I I I I I i i I I I I I 1 I I I M ! I I I ( I ! I I I I I i I M ! I I I ! I M I I I M ! f 1 t ! I i I ! 
al4 7 TASDKIISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGET 
70 80 90 100 110 120 

100 110 120 130 140 150 

ml47.pep GDMADFSPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG 
I I I I I I I I I I I I t I I : I f I I I I I I I II I I I I I t 1 I I I I I I ! I I I f ! I I I ! I I I II I ! I I ! 
a 14 7 GDMADFSPDHAIMVDSALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSG 
130 140 150 160 170 180 

160 170 180 190 200 210 

ml 4 7. pep ELGLRLSSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKKLPDSHADSQ 
I I I I II I I I If I I I I I i i t t I I I I i 1 I I I I II I I I I I I I I I I I 1 ! I II I I I I I I I I I M I 
al4 7 ELGLRLSSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQ 
190 200 210 220 230 240 

220 230 240 250 260 270 

ml47.pep TGSIGLSWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHL 
I I ! I I I I I I I I I I I I I : 1 I M I f I I I I I I I I I I I I I M I I i I I I I ! I I I I I I I I I I I M I 
a!47 TGSIGLSWVGEKGFIGAAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHL 
250 260 270 280 290 300 

280 290 300 310 320 330 

ml47 . pep LTEEDIDYDNPGLSCGFHDDDNAHAHTHSGRPWIDLRNKRYELRAEWKQPFPGFEALRVH 
M I M I I t I t I II III I M i I : IM I : I : I : I I M ! i I ! M 11 II 1 H II III I i 111 i i 
a!47 LTEEDIDYDNPGLSCGFHDDDDAHAHAHNGKPWIDLRNKRYELRAEWKQPFPGFEALRVH 
310 320 330 340 350 360 

340 350 360 370 380 390 

ml47 . pep LNRNDYRHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISE 
I I I I II I ! I I I I I I M t I I t M I I I ! It I I I i I f M ! M [ I I I ( I I M I t I M 1 I I 11 
al4 7 LNRNDYRHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLGQKSSALSATSE 
370 380 390 400 410 420 

400 410 420 430 440 450 

ml4 7 . pep AVKQPMLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDRENYYNHPL 
I I M I I I I I I i I I I I II I I I 1 I I I I I I I I I i II I I I I I I I II I : I I I II I I II ! I I I i i I 
al4 7 AVKQPMLLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNHPL 
430 440 450 460 470 480 

460 470 480 490 500 510 

ml 4 7. pep PDLGAHRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNK 
I I I I I I I I I I I I I I I I I I I M i I M I I I M I I i I II I I I I I I I I ! I I I I I I I I I I I I I I I 
al47 PDLGAHRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNK 
490 500 510 520 530 540 

520 530 540 550 560 570 

ml 47 .pep HLNKERSNNIELALGYEGDRWQYNLALYRNRFGNYI YAQTLNDGRGPKSIEDDSEMKLVR 

I I M II 1 1 I I I I I I I I I I I I I I I I I I i I I I ! I I I I I I I I I ! II I I I I I I i I I I I M I I I I 
a!47 HLNKERSNNIELALGYEGDRWQYNLALYRNRFGNYI YAQTLNDGRGPKSIEDDSEMKLVR 
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101 KGKLPFETVS QSYALEYGEA AVEIHTDAVK LGSRVLLVDD LIATGGTMLA 
151 GLELIRKLGG EIVEAAAILE FTDLCGGKNI RASGAPLFTL LQNEGCMKG * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted QRF from N. gonorrhoeae 

ml48 / gl48 99.0% identity in 199 aa overlap 

10 20 30 40 50 60 

ml 48 . pep MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL 
I I I I! ) II I M I 1 i I I M I I I t M I t il I t I 11 t i t I i I 1 I 1 I 1 i I i i It I I I t I 1 I I 1 ] 
gl4 8 MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFKDITPVLQSAEYFRLLVDL 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml4 8.pep LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 
I 1 ! i I ! I i I I i ( ! I ! ! I I I I I ! I I I I ! I ( I I I I i I i M I i I i I I t i f I ! I I i I I I [ I ( I I 
g!4 8 LVYRYMDQKIDIVAGLDARGFIIGAALAYQLNVGFVPIRKKGKLPFETVSQSYALEYGEA 

70 80 90 100 110 120 

130 140 ■ . : 160 170 180 

ml 4 8. pep AVEIHTDAVKLGSRVLLVDDLIATGGThiAGLELIRKLGGEIVEAAAILEFTDLQGGKNI 
t I I I t 1 II I I I i I I M ) I I i : I II 1 I I I II i I I i ) I I I ft I I I I I I I ! I I I I I I I M ] | 
gl4 8 AVEIHTDAVKPGSRVLLVDDLVATGGTMLAGLELIRKLGGEIVEAAAILEFTDLQGGKNI 

130 140 150 160 170 180 

190 200 
ml 4 8. pep RASGAPLFTLLQNEGCMKGX 
i II I 1 I i i I I I I I I i I I I I I 
gl4 8 RASGAPLFTLLQNEGCMKGX 

190 200 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 593>: 

al48 . seq 

1 ATGGCGTTAA AAACATCAAA CTTGGAACAC GCAATGCTGG TTCATCCCGA 

51 AGCTATGAGT GTCGGTGCGC TTGCCGACAA AATCCGCAAA AT CG AAAACT 

101 GGCCGCAAAA AGGCATCTTA TTCCACGACA TCACGCCCGT CCTGCAAAGC 

151 GCGGAATACT TCCGACTTTT GGTTGATTTA TTGGTTTACC GCTATATGGA 

201 TCAGAAAATC GACATCGTTG CCGGTTTGGA CGCGCGCGGC TTCATTATCG 

251 GCGCGGCACT CGCCTACCAG CTCAACGTCG GTTTCGTCCC CATCCGCAAA 

301 AAAGGCAAGC TGCCTTTTGA AACCGTATCG CAAAGCTACG CGCTCGAATA 

351 CGGGGAAGCT GCGGTGGAAA TCCACACCGA TGCCGTCAAA CTCGGTTCGC 

4 01 GCGTGCTGCT GGTCGATGAT TTGGTTGCCA CGGGCGGCAC GATGCTTGCC 

451 GGACTGGAGC TGATCCGCAA ACTCGGCGGG GAAATTGTCG AAGCCGCCGC 

501 CATTTTGGAA TTTACCGACC TTCAAGGCGG CAAGAATATC CGTGCAAGCG 

551 GCGCGCCCTT ATTTACCCTG CTTCAAAACG AAGGCTGTAT GAAGGGCTGA 

This corresponds to the amino acid sequence <SEQ ID 594; ORF 148.a>; 

al4 3 . pep 

1 MALKTSNLEH AMLVHPEAMS VGALADKIRK IENWPQKGIL FHDITPVLQS 

51 AEYFRLLVDL LVYRYMDQKI D IVAGLDARG FIIGAALA YQ LNVGFVPIRK 

101 KGKLPFETVS QSYALEYGEA AVEIHTDAVK LGSRVLLVDD LVATGGTMLA 

151 GLELIRKLGG EIVEAAAILE FTDLQGGKNI RASGAPLFTL LQNEGCMKG* 

ml48/al48 99.5% identity in 199 aa overlap 

10 20 30 40 50 60 

ml 4 8. pep MALKTSNLEHAMLVHPEAMSVGALADKIRKIENVJPQKGILFHDITPVLQSAEYFRLLVDL 
i I I I I I i M I M I ! I I I M M M I ( M I I I f I 1 I I I I 1 ( 1 I I 1 1 I I I I II I I I I I i I I I I 
al4 8 MALKTSNLEHAMLVHPEAMSVGALADKIRKIENWPQKGILFHDITPVLQSAEYFRLLVDL 

10 20 30 40 50 60 

™ 80 90 100 110 120 
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451 ATTTACGCCC AAACCTTAAA CGACGGACGC GGCCCCAAAT CCATCGAAGA 

501 CGACAGCGAA ATGAAGCTCG TGCGCTACAA CCAATCCGGC GCCGACTTCT 

551 ACGGCGCGGA AGGCGAAATC TACTTCAAAC CGACACCGCG CTACCGCATC 

6 01 GGCGTTTCCG GCGACTATGT ACGAGGCCGT CTGAAAAACC TGCCTTCCCT 

651 ACCCGGCAGA GAAGATGCCT ACGGCAACCG TCCTTTCATC GCACAGGACG 

701 ACCAAAATGC CCCCCGTGTT CCGGCTGCGC GCCTCGGCTT CCACCTGAAA 

751 GCCTCGCTGA CCGACCGTAT CGATGCCAAT TTGGACTACT ACCGCGTGTT 

301 CGCCCAAAAC AAACTCGCCC GCTACGAAAC GCGCACGCCC GGACACCATA 

851 TGCTCAACCT CGGCGCAAAC TACCGCCGCA ATACGCGCTA TGGCGAGTGG 

901 AATTGGTACG TCAAAGCCGA CAACCTGCTC AACCAATCCG TTTACGCCCA 

951 CAGCAGCTTT CTCTCTGATA CGCCGCAAAT GGGCCGCAGC TTTACCGGCG 

1001 GCGTGAACGT GAAGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 598; ORF 149>: 

ml49 .pep 

1 MLLDNKVQHY SFFGVEQANW DNFTLEGGVR VEKQKASIQY DKALIDRENY 

51 YNHPLPDLGA HRQTARSFAL SGNWYFTPQH KLSLTASHQE RLPSTQELYA 

101 HGKHVATNTF EVGNKHLNKE RSNNIELALG YEGDRWQYNL ALYRNRFGNY 

151 IYAQTLNDGR GPKSIEDDSE MKLVRYNQSG ADFYGAEGEI YFKPTPRYRI 

201 GVSGDYVRGR LKNLPSLPGR EDAYGNRPFI AQDDQNAPRV PAARLGFHLK 

251 ASLTDRIDAN LDYYRVFAQN KLARYETRTP GHHMLNLGAN YRRNTRYGEW 

3 01 NWYVKADNLL NQSVYAHSSF LSDTPQMGRS FTGGVNVKF* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N.gonorrhoeae 

ORF 149 shows 95.9% identity over a 339 aa overlap with a predicted ORF (ORF 149.ng) 
from N. gonorrhoeae: 

ml49/gl49 

10 20 30 40 50 60 

MliLDNKVQHYS FFGVEQANWDNFTLEGGVRVEKQKAS IQYDKALIDRENYYNHPLPDLGA 

IhlhhlllttMNIIIIiilllllllliiMlihlililiMIMIhllillll 

MLIDNNVRHYSFFGVEQANWDNFTLEGGVRVEKQKASIRY1)KALIDRENYYNQPLPDLGA 
10 20 30 40 50 60 

70 80 90 100 110 120 

HRQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 

lllllllllll!lllll|:||||llllll!ll!lllllllllll!l!ll!MI!l!IMI 

HRQTARSFALSGNWYFTPHHKLSLTASKQERLPSTQELYAHGKHVATNTFEVGNKHLNKE 

70 80 90 100 110 120 

130 140 150 160 170 180 

RSNNI E LALGYEGDRWQYNLALYRNRFGNY I YAQTLNDGRGPKS I EDDSEMKLVRYNQSG 

I I I I I I I I I I I : I I I I I II i I I i I i I i i I II I I I I I I i I I I I I I i I I I ! I I I I I I I I i I 
RSNNIELALGYKGDRWQYNLAAYRNRFGNYI YAQTLNDGRGPKS I EDDSEMKLVRYNQSG 
130 140 150 160 170 180 

190 200 210 220 230 240 

ADFYGAEGE I YFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFI AQDDQNAPRV 

1 1 1 1 1 1 ! 1 1 1 1 II I ! 1 1 ! 1 1 1 1 ! 1 1 1 1 1 1 j 1 1 1 1 1 1 1 1 1 1 1 ! I h i 1 1 1 1 1 

ADFYGAEGE I YFKPTPRYR I GVSGDYVRGRLKNLPSLPGREDPYGKRP FI AQADQNAPR I 
190 200 210 220 230 240 

250 260 270 280 290 300 

PAARLGFHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW 

IIIIIIIIIhl!llMllllll!llllllMIIIII!IMIM!lllllllilllllll 

PAARU5FHLKTSLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW 
250 260 270 280 290 300 



m!4 9 . pep 
gl49 

ml4 9 . pep 
gl49 

ml4 9 .pep 
gl49 

ml4 9 .pep 
gl49 

ml4 9 -pep 
gl49 



ml49 .pep 



310 320 330 340 

NW YVKADNLLNQS VYAHSS FLSDTPQMGRS FTGGVNVKFX 

iMIIMIiniliillllMIIMIIIIIhltllilli 
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I I I f ' I I I I I I I I I I I | | M | ! I | ! | | | | | l I | | ( j j | ( | | | | | | | | | | | t | | | | j s| | 
al4 9 PAARLGVHLKASLTDRIDANLDYYRVFAQNKLARYETRTPGHHMLNLGANYRRNTRYGEW 
250 260 270 280 290 300 

310 320 330 340 

ml 4 9 . pep NWYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 
I S I i II II I I I I I ! I | | M | ( | | | | i | i | j | t ) | | | M f I 
al4 9 NWYVKADNLLNQSVYAHSSFLSDTPQMGRSFTGGVNVKFX 
310 320 330 340 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 601 >: 

g!4 9-l .seq 

1 ATGGCACAAA TCACACTCAA ACCCATTGTT TTATCAATTC TTTTAATCAA 

51 CACACCCCTC CTCGCCCAAG CGCATGAAAC TGAGCAATCG GTGGGCTTGG 

101 AAACGGTCAG CGTCGTCGGC AAAAGCCGTC CGCGCGCGAC TTCGGGGCTG 

151 CTGCACACTT CGACCGCCTC CGACAAAATC ATCTCCGGCG ATACTTTGCG 

201 CCAAAAAGCC GTCAACTTGG GCGACGCTTT GGACGGCGTA CCGGGCATCC 

251 ACGCTTCGCA ATACGGCGGC GGCGCATCCG CTCCCGTTAT TCGCGGTCAA 

301 ACGGGCAGAC GGATTAAAGT ATTGAACCAT CACGGCGAAA CGGGCGATAT 

351 GGCGGACTTT TCTCCCGATJ ACGCCATTAT GGTAGATACC GCCTTGTCGC 

401 AACAGGTTGA AATCCTGCGC GGGCCGGTTA CGCTCTTGTA CAGCTCGGGC 

451 AATGTGGCGG GGCTGGTCGA TGTTGCCGAT GGAAAAATCC CCGAAAAAAT 

501 GCCTGAAAAC GGCGTATCGG GCGaagccgG ATTGCGTTTG AGCAGCGGCA 

551 ATTTAGAAAA ACTGACATCC GCAGGCATCA ATATCGGACT GGGCAAAAAC 

601 TTCGTGCTGC ATACCGAAGG CTTGTACCGC AAATCGGGCG ATTACGCCGT 

651 ACCGCGTTAC CGCAATCTGA AACGCCTGCC CGACAGCCAT GCCGATTCGC 

701 AAACGGGCAG CATCGGGCTG TCTTGGGTGG GCGAAAAAGG CTTTATCGGC 

751 GCAGCATACA GCGACCGTCG CGACCGCTAC GGCCTGCCTG CCCACAGCCA 

801 CGAATACGAT GATTGCCACG CCGACATCAT CTGGCAAAAG AGTTTGATCA 

851 ACAAACGCTA TTTGCAGCTT TATCCGCACT TGTTGACCGA AGAAGACATC 

901 GATTACGACA ATCCGGGCTT GAGCTGCGGC TTCCACGACG GCGACGGTGC 

951 ACACGCACAC ACCCACAACG GCAAACCGTG GATAGACCTG CGCAACAAAC 

1001 GCTACGAACT CCGCGCCGAA TGGAAGCAGC CATTCCCCGG TTTTGAAGCC 

1051 CTGCGCGTAC ATCTGAACCG CAATGACTAC CACCACGACG AAAAAGCAGG 

1101 CGATGCAGTA GAAAACTTCT TCAACAACAA AACACACAAC GCCCGTATCG 

1151 AGTTGCGCCA CCAACCCATA GGCCGTCTGA AAGGCAGCTG GGGCGTGCAA 

1201 TATTTGGGAC AAAAATCCAG CGCGCTTTCC GCCATTCCCG AAACCGTCCA 

1251 ACAACCGATG TTGATTGACA ACAATGTCCG CCATTACAGC TTTTTCGGTG 

1301 TAGAACAGGC AAATTGGGAC AACTTCACGC TTGAAGGCGG CGTACGCGTG 

1351 GAAAAACAAA AAGCCTCCAT CCGGTACGAC AAAGCATTGA TTGATCGAGA 

14 01 AAACTACTAC AACCAGCCCC TGCCCGACCT CGGCGCGCAC CGCCAAACCG 

1451 CCCGCTCGTT CGCACTTTCG GGCAACTGGT ATTTCACGCC ACACCACAAA 

1501 CTCAGCCTGA CCGCCTCCCA TCAGGAACGC CTGCCGTCAA CGCAAGAACT 

1551 GTACGCACAC GGCAAGCACG TCGCCACCAA CACCTTTGAA GTCGGCAACA 

1601 AACACCTCAA CAAAGAGCGT TCCAACAATA TCGAACTCGC GCTGGGCTAC 

1651 GAAGGCGACC GCTGGCAATA CAATCTGGCA GCCTACCGCA ACCGATTCGG 

1701 CAACTACATT TACGCCCAAA CCTTAAACGA CGGACGCGGC CCCAAATCCA 

1751 TCGAAGACGA CAGCGAAATG AAGCTCGTGC GCTACAACCA ATCCGGTGCC 

1801 GACTTCTACG GCGCG GAAGG CGAAATCTAC TTCAAACCGA CACCGCGCTA 

1851 CCGCATCGGT GTTTCCGGCG ACTATGTACG AGGCCGTCTG AAAAACCTGC 

1901 CGTCCCTACC CGGCAGGGAA GATCCCTACG GCAAACGTCC CTTCATCGCA 

1951 CAAGCCGACC AAAACGCCCC CCGCATTCCG GCTGCGCGCC TCGGCTTCCA 

2001 CCTGAAAACC TCGCTAACCG ACCGTATCGA TGCCAATTTG GACTACTACC 

2051 GCGTGTTCGC CCAAAACAAA CTCGCCCGCT ACGAAACGCG TACGCCCGGA 

2101 CACCATATGC TCAACCTCGG TGCAAACTAC CGCCGCAATA CGCGCTATGG 

2151 CGAGTGGAAT TGGTACGTCA AAGCCGACAA CCTGCTCAAC CAATCCGTTT 

2201 ACGCCCACAG CAGCTTCCTC TCTGATACGC CGCAAATGGG CCGCAGCTTT 

2251 ACCGGCGGCG TAAACGTGAA GTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 602; ORF 149-l.ng>: 

gl49-l.pep 

1 MAQXTLKPIV LSILX.INTPL LAQA HETEQS VGLETVSWG KSEPRATSGL 

51 LHTSTASDKI ISGDTLRQKA VNLGDALDGV PGIHASQYGG GASAPVIRGQ 

101 TGRRIKVLNH HGETGDMADF SPDHAIMVDT ALSQQVEILR GPVTLLYSSG 

151 NVAGLVDVAD GKIPEKMPEN GVSGEAGLRL SSGNLEKLTS AGINIGLGKN 

201 FVXHTEGLYR KSGDYAVPRY RNLKRLPDSH ADSQTGSIGL SWVGEKGFIG 

251 AAYSDRRDRY GLPAHSHEYD DCHADIIWOK SLINKRYLQL YPHLLTEEDI 

301 DYDNPGLSCG FHDGDGAHAH THNGKPWIDL RWKRYELRAE WKQPFPGFEA 
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551 EGDRWQYNLA LYRNRFGNYJ YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA 

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DAYGNRPFIA 

651 QDDQNAPRVP AARLGFHLKA SLTDRIDANL DYYRVFAQNK LARYETRTPG 

7 01 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL SDTPQMGRSF 

7 51 TGGVNVKF* 

ml49-l/gl4 9~l 9c. 21 identity in 758 aa o/erlap 

10 20 30 40 50 60 

ml 4 9-1 .pep MAQTTLKPI VLSILLINTPLLAQAHETEQSVDLETVSVVGKSRPRATSGLLHTSTASDKI 
111 I ! 1 (( I I I I I I I I I I I 1 I ( I ! 1 I I M I I I I I M I II I I ! I I I I I I II I I I I I I I 1 
g 14 9-1 MAQITLKPIVLSILLINTPLLAQAHETEQSVGLETVSWGKSRPRATSGLLHTSTASDKI 

10 20 30 40 SO 60 

70 80 90 100 110 120 

ml 4 9-1 .pep ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 
i I I I I I I I ! t I t I I I t I I I I I II It ! I t I I 1 I I I I I I I 1 I I i I I I ! I I i II t I I I I I I! I 
gl4 9-l ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml49-l . pep SPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGELGLRL 
I 1 II I! I I I I I I I I I I I I I i I I I I I I ! I I I I I I I I I I I I I I I I I M I I I I I I I I i I I I I 
g!4 9-l SPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGEAGLRL 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml4 9-l .pep SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 
i t t I ( I M I I : i i I I I I I I I t I I I I I II I I I I I I ! i M I t i I I I I I I I I I i i I t I I II I I 
gl4 9-l SSGNLEKLTSAGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml49-l .pep SWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLTMKRYLQLYPHLLTEEDI 
I I I I II i 1 I 1 : 1 I I I I I I : II t I I 1 I I 11 II I i M I I I f I I I I I I I I I I I 1 i I I I I M M 
gl4 9-l SWVGEKGFIGAAYSDRRDRYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 

250 260 270 280 290 300 

310 320 330 340 350 360 

ml 4 9 - 1 . pep DYDNPGLSCGFHDDDNAHAHTHSGRPWI DLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 
I I I I I I I I I I II I I : I i I I I I : I : I I i I I i I II I ! I M I I I I I I I I I I I M I I I j I I 1 I 
gl 4 9- 1 DYDNPGLSCGFHDGDGAHAHTHNGKPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 4 9-1 .pep RHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLQQKSSALSAISEAVKQPM 
: III I I I 1 I I I I I I I I : I : I I I I i II I I ( I 1 I I ! t I 1 I M I 1 I I II I I I I I I : I : I I I 
g 149-1 HHDEKAGDAVENFFNNKTHNARIELRHQPIGRLKGSWGVQYLGQKSSALSAIPETVQQPM 

370 380 390 400 410 420 

430 440 450 460 470 480 

ml4 9-l .pep LLDNKVQHYSFFGVEQANWDMFTLEGGVRVEKQKASIQYDKALIDRENYYNHPLPDLGAH 
I : I I : I : i I II I I I I I I I II I I I I i I I I I I I I I II 1 1 : I I I I I II I i I II I : I I 1 I I ! 1 I 
g 1 4 9- 1 LIDNNVRH YS FFGVEQANWDNFTLEGGVRVEKQKAS I RYDKALI DRENYYNQPLPDLGAH 

430 440 450 460 470 480 

490 500 510 520 530 540 

ml 4 9- 1 . pep RQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 
I I I I I I I I I I I II I I I I : I I II 11 I I I I I I I I I I I I If I I I I I I II II I I I I I 11 I I I I I 
g 1 4 9- 1 RQTARSFALSGNWYFTPHHKLSLTASHQERLPSTQELYAHGKH VATNTFEVGNKHLNKER 

490 500 510 520 530 540 

550 560 570 580 590 600 

ml 4 9-1 .pep S NN I ELALGYEGDRWQYNLALYRNRFGNY I YAQTLNDGRG PKS I EDDSEMKLVRYNQSGA 
I I I 1 I I I I I ( I I I I I I I I I ( I II I I I II I I I I I I I I I I I I I I I I I I I I I ( I I I I I I I I I 
gl4 9-l SNNIELALGYEGDRWQYNLAAYRNRFGKYIYAQTLNDGRGPKSIEDDSEMKLVRYKQSGA 

550 560 570 580 590 600 

610 620 630 640 650 660 

ml 4 9-1. pep DFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDAYGNRPFIAQDDQNAPRVP 
I II I I I i I I I II I I II I I I I I I I I II I I I 11 I I I I H I it I 11:111111 I III I I : I 
gl49-l DFYGAEGEIYFKPTPRYRIGVSGDYVRGRLKNLPSLPGREDPYGKRPFIAQADQNAPRIP 

610 620 630 640 650 660 
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351 LRVHLNRNDY RHDEKAGDAV ENFFNNQTQN ARIELRHQPI GRLKGSWGVQ 

4 01 YLGQKSSALS ATSEAVKQPM LLDNKVQHYS FFGVEQANWD NFTLEGGVRV 

451 EKQKASIRYD KALIDRENYY NHPLPDLGAH RQTARSFALS GNWYFTPQHK 

501 LSLTASHQER LPSTQELYAH GKHVATNTFE VGNKHLNKER SNNIELALGY 

551 EGDRWQYNLA LYRNRFGNYI YAQTLNDGRG PKSIEDDSEM KLVRYNQSGA 

601 DFYGAEGEIY FKPTPRYRIG VSGDYVRGRL KNLPSLPGRE DAYGNRPLIA 

651 QADQNAPRVP AARLGVHLKA SLTDRIDANL DYYRVFAQNK LARYETRTPG 

701 HHMLNLGANY RRNTRYGEWN WYVKADNLLN QSVYAHSSFL SDTPQMGRSF 

751 TGGVNVKF* 

al49-l/ml49-l 98.0% identity in 756 aa overlap 

10 20 30 40 50 60 

al4 9-l .pep MAQTTLKPIVLSILLINT?IiLSQAHGTEQSVGLETVSVVGKSRPRATSGLl.HTSTASDKI 
I I I I I i I 1 II t i 1 t I t I I i I I : I I t HIM t I I t 1 I i I t i 1 i I I i I 1 it M I I I t 1 II 
ml 4 9-1 MAQTTLKPIVLSILLINTPLLAQAHETEQSVDLETVSVVGKSRPRATSGLLHTSTASDKI 

10 20 30 40 50 60 

70 80 90 100 110 120 

al 4 9-1 .pep ISGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 
I I I I I I I I I ! I I I t I I I I I I I I ( I ! I I I I I II I I I I M I I t U II I I I I II t I I II I II 
ml 4 9- 1 I SGDTLRQKAVNLGDALDGVPGIHASQYGGGASAPVIRGQTGRRIKVLNHHGETGDMADF 

70 80 90 100 110 120 

130 140 150 160 170 180 

al 4 9-1 .pep SPDHAIMVDSALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGELGLRL 
I I I I II I I I : M I I I II I I I I I I I I ! 1 I I I I I I M I M I I I I I I I ! I I I ! I I I I j I I I I I 
ml4 9-l SPDHAIMVDTALSQQVEILRGPVTLLYSSGNVAGLVDVADGKIPEKMPENGVSGELGLRL 
130 14Q 150 160 170 130 

190 200 210 220 230 240 

al4 9-l.pep SSGNLEKLT3GGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSHADSQTGSIGL 
I I I I I I I I 1 I I ! i I I II I I I I I I I I I I I I I I I I ! I I I I I ! i I I i II I I I I I i I i I I I I lj 
ml 4 9-1 SSGNLEKLTSGGINIGLGKNFVLHTEGLYRKSGDYAVPRYRNLKRLPDSKADSQTGSIGL 

190 200 210 220 230 240 

250 260 270 280 290 300 

al4 9-l.pep SWVGEKGFIGAAYSDRRDQYGLPAHSHEYDDCHADIIWQKSLINKRYLQLYPHLLTEEDI 
I II 1 i I I f I I : I I I i I I f I I I I I I I I M II 1 I I ! I I I I I I I I I I I ! I I I I M I I N I I M 
ml4 9- 1 SWVGEKGFIGVAYSDRRDQYGLPAHSHEYDDCHADI IWQKSLINKRYLQLYPHLLTEEDI 

250 260 270 280 290 300 

310 320 330 340 350 360 

a 14 9- 1 . pep DYDNPGLSCGFHDDDDAHAHAHNGKPW1DLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 
I I I 1 I I I M I I M I I : I I I I : I : i : I I 1 1 I I II I II I I M I M I M M I I 1 ! I I I I I 1 11 
ml 4 9-1 DYDNPGLSCGFHDDDNAHAHTHSGRPWIDLRNKRYELRAEWKQPFPGFEALRVHLNRNDY 

310 320 330 340 350 360 

370 380 390 400 410 420 

al4 9-l .pep RHDEKAGDAVENFFNNQTQNARIELRHQPIGRLKGSWGVQYLGQKSSALSATSEAVKQPM 
I I I ( I I I I I I I I 1 ( I I ! I I I M I ( I I I I I I I I I ! I I I I I II I I I I I I II I I I I I I { I I 
ml49-l RHDEKAGDAVENFFWNQTONARIELRHQPIGRLKGSWGVQYLQQKSSALSAISEAVKOPM 

370 380 390 400 410 420 

430 440 450 460 470 480 

al49-l . pep LLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIRYDKALIDRENYYNHPLPDLGAH 
I I I I II I I I I I I f I I I I I I I M I I I I I I I I I I I I I I I : I I I I II II II I I M I I I I ] I I f 
ml 4 9-1 LLDNKVQHYSFFGVEQANWDNFTLEGGVRVEKQKASIQYDKALIDREMYYNHPLPDLGAH 

430 440 450 460 470 480 

490 500 510 520 530 540 

al4 9-l .pep RQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 
I i M I I ! I I i I I t M I I I H I M I I I I I I I i I I M II I I I II I I I I I II I I I N I I I I I I 
ml 4 9-1 RQTARSFALSGNWYFTPQHKLSLTASHQERLPSTQELYAHGKHVATNTFEVGNKHLNKER 

490 500 510 520 530 540 

550 560 570 580 590 600 

al4 9-l .pep SNNIELALGYEGDRWOYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 
I I I I M I I I I I I I I I I I I I I I I I I I I I ( I I I i I I I I I I I I II M I II I I I I I I I I I I ! I I 
ml49-l SNNIELAIiGYEGDRWQYNLALYRNRFGNYIYAQTLNDGRGPKSIEDDSEMKLVRYNQSGA 

550 560 570 530 590 600 



610 620 630 640 650 660 
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4 51 GGCGACAGTT CCTATCCGAA TTTCTGTCAG GCAGGTAAAG ATTTCGACCG 

5 01 GCGTTTTGAA GAATTGGGCG CAAAACGGCT GCTCGAACGC GTTGATGCGG 
551 ATTTGGACTT TACCGCCTCC GCAAACGCCT GGACAGATAA TATCGCCGCA 
601 CTCTTAAAAG AAGAAGCCGC AAAAAACCGG GCAACGCCCG CGCCGCAGAC 
651 AACGCCCCCC GCCGGCCTTC AGACGGCACC GGATGGCAGG TACTGCAAGG 
701 CAGCCCCCTT TCCCGCCGCC CTGCTGGCCA ATCAGAAAAT CACCGCCCGC 
751 CAATCCGATA AAGACGTGCG CCACATCGAA ATCGATTTGA GCGGTTCGGA 
801 TTTGCACTAC CTCCCGGGCG ACGCGCTCGG CGTTTGGTTT GACAACGATC 
851 CGGCACTGGT CAGGGAAATC CTAGACCTGC TCGGCATCGA TCCGGCAACG 
901 GAAATACAGG CGGGCGGAAA GATGATGCCG GTTGCGCGCG CACTTTCATC 
951 TCATTTCGAA CTCACGCAAA ACACTCCGGC TTTCGTCAAA GGCTATGCCG 

1001 CGTTCGCCCA TTATGAAGAA CTCGATAAAA TCATTGCCGA TAACGCCGTT 

1051 TTGCAGGATT TCGTGCAAAA CACGCCTATT GTCGATGTGC TGCACCGCTT 

1101 CCCGGCAAGC CTGACGGCAG AACAATTCAT CCGTTTACTG CGTCCGCTTG 

1151 CACCCCGTTT GTATTCGATT TCTTCAGCAC AGGCGGAAGT GGGCGATGAA 

1201 GTGCATTTAA CTGTCGGCGi GGTTCGTTTT GAACACGAAG GCCGCGCCAG 

1251 AACGGGCGGC GCATCGGGTT TCCTTGCCGA CCGGCTGGAA GAGGACGGCA 

1301 CGGTGCGCGT GTTTGTGGAA CGCAACGACG GCTTCAGGCT GCCCGAAGAC 

13 51 AGCCGCAAGC CGATTGTGAT GATCGGCTCG GGCACCGGCG TCGCACCGTT 
1401 CCGCGCTTTC GTCCAACAAC GTGCCGCAGA AAATGCGGAA GGCAAAAACT 

14 51 GGCTGATTTT CGGCAATCCG CATTTTGCCC GTGATTTTCT CTATCAAACC 
1501 GAATGGCAGC AGTTTGCCAA AGACGGCTTC CTGCACAGGT ACGATTTCGC 
1551 CTGGTCCCGC GATCAGGAAG AAAAAATCTA TGTGCAGGAC AAAATCCGCG 
1601 AACAGGCGGA AGGACTTTGG CAATGGCTGC AGGAAGGCGC GCATATCTAT 
1651 GTGTGCGGCG ATGCGGCAAA AATGGCAAAA GACGTGGAAG CCGCCTTGCT 
1701 GGATGTGATT ATCGGGGCAG GACATTTGGA CGAAGAGGGC GCAGAAGAAT 
1751 ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA TGTTTATTGA 

This corresponds to the amino acid sequence <SEQ ED 610; ORF 150>: 

ml 50 -pep 

1 MQNTNPPLPP LPPEI TOLLS GLDAAQWAWL SGYAWAKAGN GASAGLPAIQ 

51 TALPAAEPFS VTVLSASQTG NAKb^ADKAA DSLEAAGIOV SRAELKDYKA 

101 KNIAGERRLL LVTSTQGEGE PPKEAWLHK LLNGKKAPKL DKLQFAVLGL 

151 GDSSYPNFCQ AGKDFDRRFE ELGAKRLLER VDADLDFTAS ANAWTDNIAA 

201 LLKEEAAKNR ATPAPQTTPP AGLQTAPDGR YCKAAPFPAA LLANQKITAR 

251 QSDKDVRHIE IDLSGSDLHY LPGDALGVWF DNDPALVREI LDLLGIDPAT 

3 01 EIQAGGKMMP VARALSSHFE LTQNTPAFVK GYAAFAHYEE LDKI I ADNAV 

3 51 LQDFVQNTPI VDVLHRFPAS LTAEQFIRLL RPLAPRLYSI SSAQAEVGDE 

4 01 VHLTVGWRF EHEGRARTGG ASGFLADRLE EDGTVRVFVE RNDGFRLPED 
4 51 SRKPIVMIGS GTGVAPFRAF VQQRAAENAE GKNWLIFGNP HFARDFLYQT 
501 EWQQFAKDGF LHRYDFAWSR DQEEKIYVQD KIREQAEGLW QWLQEGAHIY 
551 VCGDAAKMAK DVEAALLDVI IGAGHLDEEG AEEYLDMLRE EKRYQRDVY* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 150 shows 91.3% identity over a 369 aa overlap with a predicted ORF (ORF ISO.ng) 
from N. gonorrhoeae; 

ml50/gl50 

210 220 230 240 250 260 

LLKEEAAKNRATPAPQTTPPAGLQTAPDGRYCKAAPFPAALLANQKITARQSDKDVRHIE 

Mil I t i I I I M I I i I I 11 I I ! t I I II M 
YCKADPFPAALLANQKITARQSDKDVRHIE 
10 20 30 

270 280 290 300 310 320 

IDLSGSDLHYLPGDALGVWFDNDPALVREILDLLGIDPATEIQAGGKMMPVARALSSHFE 
I i I I I I I I I I I I I I I I I I I I I I I I I I I ( I I I 1 I I I : I I I I I I I I I I : I I i (! 1 I I [ 
IDLSGSDLHYLPGDALGVWFDNDPALVGEILDLLGINPATEIQAGGKTLPVASALLSHFE 
40 50 60 70 80 90 



mlSO .pep 
gl50 

ml50 . pep 
gl50 



ml50 . pep 



330 340 350 360 370 380 

LTQNTPAFVKGYAAFAHYEELDKIXADNAVLQDFVQNTPIVDVLKRFPASLTAEQFIRLL 
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17 01 GGATGTGATT ATCGGGGCAG GACATTTGGA CGAAGAGGGC GC AG AAGAAT 
17 51 ATTTGGATAT GCTGCGCGAA GAAAAACGCT ATCAGCGTGA TGTTTATTGA 



This corresponds to the amino acid sequence <SEQ ID 612; ORF 150.a>: 

al50 . pep 

1 MQNTNPPLPP MPPEITQLLS GLDAAQWAWL SGYAWAKAGN GASAGLPALQ 

51 TALPTAEPFS VTVLSASQTG NAKSVADKAA DSLEAAGIQV SRAELKDYKA 

101 KNIAGERRLL LVTSTQGEGE PPEEAWLHK LLNGKKAPKL DKLQFAVLGL 

151 GDSSYPNFCR AGKDFDKRFE ELGAKRLLER VDADLDFAAA ADGWTDNIAA 

201 LLKEEAAKNR ATPAPQTTPP AG LQTAPDGP YCKADPFPAA LLANQKITAR 

251 QSDKDVRHIE IDLSGSDLKY LPGDALGVWF DNDPALVREI LDLLGIDQAT 

301 EIQAGGKTLP VASALLSHFE LTQNTPAFVK GYAPFADDDE LDRIAADNAV 

351 LQGFVQSTPI ADVLHRFPAK LTAEQFAGLL RPLAPRLYSI SSSQAEVGDE 

4 01 VHLTVGAVRF EHEGRARAGG ASGFLADRLE EDGTVRVFVE RNDGFRLPED 

4 51 SRKPIVMIGS GTGVAPFRAF VQQRAAENAE GKNWLFFGNP HFARDFLYQT 

501 EWQQFAKDGF LHR.7DFAWSR DOEEKIYVQD KIREQAEGLW QWLQEGAHIY 

551 VCG DAAKMAK DVEAALLDVI IGAGHLDEEG AEEYLDMLRE EKRYQRDVY* 



m!50/al50 94,8% identity in 599 aa overlap 

10 20 30 40 50 60 

ml 50 . pep MQNTNPPLPPLPPEITQLLSGLDAAQWAWLSGYAWAKAGNGASAGLPALQTALPAAEPFS 
I I It I t I I I 1 : I I I I 1 I I I I I I I I II i II } I I II I M ) I I I I I f I I II M I I M : I I M I 
al5D MQNTNPPLPPMPPEITQLLSGLDAAQWAWLSGYAWAKAGNGASAGLPALQTALPTAEPFS 
10 20 30 40 50 60 



70 80 90 100 110 120 

ml 50 . pep VTVLSASQTGNAKSVADKAADSLEAAGIQVSRAELKDYKAKN1AGERRLLLVT5TQGEGE 

1 1 1 m i i 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 m n 1 1 :: 1 1 1 1 1 1 1 1 1 1 1 n i ! n 1 1 1 1 1 1 1 i 1 1 1 

al50 VTVLSASQTGNAKSVADKAADSLEAAGIQVSRAELKDYKAKNIAGERRLLLVTSTQGEGE 
70 80 90 100 110 120 



130 140 150 160 170 180 

ml 50. pep PPKEAWLHKLLNGKKAPKLDKLQFAVLGLGDSSYPNFCQAGKDFDRRFEELGAKRLLER 
I I : M I I i I I I I I I ! I M I I I I I I M II I I If I I I I I I I : I I I I I I : I I I I I ! ! I i I M I 
al50 PPEEAVVLHKLLNGKKAPKLDKLQFAVLGLGDSSYPNFCRAGKDFDKRFEELGAKRLLER 

130 140 150 160 170 ISO 



190 200 210 220 230 240 

ml 50 . pep VDADLDFTASANAWTDNIAALLKEEAAKNPATPAPQTTPPAGLQTAPDGRYCKAAPFPAA 
II I I I I I : I : I : : I I I f II I I I I I I I I II 1 I I I I I I ( I 1 ! I I I I I ! I ! I I I I I I 111(1 
al50 VDADLDFAAAADGWTDNIAALLKEEAAKNRATPAPQTTPPAGLQTAPDGRYCKADPFPAA 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 50 .pep LLANQKITARQSDKDVRHIEIDLSGSDLHYLPGDALGVWFDNDPALVREILDLLGIDPAT 
! I I I I I I I I U I I ! M I I M II II I I I I I I I II I I I i ! I I I II I I I III I I i I M I i II 
al50 LLANQKITARQSDKDVRHI EI DLSGSDLHYLPGDALGVWFDNDPALVREI LDLLGIDQAT 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml 50 .pep EIQAGGKMMPVARALSSHFELTQNTPAFVKGYAAFAHYEELDKIIADNAVLODFVQNTPI 
I I I II I I : I I I II I II i I I I I I I I I I I i I I I I : I I I : I lllilll i I I : SI I 
a!50 EIQAGGKTLPVASALLSHFELTQNTPAFVKGYAPFADDDELDRIAADNAVLOGFVQSTPI 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml50 .pep VDVLHRFPASLTAEQFIRLLRPLAPRLYSISSAQAEVGDEVHLTVGVVRFEHEGRARTGG 
: I I II I I 1 I : I I I I M I 11 I M I I I I I I I I : I I I t 11 I 1 I 1 1 I I : I I I I M 1 1 I I : M 
a 150 ADVLHRFPAKLTAEQFAGLLRPLAPRLYSISSSQAEVGDEVHLTVGAVRFEHEGRARAGG 

370 380 390 400 410 420 



430 440 450 460 470 480 

ml 50 .pep ASGFLADRLEEDGTVRVFVERNDCFRLPEDSRKPIVMIGSGTGVAPFRAFVQQRAAENAE 
I I I I I I I I I I I I I I I I I I I I 1 I II I I I I I I I I I I I I I I I I I I I I i I I II I I I I 1 I I I I II 
a 1 5 0 ASGFLADRLEEDGTVRVFVERKDG FRLPEDSRKP I VMIGSGTGVAPFRAFVQQRAAENAE 

430 440 450 460 470 480 

490 500 510 520 530 540 

mlSO . pep GKNWLIFGNPHFARDFLYQTEWQQFAKDGFLHRYDFAWSRDQEEKIYVODKIREQAEGLW 
I I M I : I I f I I I I I I I I I i I I I I I I I I (((! I I I I I I i i I I I I t I I I I I I I I ( I I 1 I I I I 
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501 NLEDRGRMFV SPNDKIYEGM I IGIHSRDND LWNPLKGKK LTNIRASGTD 

551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QSIRLRMRYL SELERRRHFK 

601 KLD* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 615>: 

m!51 . seq 

1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCCCACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATC CGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAGAA 

151 CGCGGCATCA CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTA 

201 CCACATCAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 

251 TAGAGCGCGT TTTGGGGATG GTGGACTGCG TCGTCTTGTT GGTGGACGCG 

3Q1 CAGGAAGGCC CGATGCCGCA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 

3 51 TTTGGGGCTG AAACCGATTG TCGTCATCAA CAAAATCGAC AAGCCGTCCG 

4 01 CTCGTCCGAG CTGGGTTATC GACCAAACTT TCGAGCTGTT CGACAACTTG 

4 51 GGCGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTACG CTTCAGGGTT 
501 GAGCGGTTTC GCCAAATTGG AAGAAACCGA CGAGAGCAAC GACATGCGTC 

5 51 CGCTGTTCGA TACTATCTTA AAATATACGC CTGCACCGAG CGGCAGCGCG 

6 01 GACGAAACGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 
651 CGGCCGCCTC GGTATCGGTC GTATCTTGAA CGGACGCATC AAACCCGGCC 
701 AAACCGTTGC CGTCATGAAC CACGATCAGC AAATCGCCCA AGGCCGCATC 
751 AACCAGCTTT TGGGTTTCAA AGGTTTGGAA CGCGTGCCGC TTGAAGAAGC 
801 CGAAGCCGGC GACATCGTGA TTATTTCCGG TATCGAAGAC ATCGGTATCG 
851 GCGTAACCAT CACCGACAAA GACAATCCCA AAGGCCTACC GATGTTGAGC 
901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTCAACA CCAGCCCGCT 
951 GGCGGGTACG GAAGGCAAAT TCGTAACCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTCGCCC TGCGCGTGGA AGATACCGCC 

1051 GATGCCGACG TGTTCCGCGT ATCCGGGCGC GGCGAGCTGC ACCTGACCAT 

1101 TTTGCTGGAA AACATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAAC 

1151 CGCGCGTCGT GTACCGCGAC ATCGACGGTC AAAAATGCGA ACCGTATGAA 

1201 AACCTGACCG TGGATGTACC CGACGACAAC CAAGGCGCGG TAATGGAAGA 

12 51 ACTCGGCCGC CGCCGTGGCG AACTGACTAA TATGGAAAGC GACGGCAACG 

13 01 GACGCACCCG CCTCGAATAC CATATTCCAG CGCGCGGCTT GATCGGTTTC 

13 51 CAAGGCGAAT TTATGACCCT GACGCGCGGG GTCGGGCTGA TGAGCCACGT 
1401 GTTCGACGAT TACGCGCCCG TCAAACCCGA TATGCCCGGC CGCCACAACG 

14 51 GCGTGCTGGT GTCCCAAGAG CAGGGCGAGG CAGTCGCTTA CGCCTTGTGG 
1501 AATCTGGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAATCTA 

15 51 CGAAGGCATG ATTATCGGCA TCCACAGTCG CGACAACGAT TTGGTGGTCA 
1601 ACCCGCTCAA AGGCAAAAAA CTTACCAACA TCCGTGCCAG CGGTACCGAC 
1651 GAAGCCGTTC GCCTGACCAC GCCAATCAAG CTGACGCTGG AAGGTGCGGT 
1701 TGAGTTTATC GACGATGACG AACTCGTTGA AATCACGCCG CAATCCATCC 
1751 GTCTGCGCAA GCGTTACTTG AGCGAATTGG AACGCCGCCG CCACTTTAAA 
1301 AAGCTGGATT GA 

This corresponds to the amino acid sequence <SEQ ID 616; ORF 151>: 

ml 51 .pep 

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANQQVDER VMDSNDLEKE 

51 RGITILAKNT AIDYEGYHIN IVDTPGHADF GGE VERVLGM VDCWLLVDA 

101 QEGPMPQTRF VTKKALALGL KPIWINKID KPSARPSWVI DQTFELFDNL 

151 GATDEQLDFP IVYASGLSGF AKLEETDESN DMRPLFDTIL KYTPAFSGSA 

201 DETLQLQISQ LDYDNYTGRL GIGRILNGRI KPGQTVAVMN HDQQIAQGRI 

251 NQLLGFKGLE RVPLEEAEAG D IVIISGIED IGIGVTIT DK DNPKGLPMLS 

301 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA 

3 51 DADVFRVSGR GELHLT1 LLE NMRREGYELA VGKPRWYRD IDGQKCEPYE 
401 NLTVDVPDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HIPARGLIGF 

4 51 QGEFMTLTRG VGLMSHVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW 
501 NLEDRGRMFV SPNDKIYEGM I IGIHSRDND LWNPLKGKK LTNIRASGTD 
551 EAVRLTTPIK LTLEGAVEFI DDDELVEITP QS I RLRKRYL SELERRRHFK 
601 KLD* 

Computer analysis of this amino acid sequence gave the following results: 



WO 99/57280 PCT7US99/09346 

433 



m!51.pep KLDX 
I I 1 I 

gl51 KLDX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 61 7>: 

al51 . seq 

1 ATGAAACAAA TCCGCAACAT CGCCATCATC GCCCACGTCG ACCACGGCAA 

51 AACCACATTG GTCGACCAAC TGCTGCGCCA ATCCGGCACA TTCCGCGCCA 

101 ACCAGCAGGT TGACGAGCGC GTGATGGACA GCAACGACCT TGAAAAAGAA 

151 CGCGGCATCA CCATCCTCGC CAAAAACACC GCCATCGATT ACGAAGGCTA 

201 CCACATCAAT ATCGTCGACA CGCCGGGACA CGCCGACTTC GGCGGCGAAG 

251 TAGAGCGAGT TTTGGGGATG GTGGACTGCG TCGTCTTGTT GGTGGACGCG 

301 CAGGAAGGCC CGATGCCGCA AACCCGTTTC GTGACCAAAA AAGCCTTGGC 

351 TTTGGGGCTG AAACCGATTG TCGTCATCAA TAAAATCGAC AAACCGTCCG 

4 01 CCCGTCCGAG CTGGGTCATC GACCAAACTT TCGAGCTGTT CGACAACTTG 

4 51 GGCGCGACCG ACGAGCAGTT GGATTTCCCG ATTGTTTATG CTTCCGGTCT 

501 GTCCGGTTTC GCCAAATTGG AAGAAACCGA CGAGAGCAAC GACATGCGTC 

551 CGCTGTTCGA TACTATCTTA AAATATACGC CTGCACCGAG CGGCAGCGCG 

601 GACGAAACGC TGCAACTGCA AATTTCCCAA CTCGACTACG ACAACTACAC 

651 CGGCCGCCTC GGTATCGGTC GTATCT1 - v CGGACGTATC AAGCCCGGTC 

701 AAGTTGTTGC CGTCATGAAC CACGATCAAC AAATCGCCCA AGGCCGCATC 

7 51 AACCAGCTTT TGGGTTTCAA AGGTTTAGAA CGCGTGCCGC TTGAAGAAGC 

801 CGAAGCCGGC GACATCGTGA TTATTTCCGG TATTGAAGAC ATCGGCATCG 

851 GCGTAACCAT CACCGACAAA GACAACCCCA AAGGCCTGCC GATGTTGAGC 

901 GTGGACGAAC CGACGCTGAC GATGGACTTT ATGGTCAACA CCAGCCCGTT 

951 GGCAGGTACG GAAGGCAAAT TCGTAACCAG CCGCCAAATC CGCGACCGCC 

1001 TGCAAAAAGA ATTGCTGACC AACGTCGCCC TGCGCGTGGA AGATACCGCC 

1051 GATGCCGACG TGTTCCGCGT ATCCGGGCGC GGCGAGCTGC ACCTGACCAT 

1101 TTTGCTGGAA AACATGCGCC GCGAAGGCTA CGAACTCGCC GTCGGCAAAC 

1151 CGCGCGTCGT GTACCGCGAC ATCGACGGTC AAAAATGCGA ACCGTATGAA 

1201 AACCTGACCG TGGACGTACC CGACGACAAC CAAGGCGCGG TAATGGAAGA 

1251 ACTCGGCCGC CGCCGTGGCG AACTGACTAA TATGGAAAGC GACGGCAACG 

1301 GACGCACCCG CCTCGAATAC CATATTCCAG CGCGCGGCTT GATCGGCTTC 

1351 CAAGGCGAAT TTATGACCCT GACGCGCGGG GTCGGGCTGA TGAGCCACGT 

14 01 GTTCGACGAT TACGCGCCCG TCAAACCCGA TATGCCTGGC CGCCACAACG 

14 51 GCGTGCTGGT GTCCCAAGAG CAGGGCGAGG CAGTCGCTTA CGCCTTGTGG 

1501 AATCTGGAAG ACCGCGGCCG TATGTTCGTA TCGCCCAACG ACAAAATCTA 

1551 CGAAGGTATG ATTATCGGCA TCCACAGTCG CGACAACGAT TTGGTGGTCA 

1601 ACCCGCTCAA AGGCAAAAAA CTTACCAACA TCCGTGCCAG CGGTACCGAC 

1651 GAAGCCGTTC GCCTGACCAC GCCGATTAAG CTGACGCTGG AAGGTGCGGT 

1701 CGAGTTTATC GACGATGATG AGCTGGTAGA AATCACGCCG CAATCCATCC 

17 51 GTCTGCGCAA GCGTTACTTG AGCGAATTGG AACGCCGCCG CCATTTCAAA 

1801 AAGCTAGATT GA 

This corresponds to the amino acid sequence <SEQ ID 618; ORF 151.a>: 

al51 . pep 

1 MKQIRNIAII AHVDHGKTTL VDQLLRQSGT FRANQQVDER VMDSNDLEKE 

51 RGITILAKNT AIDYEGYHIN IVDTPGHADF GGE VERVLGM VDCWLLVDA 

101 QEGPMPQTRF VTKKALALGL KPIWINKID KPSARPSWVI DQTFELFDNL 

151 GATDEQLDFP IVYASGLSGF AKLEETDESN DMRPLFDTIL KYTPAPSGSA 

201 DETLQLQISQ LDYDNYTGRL GIGRILNGRI KPGQWAVMN HDQQIAQGRI 

251 NQLLGFKGLE RVPLEEAEAG D IVIISGIED IGIGVTIT DK DNPKGLPMLS 

301 VDEPTLTMDF MVNTSPLAGT EGKFVTSRQI RDRLQKELLT NVALRVEDTA 

351 DADVFRVSGR GELHLTILLE NMRREGYELA VGKPRVVYRD IDGQKCEPYE 

401 NLTVDVPDDN QGAVMEELGR RRGELTNMES DGNGRTRLEY HIPARGLIGF 

4 51 QGEFMTLTRG VGLMSHVFDD YAPVKPDMPG RHNGVLVSQE QGEAVAYALW 

501 NLEDRGRMFV SPNDKIYEGM IIGIHSRDND LWNPLKGKK LTNIRASGTD 

551 EAVRLTTPIK LTLEGAVEFI DDDELVEIT? QSIRLRKRYL SELERRRHFK 

601 KLD* 

m \ 5 1 /a 1 5 1 99.8% identity in 603 aa overlap 

10 20 30 40 SO 60 
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151 CTCGTATTCC GCCTCTGCTG GGGCATTTGG GGCAgcgATA CCGCCCGTTT 

201 CTCccgTtTC GTCCGAGGTT GGGCAGGTAT ACGCGGCTAT CTGAAAAAcg 

251 gCATTCCCGA ACAtatcCAG CCCGGACACA ACCCCTTGGG CGCACTgatg 

301 gtcGTTGCGC TTTTGgccgc cgtcTCATTT CAagtcggcA CGGGGCTTTT 

351 Tgccgccaat gaaaacacct tcagcaCCAa cggctacctc aaccatttgg 

401 tttccgaaca tacgGGCAGC CTTATACGGA AAATCCACCT CAACTTTTTC 

4 51 AAG CTGCTCG CCGTTTTTTC CGCAGTCCAC ATCGCCGCCG TCGCCGCATA 

501 CCGCATATTC AAAAAGAAAA ACCTCGTCCG CCCGATGATA ACCGGCTTCA 

551 AATACATCGA AGGCAAAACC TCAATCCGC? TTGCCGGCAA AGCCGCGCTT 

601 GCCGCCGCAT TATCGGTTGC CGCGCTTGCC GCAGCCGCCA TCCTGCTCCT 

651 GTCCTGA 

This corresponds to the amino acid sequence <SEQ ID 620; ORF 152.ng>: 

gl52 -pep 

1 MKNKTKVWDF PTRLFHWLLA ASLPFMWYSA KAGGDMLQWH TR VGLLVLFL 

51 LVFRLCWGI W GSDTARFSRF VRGWAGIRGY LKNGIPEHIQ PGHNPLGALM 

101 WALLAAVSF QV GTGLFAAN ENTFSTNGYL NHLVSEHTGS LIRKIHLNFF 

151 KLLAVFSAVH IAAVA AYRIF KKKNLVRPMI TGFKYIEGKT SIRFAGKAAL 

201 AAALSVAALA AAAIL LLS* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 62 1>: 



ml52 . seq 




1 


ATGAAAAACA 


51 


GCTGCTTGCC 


101 


GCGATATGCT 


151 


CTCGTATTTC 


201 


TTCCCGTTTC 


251 


GTATTCCCGA 


301 


GTCGTTGCGC 


351 


TGCCGCCGAT 


401 


TTTCCGAACA 


451 


AAGCTGCTCG 


501 


CCGCGTATTC 


551 


AATACATCGA 


601 


GCCGCCGCAT 


651 


GTCCTGA 



This corresponds to the amino acid sequence <SEQ ID 622; ORF 152>: 

ml52 .pep 

1 MKNKTKVWDL PTRLFHWLLA ASLPFMWYSA KAGGDMLQWH TR VGLFVLFL 

51 LVFRLCWGI W GSDTARFSRF VQGWAGIRGY LKNGIPEHIQ PGHNP LGALM 

101 WALLAAVSF QV GTGLFAAD ENTFSTNGYL NHLVSEHTGS LMRKIHLNFF 

151 KLLAVFSAIH IAAVAA YRVF KKKNLILPMI TGFKYIEGKT SIRFAGKAAL 

201 AAALSVASLA AAAILLL S* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 152 shows 95.4% identity over a 218 aa overlap with a predicted ORF (ORF 152.ng) 
from N. gonorrhoeae: 

ml52/gl52 

10 20 30 40 50 60 

ml52 .pep MKNKTKVWDLPTRLFHV?LLAASLPFMWYSAKAGGDMLQWHTRVGLFVLFLLVFRLCWGIW 

l!l!l!llI:!lllllllllllllllll!lll!!l!l!ll!!MhililIlll!!l!ll 

gl52 MKNKTKVWDFPTRLFHWLLAASLPFMWYSAKAGGDMLQWHTRVGLLVLFLLVFRLCWGIW 
10 20 30 40 50 60 

70 80 90 100 110 120 

ml52 .pep GSDTARFSRFVQGWAGIRGYLKNGIPEHIQPGHNPLGALMWALLAAVSFQVGTGLFAAD 

l!l!l!l!llhlll!!l!l!lll!lli!l!i!!l!tlil!lllll!IIIIMII!l!i: 

gl52 GSDTARFSRPVRGWAGIRGYLKNGI PEHIQPGHNPLGALMVVALLAAVSFQVGTGLFAAN 

70 80 SO 100 110 120 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 625>: 

gl53 . seq 

1 atggggtttg cttaCAgtat gacgtatatc gaggtCGGGa taccggaggc 

51 ggcacccgtc ctttCgctGC CCGAGATgat gcgcccgatG GTGTTt Cagg 

101 attATGGTTT TttggcCGAA GTGATGTTTG TGctgaCTTT cGGCGcgcCG 

151 GTTCTGTTtC TGCTGCTGTG CCTGTATGTC TATGCCGCGC TGATACGGAA 

2 01 ACAGGCGTAT CCTGCGCTGC GTTTGGCAAC GCGTGTGATG GTGCGCTTGA 

2 51 GGCAGGCGAT GATGGTGGAT GTGTTTTTTG TTTCCACTCT GGTGGCGTAT 

3 01 ATCAAGCTCT CGTCTGTGGC AAAGGTTCGC TTCGGGCCGG CGTTTTATCT 
351 GATGTTCGCG CTGTCGGTTA TGCTGATTCG GACTTCGGTA TCGGTTCCCC 

4 01 AGCATTGGGT GTATTTCCA^ ATCGGGCGGC TGACGGGGAA TAATGCGGTT 
4 51 CAGACGGCAT CGGAAGGCAA AACCTGTTGC AGCCGCTGCC TGTATTTccg 
501 cgacAGTgcc gaacccCCCT GCGGGGTGTg cgGCGcggaA CTgtacggcg 
551 gacggccgaa aagtCTGAGt atttCgtCGG CGTTTCTgac ggcggcggTT 
601 GTTTTGTATT TCCctgCcaa TATCctgccg attaTGAttt cgtccAATCc 
651 tgccgccacg GAGGcCAACA CCATCTTTAG CGGCATCGCT TATATGTGGG 
701 ACGagggcgA CAGGCTGATT GCGGCGGTTA TTTTCAGCGC GAGTATTTTG 
751 GTGCCGGTGC TGAAGATTGC GGCAATGTCG GTTTTGATTG CGGCGGCACG 
801 GTTCGCTTTG CCGGCGGGCG CAAAGAAATT GTCGCACCTC tacCGCATCA 
851 CCGAAGCGGT CGGCCGCTJG TCGATGATTG ATATTTTTGT GATTATTATT 
901 TTGATGTGTT CGTTCCacaC TTATGCCGCG CGCGTCATTC CGGGCAGTGC 
951 GGCAGTCTAT TTCTGCCTGG TCGTGATTTT GACGATGCTG TCCGCCTATT 

1001 ATTTCGACCC GCGCCTGCTT TGGGACAAAC GCGCTTCAGA CGGCATTGCT 

1051 TTCAACGAAA CGGAAAAATA TGACTGA 

This corresponds to the amino acid sequence <SEQ ID 626; ORF 153.ng>: 

gl53 .pep 

1 MGFAYSMTYI EVGIPEAASV LSLPEMMRLM VFQDYGFLAE VMFVLTFGAP 

51 VLFLLLC LYV YAALIRKQAY PALRLATRVM VRLRQA MMVP VFFVSTLVAY 

101 IKLSSVAKVR FGP AFYLMFA L5VMLIRTSV SVPQHWVYFQ IGRLTGNNAV 

151 QTASEGKTCC SRCLYFRDSA ESPCGVCGAE LYGGRPKSLS ISSAFLTAAV 

201 VLYFPANILP I MISSNPAAT EANTIFSGIA YMWDEGDRLI AAVIFSASIL 

251 VPVLKIAAMS VLIAA ARFAL PAGAKKLSHL YRITEA VGRW SMIDIFVIII 

301 LKCSFHTYAA R VIPGSAAVY FCLWILT ML SAYYFDPRLL WDKRASDGIA 

3 51 FNETEKYD* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 627>: 

ml53 . seq 

1 ATGGCGTTTG CTTACGGTAT GACGTATATC GAGGTCGGGA TACCGGGTGC 

51 GGCATCCGTC CTTTCGCTGC CCGAGATGAT GCGCCTGATG GTGTTTCAGG 

101 ATTATGGTTT TTTGGCCGAA GTGATGTTTG TGCTGACTTT CGGCGCGCCG 

151 GTTCTGTTTC TGCTGCTGTG CCTGTATGTC TATGCCGCGC TGATACGGAA 

201 ACAGGCGTAT CCTGCGCTGC GTTTGGCAAC GCGTGTGATG GTGCGCTTGA 

251 GACAGGCGAT GATGGTGGAT GTGTTTTTTG TTTCCACTTT GGTGGCGTAT 

301 ATCAAGCTCT CGTCTGTGGC AGAGGTTCGC TTCGGGCCGG CGTTTTATCT 

351 GATGTTCGCG CTGTCAGTTA TGCTGATTCG GACTTCGGTA TCGGTTCCCC 

401 AGCATTGGGT GTATTTTCAA ATCGGGCGGC TGACGGGGGA TAATGCGGTT 

451 CAGACGGCAT CGGAAGGTAA AACCTGTTGC AGCCGCTGCC TGTATTTCCG 

501 CGACAGTGCC GAATCCCCCT GCGGCGTGTG CGGTGCGGAA CTGTACCGCC 

551 GACGGCCGAA AAGTCTGAGT ATTTCGTCGG CGTTTCTGAC GGCGGCGGTT 

601 ATTTTGTATT TCCCTGCCAA TATCCTGCCG ATTATGATTT CGTCCAATCC 

651 TGCCGCCACG GAGGTCAATA CCATCCTTAA CGGCATCGCT TATATGTGGG 

701 ACGAGGGCGA CAGGCTGATT GCGGCGGTTA TTTTCAGCGC GAGTATTTTG 

751 GTGCCGGTAC TGAAGATTGC GGCAATGTCG GTTTTGATTG CGTCCGCCCG 

801 CTTCGCTTTG CCAACGGGTG CAAAGAAATT GTCGCACCTC TACCGCATCA 

851 CCGAAGCGGT CGGCCGCTGG TCGATGATTG ATATTTTTGT GATTATTATT 

901 TTGATGTGTT CGTTCCACAC TTATGCCGCG CGCGTCATTC CGGGCAGTGC 

951 GGCAGTCTAT TTCTGCCTGG TCGTGATTCT GACGATGCTG TCCGCCTATT 

1001 ATTTCGACCC GCGCCTGCTT TGGGACAAAC GCGCTTCAGA CGGCATTGCT 

1051 TTCAATGAAA CGGAAAAACA TGACTGA 

This corresponds to the amino acid sequence <SEQ ID 628; ORF 153>: 

ml53 .pep 

1 MAFAYGMTYI EVGIPGAASV LSLPEMMRLM VFQDYGFLAE VMFVLTFGA? 

51 VLFLLLC LYV YAALIRKQAY PALRLATRVM VRLRQA MMVP VFFVSTLVAY 
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851 CCGAAGCGGT CGGCCGCTGG 

901 TTGATGTGTT CGTTCCACAC 

951 GGCAGTCTAT TTCTGCCTGG 

1001 ATTTCGACCC GCGCCTGCTT 

1051 TTCAATGAAA CGGAAAAACA 



TCGATGATTG ATATTTTTGT GATTATTATT 

TTATGCCGCG CGCGTCATTC CGGGCAGTGC 

TCGTGATTCT GACGATGCTG TCCGCCTATT 

TGGGACAAAC GCGCTTCAGA CGGCATTGCT 
TGACTGA 



This corresponds to the amino acid sequence <SEQ ID 630; ORF 153.a>; 



al53 .pep 












I 


HAFAYGMTYI 


EVGIPGAASV 


LSLPEMMRLM 


VFQDYGFLAE 


VMFVLT FGAP 


51 


VLFLLLCLYV 


YAALIRKQAY 


PALRLATRVM 


VRLRQAMMVD 


VFFVSTLVAY 


101 


IKLSSVAEVR 


FGSAFYLMFA 


LSVMLIRTSV 


SVPQHWVYF0 


IGRLTGDNAV 


151 


QTASEGKTCC 


SRCLYFRDSA 


ESPCGVCGAE 


LYRRRPKSLS 


ISSAFLTAAV 


201 


ILYFPANILP 


IMISSNPAAT 


EVNTILNGIA 


YMWDEGDRLI 


AAVIFSASIL 


251 


VPVLKIAAMS 


VLIASARFAL 


PTGAKKLSHL 


YRITEAVGRW 


SMIDIFVIII 


301 


LMCSFHTYAA 


RVIPGSAAVY 


FCLWILTML 


SAYYFDPRLL 


WDKRASDGI A 


351 


FNETEKHD* 











ml53/al53 99.7% identity in 358 aa overlap 

10 20 30 40 50 60 

ml 53. pep MAFAYGMTYIEVGIPGAASVLSLPEMMRLMVFQDYGFLAEVMFVLTFGAPVLFLLLCLYV 

I ! I I I I I I I ! I II II i ( I I I II t ! I ! ! I I M I I I I i I t I II ( ( I ! I I ! I ! II I I I I I I I ! 
al53 MAFAYGMTYIEVG1PGAASVLSLPEMMRLMVF0DYGFLAEVMFVLTFGAPVLFLLLCLYV 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 53 . pep YAALIRKQAY PALRLATRVMVRLRQAMMVDVFFVSTLVAYIKLSSVAEVRFGPAFYLMFA 

I M I i I I 1 I I I I ( I I ( I I I I II I I I M ( ! t I ! i 1 I I I 1 I I i i ! 1 ! I ! I I I I t II I M I i 
a 153 YAALIRKQAYPALRLATRVMVRLRQAMMVDVFFVSTLVAYIKLSSVAEVRFGSAFYLMFA 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml53.pep LSVMLIRTSVSVPQHWVYFQIGRLTGDNAVQTASEGKTCCSRCLYFRDSAESPCGVCGAE 
I I I ! I I I i I I I I I I I I I I I I I II i I I I II I ! I I I I I I I II I I ! I I I i I I I i I I I I ! I I I I 
al53 LSVMLIRTSVSVPQHWVYFQIGRLTGDNAVQTASEGKTCCSRCLYFRDSAESPCGVCGAE 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml53. pep LYRRRPKSLSISSAFLTAAVILYFPANILPIMISSNPAATEVNTILNGIAYMWDEGDRLI 
I I 1 ! I I I I I I I I I I II I I I I I I I 1 I I ! I I ! I 1 I I I I I I i M I I II II I I ! i I I I S I I i i I 
al53 LYRRRPKSLS I SSAFLTAAVILYFPANILPIMISSNPAATEVNTILNG I A YMWDEGDRLI 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml53 . pep AAVI FSASILVPVLKIAAMSVLIASARFALPTGAKKLSHLYRITEAVGRWSMI DI FVI I I 

I I ! I I i I I I I I 11 I I I I I i I I I I I I M I I I I I H M I I I I I I I I I I I I I I I I I ! I I II II 
a!53 AAVIFSASILVPVLKIAAMSVLIASARFALPTGAKKLSHLYRITEAVGRWSMIDIFVIII 

250 260 270 280 290 300 



310 320 330 340 350 359 

ml53.pep LMCSFHTYAARVIPGSAAVYFCLWILTMLSAYYFDPRLLWDKRASDGIAFNETEKHDX 
I M I I I I I I II 1 I I M II I I I i I i ! I I I I 11 I I I I I I I I I I I I I I 1 I II I I I II I I I I I 
al53 LMCSFHTYAARVIPGSAAVYFCLVVILTMLSAYYFDPRLLWDKRASDGIAFNETEKHDX 

310 320 330 340 350 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 63 1>: 

gl54 .eeq 

1 ATGACTGACA ACAGCCCTCC TCCAAACGGA CACGCTCAAG CACGCGTCCG 

51 CAAAAACAAC accttCCtCT CCGCCGTCTG GCTGGTCCCG CTGATCGCGC 

101 TGATTGCCGG CGGCTGGCTT TGGGTTAAGG AAATCCGCAA CAGGGGGCCT 

151 GTGGTTACGC TCTTGATGGA CAGCGCGGAA GGCATCGAAG TCAACAATAC 

201 GGTCATTAAG GTATTGAGCA TCGATGTCGG ACGCGTTACC CGAATCAAAC 

251 TGCGCGACGA CCAAAAAGGC GTGGAAGTTA CTGCCCAACT CAATGCGGAC 

301 GTATCCGGCC TCATCCGCAG CGATACCCAG TTTTGGGTGG TCAAGCCGCG 

351 TATCGACCAA AGCGGcgtAA CCGGTTTGGG TACGCTGCTT TCGGGTTCGT 
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1201 


TCACCTAAGC 


TGCGACCGCA 


1251 


GACCCAGGGC 


GGCGGTTTGG 


1301 


TGGACAAGTT 


CGACAAACTG 


13 51 


GGTTCGCTTG 


CCGAGCTCAA 


1401 


AAGCTCCATC 


GACAAACTGG 


14S1 


ACGAACTGAA 


CCAAACCCTG 


1501 


TCGCCGCAAT 


CGCCTATCTA 


1551 


GGACAAAACT 


TTAAAAGACG 


1601 


AACCCAACGC 


GCTGATTTTC 


1651 


GGAAGCCGAT 


AA 



TACCGTTTAT GCAGGCGATA CCGTTATCGC 
ACGATTTGCA GGTCAAATTG GCGGATTTGC 
CCTTTAGATA AGACGGTTGC CGAATTGAAC 
ATCCACACTC AAATCTGCCA ATGCCGCCCT 
TCGGCAAACC GCAGACACAA AACATTCCGA 
AAAGAGTTGC GCACAACCCT GCAAGGCGTA 
CGGCGACGTA CAAAATACGC TGCAAAGTTT 
TTCAACCCGT GATTAATACT TTGAAAGAAA 
AACAGCAGCA GCAAAGACCC TATCCCGAAA 



This corresponds to the amino acid sequence <SEQ ID 634 ORF 154.a>: 



ml 5 4 . pep 












1 


MTDNSPPPNG 


HAQARVRKNN 


TFLSAVWLVP 


LIALIAGGWL 


WVKEIRNRGP 


51 


WTLLMDSAE 


GIEVNNTVIK 


VLSIDVGRVT 


RIKLRDDQKG 


VEVTAQLNAD 


101 


VSGLIRSDTQ 


FWWKPRIDQ 


SGVTGLGTLL 


SGSYIAFTPG 


KSDEAKDVFQ 


151 


VQDIPPVTAI 


GQSGLRLNLI 


GKNDRILNVN 


SPVLYEMFMV 


GQVESAHFDP 


201 


SDQSVHYTIF 


IQSPNDKLIH 


SASRFWLESG 


INIETTGSGI 


KLNSAPLPAL 


251 


LSGAISFDSP 


KTKNSKNVKS 


EDSFTLYDSR 


SEVANLPDDR 


SLYYTAFFKQ 


301 


SVRGLTVGSP 


VEYKGLNVGV 


VSDVPYFDRN 


DSLHLFEnGW 


IPVRIRIEPS 


351 


RLEINADEQS 


KEHWKQQFQT 


ALNKGLTATI 


SSHNLLTGSK 


MIELNDQPSA 


401 


SPKLRPHTVY 


AGDTVIATQG 


GGLDDLQVKL 


ADLLDKFDKL 


PLDKTVAELN 


451 


GSLAELKSTL 


KSANAALSSI 


DfCLVGKPQTQ 


NIPNEIiNQTL 


KELRTTLQGV 


501 


SPQSPIYGDV 


QNTLQSLDKT 


LKDVQPVINT 


LKEKPNALIF 


NSSSKDPIPK 


551 


GSR* 











ml54 / g!54 97.8% identity in 553 aa overlap 



10 20 30 40 50 60 

ml 54 . pep MTDNSPPPNGHAQARVRKNNTFLSAVWLVPLIALIAGGWLWVKEIRNRGPVVTLLMDSAE 

iii![ilj!liil!Minillill)iltMlllltltilMI!l!t!!titilliii!( 

gl54 MTDNSPPPNGHAQARV^KWTFLSAWLVPLIALIAGGWLVT^EIR^RGPVVTLLMDSAE 
10 20 30 40 50 60 



70 80 90 100 110 120 

m!54 .pep G 1 EVNNTV I KVLS I D VGRVTRI KLRDDQKGVEVTAQLNADVSGL I RSDTQFWWKPR I DQ 

IMIMtltllllillliMllliMfHtlllllinillllillliliMlltlMI! 

g 1 5 4 GI EVNNTVI KVLS IDVGRVTRI KLRBDQKGVEVTAQLN ADVSGLI RSDTQFWWKPRIDQ 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml54 .pep SGWG1X3TLLSGSYIAFTPGKSDEAKDVFQVQDIPPVTAIGQSGLRLNLIGKNDRILNVH 

MjlliiMIMIIiillMM IMllllillllilllllllHilllllilimili 

gl£4 SGVTGLGTLLSGSYIAFTPGKSGEAKDVFQVQDIPPVTAIGQSGLRLNLIGKNDRILNVN 
130 140 150 160 170 180 



190 200 210 220 230 240 

ml 54 .pep S PVLYENFMVGQVESAHFD P SDQS VHYT I F I QS PNDKLI HSAS RFWLESGI NI ETTGSG I 

1 1 1 1 1 1 1 1 1 1 1 1 ■■ I I I M M I I I I I I I I II I I I I I I i f II I I i II I I i I I I i I! I f II I i 
gl54 SPVLYENFMVGQIESAHFDPSDQSVHYTIFIQSPMDKLIHSASRFWLESGINIETTGSGI 
190 200 210 220 230 240 



250 260 270 280 290 300 

mI54 .pep KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEVANLPDDRSLYYTAFFKO 

llll(Mliiil!IIIIIM!llllllli({illllflin| : j!lllllllil!l!lli 

gl54 KIiNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEIANLPDDRSLYYTAFFKQ 
250 260 270 280 290 300 



310 320 330 340 350 360 

ml54 .pep SVRGLTVGSPVEYKGLNVGWSDVPYFDRNQSLHLFENGWIPVRIRIEPSRLEINADEQS 

liniMIIMIllllllhlllllllllilllllMllllilllHIjllllllMIII 

gl54 SVRG LTVGS P VEYKGLNVGMVSD VPYFDRNDS LHLFENGWI PVR I RI E PSRLE INADEQS 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml54 .pep KEHWKQQFQTAI^KGLTATISSNNLLTGSKMIELNDQPSASPKLRPHTVY AGDTVIATQG 

I i t II I i 1 1 i I i i I i i 1 1 M 1 1 1! 1 1 1 i 1 1 1 1 1 ! I M ! i 1 1 M ! 1 1 1! 1 1 1 1 1 ( 1 1 1 = ! 

gl 5 4 KEHWKQQFQTALtf KGLTATI SSNNLLTGGKMI ELNPQPSAS PKLRPHTVYAGDTVIATRG 
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151 VQDIPPVTAI 

201 SDQSVHYTIF 

251 LSGAISFDSP 

301 SVRGLTVGSP 

351 RLEINADEQS 

401 SPKLRPHTVY 

451 GSLAELKSTL 

5Q1 SPQSPIYGDV 

551 GSR* 



GQSGLRLNLI 
IQSPNDKLIH 
KTKNSKNVKS 
VEYKGLNVGV 
KEHWKQQFQT 
AGDTVIATQG 
KSANAALSSI 
QNTLQSLDKT 



GKNDRILNVN 
SASRFWLESG 
EDSFTLYDSR 
VSDVPYFDRN 
ALNKGLTATI 
GGLDDLQVKL 
DKLVGKPQTQ 
LKDVQPVINT 



SPVLYENFMV 
INIETTGSGI 
SEVANLPDDR 
DSLHLFENGW 
SSNNLLTGSK 
ADLLDKFDKL 
NIPNELNQTL 
LKEKPMALIF 



GQVESAHFDP 
KLNSAPLPAL 
SLYYTAFFKQ 
IPVRIRIEPS 
MIELNDQPSA 
PLDKTVAELN 
KELRTTLQGV 
NSSSKDPIPK 



ml54/al54 100.0% identity in 553 aa overlap 

10 20 30 40 50 60 

ml 54 .pep MTDNSPPPNGHAQARVRKNNTFLSAVWLVPLIALIAGGWLWVKEIRNRGPWTLLMDSAE 
I I I I I i I i I M I I i I I I i 1 I I I I I I M I I I i I M I ! I I I I I I I I I I I I I 1 I I I I t t 1 I ! I 
a 154 MTDNSPPPNGHAQARVRKNNTFLSAVWLVPLIALIAGGWLWVKEIRNRGPWTLLMDSAE 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml54 . pep GIEVNNTVIKVLSIDVGRVTRIKLRDDQKGVEVTAQLNADVSGLIRSDTQFWWKPRIDQ 
I I I I ! I I ! ! II I I I ! ! I i I 1 I 1! i II ! i I I I I I I I II I M I I I I I I II I i I I I I I ! I I I i 
al54 GIEVNNTVIKVLSIDVGRVTRIKLRDDOKGVEVTAQLNADVSGLIRSDTQFWWKPRIDQ 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml54 . pep SGVTGLGTLLSGSYIAFT PGKSDEAKDVFQVQDIPPVTAIGQSGLRLNLIGKNDRILNVN 

II I 1 I I I I 1 i I il ! I I I I I I I I I I M t I i i I I I M 1 I ! ! I I I I I II I I I I M I I i i I I M 
al54 SGVTGLGTLLSGSYIAFTPGKSDEAKDVFQVQDIPPVTAIGQSGLRLNLIGKNDRILNVN 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml54 .pep SPVLYENFMVGQVESAHFDPSDQSVKYT I FIQSPNDKL I BSASRFWLESG INIETTGSGI 

! II i I I I I i ! ! ' M I ; | ( i ' ; ' ! | | | I i ( ( | ( j ! | | I ! i ! | | | ( t ; | 1 ! I! I , , i ! i ) I I 
aI54 SPVLYENFMVGQVESAHFDPSDQSVHYTIFIQSPWDKLIHSASRFWLESGINIETTGSGI 

190 200 210 220 230 240 



250 260 270 230 290 300 

ml54 . pep KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYDSRSEVANLPDDRSLYYTAFFKQ 
II I I M I I I i I M 1 I I M I I I I i I I I I I I M I 1 I I I I I I I I i M I II I I I I I I I I I I I I I 
al54 KLNSAPLPALLSGAISFDSPKTKNSKNVKSEDSFTLYD5RSEVANLPDDRSLYYTAFFKQ 

250 260 270 230 290 300 



310 320 330 340 350 360 

ml 54 .pep SVRGLTVGSPVEYKGLNVGWSDVPYFDRNDSLHLFENGWIPVRIRIEPSRLEINADEQS 
M ! I I I I I I I I I ! I ! 1 i I i I I I I I I II I I I I M I I I I I M II I M I I M i I I I I I I I I I i 
al54 SVRGLTVGSPVEYKGLNVGWSDVPYFDRNDSLHLFENGWI PVRIRIEPSRLEINADEQS 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml54 . pep KEHWKQQFQTALNKGLTATISSNNLLTGSKMIELNDQPSASPKLRPHTVYAGDTVIATQG 
i II I II I I I t M I I I II I I I 11 I ) I I I 1 I I I I I I I I I ) I I 1 I II 1 I I M I I i I I 1 I I II I 
al54 KEHWKQQFQTALNKGLTATISSNNLLTGSKMIELNDQP5ASPKLRPHTVYAGDTVIATQG 

370 380 390 400 410 420 



430 440 450 460 470 480 

ml 54 .pep GGLDDLQVKLADLLDKFDKLPLDKTVAELMGSLAELKSTLKSANAALS SI DKLVGKPQTQ 

I I I I I I II I I I I I II I I I i ! I I I I I I I I I f I I I I I I I M I I I I I I M I I I I I I I I I I I ! I 
al54 GGLDDLQVKLADLLDKFDKLPLDKTVAELNGSLAELKSTLKS AN AALSS I DKLVGKPQTQ 

430 440 450 460 470 480 



490 500 510 520 530 540 

ml 5 4 . pep NIPNELNQTLKELRTTLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF 
I I I i I i M I I I I I M I I I M I I I I 11 M I II I II I II I I I I I I M I i I I i I I I M | I I M 
al54 NIPNELNQTLKELRTTLQGVSPQSPIYGDVQNTLQSLDKTLKDVQPVINTLKEKPNALIF 

490 500 510 520 530 540 
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301 


TTr,nTrr.a zip 




351 






401 






4 51 


PTITTTPTTPZl 


ppp^tt"!! a 


501 


1 ! 1 i Vjo 1 o 


/til UU 1 


551 


PAAAPTPHPT 


pppppp AftTr 


601 






65 1 


a pzi £PZi itt,'': 




7 01 




a p p p p a p a t p 


751 


P,APATPATPA 


TP APPAPPPP 


on i 






851 


ATTTGGCGGC 


GGCGACGGGC 


901 


TTGTCCGTAA 


CCGGCAACGG 


951 


AAACCGCCTT 


GCCGGACAGT 


1001 


ACCTGACCAA 


GCTGTTAAGC 


1051 


TTCGAAGACG 


TG? W TATCCG 


1101 


CACCTTCCCG 


CCTCCGCCGA 


1151 


CGTCTGAAAA 


AGCCGTGCCT 


1201 


CTGTGGAAAA 


AACTCGCGCC 


1251 


GGTCGGCGCG 


GTCGCACCCG 


1301 


TTCTCGCCTG 


CGTCATCGGC 


1351 


CTGCACACAC 


CGCTGATGTC 


1401 


CGTCGGCGCG 


CTGCTGCAAA 


1451 


TGTCGTTTGT 


TGCCATCCTG 


1501 


GCGGTAACAC 


GGCGTATGCT 



CAAGAAAGTG AACGCGCTGG CGATGGATAT 
CGCAGGCTTT GGACGCTTTG TCTTCGATGG 
GCCGTAATTG AAGCCGCCAA CGCCTTCGGC 
TACCGCCGCC GGCAAAGTGC CGCCCGCGCA 
GTGTGGCAGG TTTGGCGGCG ATCGGTACGG 
GTACGCGCGT TCGATACCCG CTTGGAAGTG 
GGGCGGCAAG TTCCTGAAAC TCGACTTCCC 
GAGACGGCTA CGCCAAAGTG ATGAGCGACG 
AAGCTCTTTG CCGAGCAGGC GAAAGAAGTG 
CGCCATTCCG GGCAAACCCG CGCCCAAGCT 
AAAGCATGAA ATCCGGCTCC GTCATCGTCG 
GGCAACTGCG AACTCACCCG CCCGGGCGAA 
CGTGAAAATC ATCGGCTACA CCGACATGGC 
CTTCCCAGCT TTACGCCACC AACTTGGTCA 
CCGAACAAAG ACGGCGAAAT CACGTTGGAC 
CAACATGACC GTTACCCACG ACGGCGAAAT 
TTCAAGTTTC CGCCCAGCCG CAGCAAACGC 
GCCGCCAAGC CCGAGCCAAA ACCCGTTCCC 
CGCCGTCATC GCCGCCGTCT TGGTACTGTG 
CAGCATTCCT GAACCACTTT ATCGTGTTCG 
TACTACGTCG TCTGGAACGT CAGCCACTCG 
GGTAACCAAC GCCATCTCCG GCATCATCGT 
TCGGTCAGGG CAACGGCTTC GTTTCGCTGC 
ATTGCCGGCA TCAACATCTT CGGCGGCTTT 
GAATATGTTT AAGAAAGGGT AA 



This corresponds to the amino acid sequence <SEQ ID 640; ORF 155>: 

ml55 . pep 

1 MKIGIPRESL SGETRVACTP ATVALLGKLG FETWESGAG LAASLDDAAY 

51 QTAGATVADK AAVWVCP LXY KVNAPSEQEL PLLN EGQTIV SFLWPRQNEA 

101 LVEALRAKKV NALAMDMVPR ISRAQALDAL SSMANISGYR AVIEAANAFG 

151 RFFTGQITAA GKVPPAQVLV IGAGVAGLAA IGTANSLGAV VRAFDTRLEV 

201 AEQIESMGGK FLKLDFPQES GGSGDGYAKV MSDEFIAAEM KLFAEQAKEV 

251 DIIITTAAIP GKPAPKLITK EMVESMKSGS VIVDLAAATG GNCELTRPGE 

301 LSVTGNGVKI IGYTDMANRL AGQSSQLYAT NLVNLTKLLS PNKDGEITLD 

351 FEDVIIRNMT VTHDGEITFP PPPIQVSAQP QQTPSEKAVP AAKPEPKPVP 

401 LWKK LAPAVI AAVLVLWVGA V AP AAFLNHF IVFVLACVIG YYWWNVSHS 

4 51 LHTP LMSVTN AISGIIWGA L LQIGQGNG F VSLLSPVAIL IAGINI FGGF 

501 AVTRRMLNMF KKG* 



Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 155 shows 97.9% identity over a 513 aa overlap with a predicted ORF (ORF 155.ng) 
from N. gonorrhoeae: 

ml55 / gl55 97.92 identity in 513 aa overlap 

10 20 30 40 50 60 

ml 55 . pep MKIGI PRESLSGETRVACTPATVALLGKLGFETWESGAGLAASLDDAAYQTAGATVADK 

I I I i I I ! I i I i I ! I ! I I I I 1 I t I I I M I I I i M I 1 I i I i II I 1 I I I 1 I I I 1 I !l I I I II I 
gl55 MKIGI PRE SLSGETRVACT PAT VALLGKLGFETWESGAGLAASLDDAAY QTAGATVADK 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 55 . pep AAWVCPLIYKVNAPSEQELPLLNEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR 
I I I I : I H I I I I I I I I I I I It i : 1 I I 1 I I I I I I I I i M I I I 1 ) t 1 i I I 1 I I I i 1 I i ! i I 
gl55 AAVWACPLIYKVNAPSEGELPLLK£GQTIVSFLWPRQNEALVEALRAKKV^1ALAHDMVPR 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 55. pep ISRAQALDALSSMANISGYRAVIEAANAFGRFFTGQITAAGKVPFAQVLVIGAGVAGLAA 
1 I i I M I I ( ! I I I I I I I I I I ! I I I I | I | 1 | | | | M M | | | | | | M I I I I I I I I I M I M I 
g!55 I SRAQALDALSSMAN I SGYRAVIEAANAFGRFFTGQ I TAAGKVPPAQVLV IGAGVAGLAA 
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1201 CTGTGGAAAA AACTCGCGCC 

12 51 GGTCGGCGCG GTCGCACCCG 
1301 TCCTCGCCTG CGTCATCGGC 

13 51 CTGCACACAC CGCTGATGTC 

14 01 CGTCGGCGCG CTGCTGCAAA 
14 51 TGTCGTTTGT TGCCATCCTG 
1501 TTTGTAACGC GGCGGATGCT 



CGCCNTNATC GCCGCCGTG7 TGGTACTGTG 
CAGCATTCCT GAACCACTTT ATCGTCTTCG 
TACTATGTCG TTTGGAACGT CAGCCACTCG 
GGTGACCAAC GCCATTTCCG GCATCATCGT 
TCGGTCAGGG CAACGGCTTC GTTTCGCTGC 
ATTGCCAGCA TCAACATCTT CGGCGGCTTC 
GAATATGTTT AGGAAAGGGT AA 



This corresponds to the amino acid sequence <SEQ ID 642; ORF 155.a>: 

a!55 . pep 

1 MKIGI PRESL SGETRVACTP ATVALLGKLG FETWESGAG LAASLDDAAY 

51 QAAGATVADK AAVWAYPLIY KVNAPSEDEL PLLKEGQTIV SFLWPRQNEA 

101 LVEALRAKKV NALAMDMVPR ISRAQALDXL SXMANISGYR AVIEAANAFG 

151 RXFTGQITAA GKVPPAQ VLV IGAGVAGLAA IGTA N5LGAV VRVFDTRLXV 

201 AEQLESMGGK FLKLDFPQES GGSGDGYAKV MSDEFIAAEM KLFAEQAKEV 

251 DIIITTAAIP GKPAPKXXXK EMVESMKPGS VIVDLAAATG GNCELTKQGE 

301 LFVTGNGVKI IGYTDMANRL AGQSSQLYAT NLVNLTKLLS PNKDGEITLD 

351 FEDVIIRNMT VTRDGEITFP PPPIQVSAQP QQTPSEKAAP AAKPEPKPVP 

4 01 LWKK LAPAXI AAVLVLWVGA V AP AAFLNHF IVFVLACVIG YYWWNVSHS 

4 51 LHTP LMSVTN AISGIIVVGA L LQIGQGNG F VSLLSFVAIL IASINI FGGF 

501 FVTRRMLNMF RKG* 

ml55/a!55 95.3% identity in 513 aa overlap 



10 20 30 40 50 60 

ml 55 .pep MKIGI PRESLSGETRVACTPATVALLGKLGFETWESGAGLAASLDDAAYQTAGATVADK 

I M i I I I I I 1 I I M I M I I I 1 I I I I I I I I M I I M I I I I I I ) I N 1 I I I I I : I I I 1 M I I 
al55 MKIGI PRESLSGETRVACTPATVALLGKLGFETVVESGAGLAASLDDAAYQAAGATVADK 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 55 . pep AAVWVCPLIYKVNAPSEQELPLLNEGQTIVSFLWPRQNEALVEALPAKKVNALAMDMVPR 
I I I i : M ! II I i i I i I : I I I ! I : ! I i i I I II I t I 1 i i t I M 1 I I ) I I t ! M II ! I I 1 I I 
al55 AAVWAYPLIYKVNAPSEDELPLLKEGQTIVSFLWPRQNEALVEALRAKKVNALAMDMVPR 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml55 . pep ISRAQALDALSSMANISGYRAVIEAANAFGRFFTGQITAAGKVPPAQVLVIGAGVAGLAA 
I 11 II ! I I I I II I I M I I i I i I I ) I ! 1 I I I I I I I I I I I I I 1 I II 1 I I I I I M I I I I I 
a!55 I SRAQALDXLSXMAN I SGYRAVIEAANAFGRXFTGQITAAGKVPPAQVLV IGAGVAGLAA 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml 55. pep IGTANSLGAWRAFDTRLEVAEQIESMGGKFLKLDFPQESGGSGDGYAKVMSDEFIAAEM 
I I i I I 1 I I I ! I I : II I I I I I I I : I I I M I II I I I I I I I I I I I I I I I I I 1 I I I II I I II I 
al55 IGTANSLGAWRVFDTRLXVAEQLESMGGKFLKLDFPQESGGSGDGYAKVMSDEFIAAEM 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 5 5 . pep KLFAEQAKEVDI I ITTAAI PGKPAPKLITKEMVE SMKSGSVI VDLAAATGGNCELTRPGE 

I I I I I I I I I ! I II II I I II I I i I I I i : I I i 1 M I I I I I I I I I I I II i I I I I II : II 
a 1 5 5 KLFAEQAKEVDI I ITTAAI PGKPAPKXXXKEMVESMKPGSVIVDLAAATGGMCELTKQGE 

250 260 270 2S0 290 300 



310 320 330 340 350 360 

ml 55. pep LSVTGNGVKIIGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVIIRNMT 
I I I I I I I I I M I I I t I I I I II I I I I I I I I I I II I I I 1 I I 11 I I I II M I I f I I I I I I I 1 
al55 LFVTGNGVKI IGYTDMANRLAGQSSQLYATNLVNLTKLLSPNKDGEITLDFEDVIIRNMT 

310 320 330 340 350 360 



370 380 390 400 410 420 

ml 55 . pep VTHDGEITFPPPPIQVSAQPQQTPSEKAVPAAKPEPKPVPLWKKLAPAVI AAVLVLWVGA 

I i : I I i M I I M I I I II ! I I I I I I I I I I : I I I I I I I I M I I I F I I I I I II I I I II 1 1 I I 
al 55 VTRDGEITFPPPPIQV5AQPQQTPSEKAAPAAKPEPKPVPLWKKLAPAX I AAVLVLWVGA 

370 380 390 400 410 420 
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g!56 FAPFAAAVLTAHATGNAGQATVNTLAGLFILFRLAFIWCYIADKAALRSLMWAGGFACTV 

70 80 90 100 110 120 

ml 5 6. pep GLFVAAAX 
i ! 1 i I I I 1 
gl56 GLFVAAAX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 647>: 

al56 . seq 

1 ATGACTTTCG CCTATTGGTG TATTCTGATT GCCTACCTAT TGCCGCTTTT 

51 TTGTGCGGCG TATGCCAAAA AAGCGGGCGG ATTCCGGTTT AAAGACAACC 

101 ACAATCCGCG CGATTTTCTG GCGCGCACGC AAGGCACAGC CGCCCGTGCC 

151 CACGCCGCGC AGCAAAACGG TTTTGAAGCC TTTGCACCGT TTGCAGCCGC 

201 CGTTTTGACG GCACACGCAA CCGGCAATGC CGGACAAGCA ACCGTCAACA 

2 51 CGCTTGCCGG CCTGTTCATC CTGTTCCGCC TCGCCTTTAT CTGGTGCTAC 

301 ATCGCAGACA AAGCAGCATT ACGCTCGCTG ATGTGGGTGG GCGGATTTGT 

351 CTGCACCGTC GGGCTGTTTG TCGTGGCTGC TTGA 

This corresponds to the amino acid sequence <SEQ ID 648; ORF 156.a>; 

al56 . pep 

1 MTFAYWCILI AYLLPLFCAA YAKKAGGFRF KDNHNPRDFL ARTQGTAARA 

51 HAAQQNGFEA FAPFAAAVLT AHATGNAGQA TVNT LAGLFI LFRLAFIWCY 

101 IADKAALRS L MWVGGFVCTV GLFWA A* 

m!56/al56 90.6% identity in 127 aa overlap 

10 20 30 40 50 60 

ml 5 6. pep MTFAYWCILIACLLPLFCAAYAKKAGGFRFKDNHNPRGFLAHTQGAAARAHAAQQNGFEA 
i I i I i I I I I I I I I I I I i I I I I I I ! i I I I I I II II I I I I I : I I I : II I M I II 1 I I I I I 
al56 MTFAYWCILIAYLLPLFCAAYAKKAGGFRFKDNHNPRDFLAKTQGTAARAHAAQQNGFEA 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 56 . pep FAPFAAAVLTAHATGNAAQSTINTLACLFILFRLAFIWCYIADKAAMRSLMWAGGFACTV 
II 1 I I I I I I I I I I I I M : I : I : I I ! I M I I I 1 I I I I I I I II I I I I : I I I I I : i I I : I I I 
al56 FAPFAAAVLTAHATGNAGQATVNTLAGLFILFRLAFIWCYIADKAALRSLMWVGGFVCTV 
70 80 90 100 110 120 



ml 5 6. pep GLFVAAAX 
lllhll! 
al56 GLFWAAX 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 649>: 

gl57 . seq 

1 atgaggaacg aggAAAAACg cgccctgcgc cgcgaattgC gCgGgcggcg 

51 ttcgcAAATg GGgcgagacg tGCGggCGGC GGCGgCgatA Aaaatcaacc 

101 gcctgctcaa aCGTtatatc AAGCGCggtc gGaAaatcgG CGTGTATTgg 

151 cCGATGGGCA AGGAATTGcg TTTGGGCGgc tTtgtcCGCG CGGCGCAAAA 

201 ACGCgGCGCA AAactctatc tgccttATAT CGAACCGCAC ACGCGGCGGA 

251 TGTGGTTTAC GCCGTATCCT GAACGCGGAA TGGAACGGGA ACGCAAGCGC 

301 GGTAGGGCGA AGCTGCATGT CCCTCAGTTT GCAGGGCGCA AAATCCGCGT 

351 GCACGGTTTG TCGGTATTGC TCGTCCCGCT TGTCGGCATA GACCGCGAAG 

401 GCTACCGTTT GGGGCAGGCA GGCGGCTATT ACGATGCGAC GCTTTCGGCG 

4 51 ATGAAATACC GTTTGCAGGC GAAAACCGTG GGCGTGGGCi: TTGCCTGCCA 

501 GTTGGTGGAC AGGCTCCCAC GCGAGGCGCA CGACCTGCCG CTGGACGGTT 

551 TTGTATCGGA AGCGGGGATA TTGTGTTTTT AG 
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101 GCCTGCTCAA ACGTTATATC 

151 CCGATGGGCA AGGAATTGCG 

201 ACGCGGTGCA AAACTTTATC 

251 TGTGGTTTAC GCCGTATCCT 

301 GGCAGGGCGA AGTTGAACGT 

351 GCACGGTTTG TCGGTATTGC 

401 GCTACCGCTT AGGACAGGCA 

4 51 ATGAAATACC GTTTGCAGGC 

501 GTTTGTGGAC AGGCTGCCGC 

551 TTGTGTCGGA GGCGGGGATA 



AAGCGTGGTC GGAAAATCGG CGTGTATTGG 
TTTGGACGGC TTTGTCCGCG CGGCGCAAAA 
TGCCTTATAT CGAACCGCGT TCGCGGCGGA 
GAAAGCGGAA TGGAACGGGA GCGCATACGG 
GCCGCAGTTT GCAGGGCGCA AAATCCGCGT 
TCGTCCCGCT TGTCGGCATA GACCGCGAGG 
GGCGGCTATT ACGATGCGAC GCTTGCGGCG 
AAAAACCGTG GGCGTGGGCT TTGCCTGCCA 
GCGAACCGCA CGATCTGCTG CTGGACGGTT 
TTGTGCTTTT AG 



This corresponds to the amino acid sequence <SEQ ID 654; ORF 157.a>: 

al57 .pep 

1 MRNEEKHALR RELRRARAQM GHQGRLAAGQ TINRLLKRYI KRGRKIGVYW 

51 PMGKELRLDG FVRAAQKRGA KLYLPYIEPR SRRMWFTPYP ESGMERERIR 

101 GRAKLNVPQF AGRKIRVHGL SVLLVPLVGI DREGYRLGQA GGYYDATLAA 

151 MKYRLQAKTV GVGFACQFVD RLPREPHDLL LDGFVSEAGI LCF* 

ml57/al57 82.4% identity in 193 aa overlap 



10 20 30 40 50 60 

ml 57 . pep MRNEEKRALRRELRGRRSQMGRDVRAAATVKIMHLLKRYIKKGRKIGVYWPMGKELRLDG 
111111:1111111 I : I I I : : i II I I : I I I I 1 I I : I M I I I I I 1 ! M I I I I 1 i 
a!57 MRNEEKHALRRELRRARAQMGHQGRLAAGQTINRLLKRYIKRGRKIGVYWPMGKELRLDG 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 57 . pep FVRAAQKRGAELYLPYIEPRSRRMWFTPYPADGVKQERKRGRAKLHVPQFAGRKKRVHDL 
I 1 I 11 I I II I : I 1 I I I I 1 I I 1 t I I I 1 t ! 1 1 : I : : : 1 I I 1 M I 1 : 1 t I I I M i 111 I 
a 157 FVRAAQKRGAKLYLPYIEPRSRRMVIFT 7 YPESGMERERIRGRAKLNVPQFAGRKIRVHGL 

70 60 90 100 110 120 



130 140 150 160 170 180 

ml57 . pep NLLLVPWGMDRLGYRLGQAGGYYDATLSAMKYRLQAKTVGVGFACQLVDRLPVEAHDRS 

: : II I I : I I : II I 1 I M 1 I I I it I ! M : 1 I I 1 I ! I II II I M I I M : M I I I I II 
al57 SVLLVPLVGIDREGYRLGQAGGYYDATLAAMKYRLQAKTVGVGFACQFVDRLPREPHDLL 

130 140 150 160 170 180 



190 

ml57.pep LDGFVSEAGILCFX 
1111)11)11)111 
al57 LDGFVSEAGILCFX 
190 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 655>: 

gl58 . seq 

1 ATGAAAACCA ATTCAGAAGA ACTGACCGTA TTTGTTCAAG TGGTGGAAAG 

51 CGGCAGCTTC AGCCGTGCGG CGgagcAGTT GGAGAtggCA AATTCTGCCG 

101 TAAGCCGCAT CGTCAAACGG CTGGAGGAAA AGTTGGGCGT GAAcCTGCtc 

151 aACCGCACCA CGCGGCAACT CAATCTGACG GAAGAAGGCG CGCAATATTT 

201 CCGCCGCGCG CAGAGAATCC TGCAAGAAAT GGCAGCGGCG GAAACCGAAA 

251 TGCTGGCAGT GCACGAAGTA CCGCAAGGCG TGTTGCGCGT GGATTCCGCG 

301 ATGCcgatgg TGCTGCATCT GCTGGCGCCG CTGGCAGCAA AATTCAACGA 

351 ACGCTATCCG CATATCcgaC TTTCGCTCGT TTCTTCCGAa ggctatatca 

4 01 atctGattGA Acgcaaagtc gAtatTGCCT TACGGGCCGG AGAATTGGAC 

4 51 GATTCCGGGC TGCGTGCACG CCATCTGTTT GACAGCCACT TCCGCGtagt 

501 cgCCAGTCCT GAATATTTAG CAAAACACGG CACGCCACAA TCTGCAGAAG 

551 atcTTGCCAA CCATCAATGT TTAGGCTTCA CAGAACCCGG TTCTCTAAAT 

601 ACATGGGCGG TTTTAGAtgC GCAGGGAAAT CCCTATAAAA TTTCACCGCA 

651 CTTTACCGCC AGCAGCGGTG AAATCTTACG CTCGTTGTGC CTTTCAAGtt 

7 01 gCGGTATTGC TTGCTTATCA GATTTTTTGG TTGACAACGA CATCACTGAA 

7 51 GGAAAGTTAA TTCCcctatt cgCCGAACAA ACCTCCAATA AAACACACCC 
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130 140 150 160 170 180 

ml58 . pep HIRLSLVSSEGYINL1ERKVDIALRAGELDDSGLRARHLFDSRFRVIASPEYLAKHGTPQ 
( I I I I I i I II I I I I i 1 i i 1 I I i I I I I I f ! f I i I I f I I i i I i t : I I ! : I 1 I I M I It I II I 
gl58 HIRLSLVSSEGYINLXERKVDIALRAGELDDSGLRARHLFDSHFRVVASPEYLAKHGTPQ 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml58 . pep STEELAGHQCLGFTEPGSLNTWAVLDAQGNPYKISPHFTASSGEILRSLCLSGCGIVCLS 
I : I : f I : M J I M ! I I I I I II I I ! f I ( II I I I I ! ! I ! ! I I I I I I I I I I I ( I I : I I i : i I I 
gl58 SAEDLANHQCLGFTEPGSLNTWAVLDAQGNPYKISPHFTASSGEILRSLCLSSCGIACLS 
190 200 21C 220 230 240 

250 260 270 230 290 300 

ml 58. pep DFLVDNDIAEGKLIPLLAEQTSDKTHPFNAVYYSDKAVNLRLRVFLDFLVEELGNNLCGX 

\ I I I I I I I : i I II I II : M I I I : I I II I II I I I I I I I I I I I I I I II I I I I : I 11 : I : 
g!58 DFLVDNDITEGKLIPLFAEQTSNKTHPFWAVYYSDKAVNLRLRVFLDFLVKELGKNMNRT 

250 260 270 280 290 300 

gl58 NTKX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 659>: 

al58 . seq 

1 ATGAAAACCA ATTCAGAAGA ACTGACCGTA TTTGTTCAAG TGGTGGAAAG 

51 CGGCAGCTTC AGCCGTGCGG CGGAGCAGTT GGCGATGGCA AATTCTGCCG 

101 TAAGCCGCAT CGTCAAACGG CTGGAGGAAA AGTTGGGTGT GAACCTGCTC 

151 AACCGCACCA CGCGGCAACT CAGTCTGACG GAAGAAGGCG CGC AATATTT 

201 CCGCCGCGCG CAGAGAATCC TGCAAGAAAT GGCAGCGGCG GAAACCGAAA 

251 TGCTGGCAGT GCACGAAATA CCGCAAGGCG 7GTTGCGCGT GGATTCCGCG 

301 AT GCCGAT GG TGCTGCATCT GCTGGCGCCG CTGGCAGCAA AATTCAACGA 

351 ACGCTATCCG CATATCCGAC TTTCGCTCGT TTCTTCCGAA GGCTATATCA 

401 ATCTGATTGA ACGCAAAGTC GATATTGCCT TACGGGCCGG AGAATTGGAC 

451 GATTCCGGGC TGCGTGCACG CCATCTGTTT GACAGCCGCT TCCGCGTAAT 

501 CGCCAGTCCT GAATACCTGG CAAAACACGG CACGCCGCAA TCTACAGAAG 

551 AGCTTGCCGG CCACCAATGT TTAGGCTTCA CCGAACCCGG TTCTCTAAAT 

601 ACATGGGCGG TTTTAGATGC GCAGGGAAAT CCCTATAAGA TTTCACCGCA 

651 CTTTACCGCC AGCAGCGGTG AAATCTTACG CTCGTTGTGC CTTTCAGGTT 

7 01 GCGGTATTGC TTGCTTATCA GATTTTTTGG TTGACAACGA CATCGCTGAA 

751 GGAAAGTTAA TTCCCCTGCT CGCCGAACAA ACCTCCAATA AAACGCACCC 

801 CTTTAATGCT GTTTATTACA GCGATAAAGC CGTCAACCTC CGCTTACGCG 

851 TATTTTTGGA TTTTTTAGTG GAGGAACTGG GAAACAATCT CTGTGGATAA 

This corresponds to the amino acid sequence <SEQ ID 660; ORF I58.a>: 

al58 -pep 

1 MKTNSEELTV FVQWESGSF 5RA AEQLAMA NSAVSRIVKR LEEKLGVNLL 

51 NRTTRQLSLT EEGAQYFRRA QRILQEMAAA ETEMLAVHEI PQGVLRVDSA 

101 MPMVLHLLAP LAAKFNERYP HIRLSLVSSE GYINLIERKV DIALRAGELD 

151 DSGLRARHLF DSRFRVIASP EYLAKHGTPQ STEELAGHQC LGFTEPGSLN 

201 TWAVLDAQGN PYKISPHFTA SSGEILR SLC L5GCGIACLS DFLV DNDIAE 

251 GKLIPLLAEQ TSNKTHPFNA VYYSDKAVNL RLRVFLDFLV EELGNNLCG* 

mX58/al58 99.0% identity in 299 aa overlap 

10 20 30 40 50 60 

ml 58 . pep MKTNSEELTVFVQWESGSFSRAAEQLAMANSAVSRIVKRLEEKLGVNLLNRTTRQLSLT 
I I M I I I I M II I I I I 1 I I 1 I I II I I i I I I i 1 I I I I I 11 M I \ I 1 I I I I I II 1 I I M i I I 
al 58 MKTNSEELTVFVQWESGSFSRAAEQLAMANSAVSRIVKRLEEKLGVNLLNRTTRQLSLT 

10 20 30 40 50 60 

70 80 90 100 110 120 

rt\158 .pep EEGAQYFRRAQRILQEMAAAETEMLAVHEIPQGVLSVDSAMPMVLHLLAPLAAKFNERYP 
I I I I I I 1 I I 1 I I I I I I I I I I I I t I ! II M I I I I I I I I I I I I I I I I II I M I I I I I I | | | 
a 158 EEGAQYFRRAQRI LQEMAAAETEMLAVHE I PQGVLRVDSAM PMVLHLLAPLAAKFNER Y ? 

70 80 90 100 110 120 
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501 CGTCCTGCTG GTGCTTATCC TGCGCGCCTA TCTCGAACAG GATAAGGATG 

551 TCGAACTCTC GGGCGTATTG AAAGGTTGGC AGGACAAACG TTTGGGACAT 

601 TTAATCCAAA AGGTGATAGA CAAACCGGAA GACGAATGGA ATGTCGACAA 

651 AATGGTGGCG GCTGCCAATA TGTCGCGCGC GCAACTGATG CGCCGTTTCA 

701 AAAGCCGGGT CGGACTCAGC CCGCACGCCT TTGTGAACCA TATCCGCCTG 

7 51 CAAAAAGGCG CGTTGCTGCT GAAAAAAAAC CCGGATTCGG TTTTGTCGGT 

SOI CGCACTGTCG GTAGGCTTTC AGTCGGAAAC GCACTTCGGC AAGGCGTTCA 

851 AACGGCAATA TCACGTTTCG CCGGGTCAAT ACCGGAAAGA AggCGGGCAA 

901 AAATAA 



This corresponds to the amino acid sequence <SEQ ID 664; ORF 160>: 

ml 60 . pep 

I MDILDKLVDF AQLTGSVDVQ CLLGGQWSVR HETLQREGLV HIVTSGSGYL 

51 CIDGETSPRP VSTGDIVFFP RGLGHVLSHD GKCGESLQPD MRQHGAFTVK 

101 QCGNGQDMSL FCARFRYDTH ADLMUGLPET VFLNIAHPSL QYWSMLQLE 

151 SKKPLTG TVS MVNALSSVLL VLILRAYLEQ DKDVELSGVL KGWQDKRLGH 

201 LIQKVIDKPE DEWNVDKMVA AANMSRAQLM RRFKSRVGLS PHAFVNHIRL 

251 QKGALLLKKN PDSVLSVALS VGFQSETHFG KAFKRQYHVS PGQYRKEGGQ 
301 K* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ml60 / gl60 93.4% identity in 301 aa overlap 



10 20 30 40 50 60 

ml 60. pep MDILDKLVDFAQLTGSVDVOCLLGGQWSVRHETLQREGLVHIVTSGSGYLCIDGETSPRP 
11 M I M 1 I : M f 1 M : 1 1 I M M I I! 1 1 I 1 1 I i I 1 I I 1 I I : ! I M II I I I M ! I 1 I 
gl60 MDILDKLVDLAQLTGSADVQCLLGGQW HETLQREGLVHIVTAGSGYLCIDGETSPRP 

10 20 30 40 50 



70 80 90 100 110 120 

ml 60 . pep VSTGDIVFFPRGLGHVLSHDGKCGESLQPDMRQHGAFTVKQCGNGQDMSL FCARFRYDTH 

I : I I M 1 1 I 1 1 I I I ! i 11 I I 1 I 1 1 1 ( 1 I I : 1 I : 1 : 1 I 1 I 1 ! 1 i 11 M ! 1 I 1 i I I i I t 
gl60 VGTGDIVFFPRGLGHVLSHDGKYGESLQPDIRQNGTFMVKQCGNGLDMSLFCARFRYDTH 
60 70 80 90 100 110 



130 140 150 160 170 180 

ml 60. pep ADLMNGLPETVFLNIAHPSLQYWSMLQLESKKPLTGTVSMVNALSSVLLVLILRAYLEQ 

i I M It 1 1 I I M 1 i M 1 I M 1 I I 1 I I I ! I i I : I I 11 I I 1 1 : I ! I! 11 M I ! I! 11 I 11 I 
gl60 ADLMNGLPETVFLNIAHPSLQYWSMLQLESEKPLTGTVSVVWALPSVLLVLILRAYLEQ 

120 130 140 150 160 170 



190 200 210 220 230 240 

ml 60 . pep DKDVELSGVLKGWQDKRLGHLIQKVIDKPEDEWNVDKMVAAANMSRAQLMRRFKSRVGLS 

11 11 1 11 1 1 I I 1 ) 1 M 1 I I 11 I I 1 11 1 I 11 I 1 I I : I 11 1 I 1 11 ! I t 1 1 1 1 I 11 I I : 1 1 1 I 
gl60 DKDVELSGVLKGWQDKRLGHLIQKVIDKPEDEWNIDKMVAAANMSRAQLMRRFKSQVGLS 

180 190 200 210 220 230 



250 260 270 280 290 300 

ml 60. pep PHAFVNHIRLQKGALLLKKNPDSVLSVALSVGFQSETHFGKAFKRQYHVS PGQYRKEGGQ 

M 1 I 1 1 I I 1 1 1 M 1 1 I M I : I M I I 1 1 I 1 I 1 I I 1 I 1 1 1 1 1 1 1 1 i 1 I I 1 1 1 I 1 1 ! I 1 1 1 I 
gl60 PHAFVNHIRLQKGALLLKKTPDSVLEVALSVGFQSETHFGKAFKRQYHVS PGQYRKEGGQ 

240 250 260 270 280 290 



ml 60 . pep KX 
I I 

gl60 KX 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 665>: 

al60. seq 

1 ATGGACATTC TGGACAAACT GGTCGATTTC GCCCAATTGA CGGGCAGTGT 
51 GGATGTGCAG TGCCTTTTGG GCGGACAATG GTCGGTACGG CATGAAACCT 
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The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 667>: 

gl 61 . seq 

1 ATGGATACCG CAAAAAAAGA CATTTTAGGA TCGGGCTGGA TGCTGGTGGC 

51 GGCGGCCTGC TTCACCGTTA TGAACGTATT GATTAAAGAG GCATCGGCAA 

101 AATTTGCCCT CGGCAGCGGC GAATTGGTCT TTTGGCGCAT GCTGTTTTCA 

151 ACCGTTACGC TCGGTGCTGC CGCCGTATTG CGGCGCGACA CCTTCCGCAC 

201 GCCCCATTGG AAAAACCACT TAAACCGCAG TATGGTCGGG ACGGGGGCGA 

251 TGCTGCTGCT GTTTTACGCG GTAACGCATC TGCCTTTGAC AACCGGCGTT 

301 ACCCTGAGTT ACACCTCGTC GATTTTTt rg GCGGTATTTT CCTTCCTGAT 

351 TTTGAAAGAA CGGATTTCCG TTTACACGCA GGCGGTGCTG CTCCTTGGTT 

4 01 TTGCCGGCGT GGTATTGCTG CTTAATCCCT CGTTCCGCAG CGGTCAGGAA 

451 CCGGCGGCAC TCGCCGGGCT GGCGGGCGGC GCGATGTCCG GCTGGGCGTA 

501 TTTGAAAGTG CGCGAACTGT CTTTGGCGGG CGAACCCGGC TGGCGCGTCG 

551 TGTTTTACCT TTCCGCAACC GGCGTGGC^A. TGTCGTCggt ttgggcgacg 

601 Ctgaccggct ggCACAcccT GTCCTTTcca tcggcagttt ATCtgtCGGG 

651 CATCGGCGTG tccgcgCtgA TTGCCCAaCT GtcgatgAcg cGCGcczaca 

701 aaGTCGGCGA CAAATTCACG GTTGCCTCGC tttcctaTAt gaccgtcGTC 

751 TTTTCCGCCC TGTCTGCCGC ATTTTTTCTg ggcgaagagc ttttctggCA 

801 GGAAATACTC GGTATGTGCA TCATTATcct CAGCGGCATT TTGAGCAGCA 

851 TCCGCCCCAT TGCCTTCAAA CAGCGGCTGC AAGCCCTCTT CCGCCAAAGA 

901 TAA 



This corresponds to the amino acid sequence <SEQ ID 668; ORF 161.ng>: 

gl 61 . pep 

1 MDTAKKDILG SGWMLVAAA C FTVMNVLIKE ASAKFALGSG ELVFWRMLFS 

51 TVTLGAAAVL RRDTFRTPHW KNHLNRS MVG TGAMLLLFYA VTKL PLTTGV 

101 T LSYTSSIFL AVFSFLIL KE RISVYTQA VL LLGFAGVVLL LNPSF RSGQE 

151 PAALAGLAGG AMSGWAYLKV RELSLAGEPG WRVVFYLSAT GVAMSSVWAT 

201 LTGWHTL5 FP 5AVYLSGIGV SALIA QLSMT RAYKVGDKFT VAS LSYMTW 

251 FSALSAAFFL GEELFWQ EIL GMCIIILSG1 LSSI RPIAFK QRLQALFRQR 

301 * 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 669>: 

ml 61 . seq 

1 ATGGATACCG CAAAAAAAGA CATTTTAGGA TCGGGCTGGA TGCTGGTGGC 

51 GGCGGCCTGC TTTACCATTA TGAACGTATT GATTAAAGAG GCATCGGCAA 

101 AATTTGCCCT CGGCAGCGGC GAATTGGTCT TTTGGCGCAT GCTGTTTTCA 

151 ACCGTTGCGC TCGGGGCTGC CGCCGTATTG CGTCGGGACA mCTTCCGCAC 

201 GCCCCATTGG AAAAACCACT TAAACCGCAG TATGGTCGGG ACGGGGGCGA 

251 TGCTGCTGCT GTTTTACGCG GTAACGCATC TGCCTTTGGC CACTGGCGTT 

301 ACCCTGAGTT ACACCTCGTC GATTTTTTTG GCGGTATTTT CCTTCCTGAT 

351 TTTGAAAGAA CGGATTTCCG TTTACACGCA GGCGGTGCTG CTCCTTGGTT 

401 TTGCCGGCGT GGTATTGCTG CTTAATCCCT CGTTCCGCAG CGGTCAGGAA 

451 ACGGCGGCAC TCGCCGGGCT GGCGGGCGGC GCGATGTCCG GCTGGGCGTA 

501 TTTGAAAGTG CGCGAACTGT CTTTGGCGGG CGAACCCGGC TGGCGCGTCG 

551 TGTTTTACCT TTCCGTGACA GGTGTGGCGA TGTCGTCGGT TTGGGCGACG 

601 CTGACCGGCT GGCACACCCT GTCCTTTCCA TCGGCAGTTT ATCTGTCGTG 

651 CATCGGCGTG TCCGCGCTGA TTGCCCAACT GTCGATGACG CGCGCCTACA 

701 AAGTCGGCGA CAAATTCACG GTTGCCTCGC TTTCCTATAT GACCGTCGTT 

751 TTTTCCGCTC TGTCTGCCGC ATTTTTTCTG GGCGAAGAGC TTTTCTGGCA 

801 GGAAATACTC GGTATGTGCA TCATCATCCT CAGCGGTATT TTGAGCAGCA 

851 TCCGCCCCAC TGCCTTCAAA CAGCGGCTGC AATCCCTGTT CCGCCAAAGA 

901 TAA 



This corresponds to the amino acid sequence <SEQ ID 670; ORF 161 >: 

ml 61 .pep 

1 MDTAKKDILG SGWMLVAAA C FTIMNVLIKE ASAKFALGSG EL VFWRMLFS 

51 TVALGAAAV L RRDXFRTPHW KNHLNRSMVG TGAMLLLFYA VT.HLPLATGV 

101 T LSYTSSIFL AVFSFLIL KE RISVYTQA VL LLGFAGVV LL LNPSFRSGQE 
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851 TCCGCCCCAC TGCCTTCAAA CAGCGGCTGC AATCCCTGTT CCGCCAAAGA 

901 TAA 

This corresponds to the amino acid sequence <SEQ ID 672; ORF 161 .a>: 

a!6i . pep 

1 MDTAKKDILG SGWMLVAAA C FTIMNVLIKE ASAKFALGSG EL VFWRMLFS 

51 TVALGAAAV L RRDTFRTPHW KNHLNRSMVG TGAMLLLFYA VTHLPLATGV 

101 T LSYTSSIFL AVFSFLIL KE RISVYTQA VL LLGFAGWLL LNPSF RSGQE 

151 TAALAGLAGG AMSGWAYLKV RELSLAGEPG WR WFYLSVT GVAMSSVWA T 

201 LTGWHTLS FP SAVYLSCIGV SALIA QLSMT RAYKVGDKFT VASLSYMTVV 

251 FSALSAAFFL AEELFWQ EIL GMCIIILSGI LSSI RPTAFK QRLQSLFRQR 

301 * 



ml61/al61 99.3% identity in 300 aa overlap 



10 20 30 40 50 60 

ml 61 . pep MDTAKKDILGSGWMLVAAACFTIMNVLIKEASAKFALGSGELVFWRMLFSTVALGAAAVL 
I I I I I ! I i I I II I I ! i I I I I I II i I I I M I ! i i I I I I II I II I I I I I I I [ I ! I I I ! I I I I 
al61 MDTAKKDILGSGWMLVAAACFTIMNVLIKEASAKFALGSGELVFWRMLFSTVALGAAAVL 

10 20 30 40 50 60 



70 80 90 100 110 120 

ml 61 .oep RRDXFRTPHWKNHLNRSMVGTGAMLLLFYAVTHLPLATGVTLSYTSSIFLAVFSFLILKE 
I I I : I ! i I I I I I I I I ! I II I I I I I II I II I i I II I I I I I II i I I I I ! I II 1 II II M I I I 
al61 RRDTFRTPHWKNHLNRSMVGTGAMLLLFYAVTHLPLATGVTLSYTSSI FLAVFSFLILKE 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml61 .pep R I S V YTQAVLL LG FAG WLLLN P S FR SGQETAALAGLAGGAMSGWAY LKVRE L S LAGE PG 

I M I I 1 I M I I I I I I I I M I I I I I i M I I I I I I I II I I I I I M II I I I N I I I I I I I I I I 
a!61 RISVYTQAVLLLGFAGWLLLNPSFRSGQETAALAGLAGGAMSGWAYLKVRELSLAGEPG 

130 140 150 160 170 180 



190 200 210 220 230 240 

ml61.pep WRWFYLSVTGVAMSSVWATLTGWHTLSFPSAVYLSCIGVSALIAQLSMTRAYKVGDKFT 
II I I II I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I ! I I I I I i i I I I i I I M I I I I M i i 
al61 WRVVFYLSVTGVAMSSVWATLTGWHTLSFPSAVYLSCIGVSALIAQLSMTRAYKVGDKFT 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 61. pep VASLSYMTVVFSALSAAFFLGEELFWQEILGMCIIILSGILSSIRPTAFKQRLQSLFRQR 
I I I I M I I I I I I I i I I I I I I : I I I II I I M I I I M I I ! I I 1 I I I I (I I I I M I I I I I I II 
al61 VASLSYMTWFSALSAAFFLAEELFWQEILGMCIIILSGILSSIRPTAFKQRLQSLFRQR 

250 260 270 280 290 300 



ml 61. pep X 

I 

al61 X 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 673>: 

gl 63 . seq 

1 ATGGTTATTT TGACGACTTT GTTTTTTGTG TGTGTTTTGG TGGTATTGGT 

51 TTTAACCGTG CCGGATCAGG TGCAGATGTG gctCGACCGG GCAAAAGAAG 

101 TCATTTTTAC CGAGTTCAGC TGGTTTTATG TTTTAACGTT TTCCATTTTt 

151 ctgGGTTTtc tgctGATACT CTCGGTCAGC GGTTTGGGAA ACATcagGCT 

201 AGGACGGGAT GAAGATGTGC CGGAATTCGG CTTCCTGTCG TGGCTGGCGA 

251 TGCTGTTTGC GGCCGGGATG GGCGTGGGCC TGATGTTTTT CGGCGTGGCA 

301 GAGCCGTTGA TGCATTATTT TTCGGACATT ACGGTCGGCG CGCCGGAACA 

351 CAGGCAGCAG CAGGCATTGC TGCACACGGT GTTCCATTGG GGCGTTCACG 

4 01 CCTGGTCGGT GTACGGTACG ATTGCATTGG CTTTGGCTTA TTTCGGTTTC 
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651 CGTGCAGGTT TTGATTATCG 

7 01 CAATATCCGG CGTGGGGAAG 
751 GGCCTTGCGT TTTTGCTGCT 
801 TTACCTGTTG TCGGCATTCG 

8 51 TGGTGCGCCT CAGTTTTAAA 
901 TGGTTTGAAT CTTGGACGGT 
951 GCCGTTTGTG GGTTTGTTTA 

1001 GCGAGTTTGT CTTCGGGGTT 

1051 TGGTTTACCG TCTTCGGCAA 

1101 GGGGGGAATG CTCGAAAAGA 

1151 AATTCTTTAA TTACCTCCCC 

1201 CTGGTCATTT CTCTGTTTTT 

1251 CCTGAACAAT ATTACCTCTC 

1301 AGGCGGTTAT GTGGGGCGTG 

1351 CGCTCGGGCG GACTCGGCAA 

14 01 GCCGTTTGCC CTGCTGATGC 

14 51 TGAGTGCGGA TAAGAAATAT 

1501 TTTTGGACGG GCGGCAAGTG 

1551 GACGCAGGAG CAGGATATTT 

1601 CTATGCACGA GTTGCAACGG 

1651 CGGGTCGATA AAATGTTTCA 

1701 TCGGAAAGAG ACGATGCGCG 

17 51 AGGATGTATC CGACCAGTTG 

1801 CATCAGACAA CTTACAAACC 

1851 GTACGATGTG CAGTATATGA 

1901 AAAACTACGA ACGTTATTTG 

1951 ATGGCGCACG AGCAGGTGGA 



CCGCCGTCAT GTCCCTCGCC GTCGTTTCGG 
GGCGTGAAGG TGTTGAGCGA GTTGAACCTG 
GTTTTTTGTT TTGGCGGCGG GACCCACTGT 
GCGACAACAT AGGGAACTAC CTCGGAAATC 
ACTTATGCGT ACGAACGGGA ACACAAGCCG 
GCTTTATTGG GCGTGGTGGT GTTCTTGGGC 
TCGCGCGCAT TTCAAAGGGG CGCACCATCC 
TTGCTCATCC CCGGCCTGTT CGGCGTTTTG 
TACGGCGATT TGGCTGAATG ACGGGGTTGC 
TGACCTCCTC TCCGGAAACG CTGCTTTTTA 
CTGCCCGAA7 TGACGAGCAT CGTCAGCCTG 
TGTAACTTCT GCCGATTCCG GGATTTATGT 
GGGACAAAGG CTTGAGCGCG CCACGGTGGC 
CTGATGTCTG CCGTTGCCGT TTTGCTGATG 
CCTGCAGTCT ATGACCCTGA TTGTTTCCCT 
TGATAATGTG TTTCAGCCTG TGGAAAGGCT 
TTTGAGACCC GGGTTAACCC TACCAGTGTA 
GAAAGAACGG CTGGTGCAGA TAATGAGCCA 
TAAAATTCCT C AAA C AG ACT GCATCGCCCG 
GAGCTTTCGG AAGAATACGG CTTGAGCGTC 
TCGGGACGAG CCCGCAATCG AGTTCGTCAT 
ATTTTATGTA CGGGATTAAG TCTGTCGGGC 
ATTAACGACG GCAAGCTGCC GCATATCCGG 
CTACGCTTAT TTTTTCGACG GGCGCGTCGG 
ACAAGGACGA GCTGATTGCC GACATTTTGA 
ATGTTGTTGG ATGATGTCGG TCAGGAACTG 
ATTGGCAGAG TAA 



This corresponds to the amino acid sequence <SEQ ID 676; ORF 163>: 

ml 63 . pep 

1 M VILTTLFFV CVLWLVL TV PDQVQMWLDR AKEVIFTEF5 WFYVLTFSIF 

51 LGFLLILSVS SLGNIRLGRD EDVPEFGFLS wl amlfaagm GVGLMFFGV A 

101 EPLMHYFSDI TAGTPEHRQQ QALLHTVFHW GVHAWSVYGT IALALAYFGF 

151 RYKLPLALRS CFYPLLKEKI SGRFGDAID I MALLATFFGI ITTLGF GASQ 

201 LGAGLOEMGW IAENSFSVQ V LI I AA VMS LA VVSAIS GVGK GVKVLSELML 

2 51 GLAFLLLFFV LAAG PTVYLL SAFGDNIGNY LGNLVRLSFK TYAYEREHKP 

301 WFESWTVLYW AWWCSWAPFV GLFIARISKG RTIRE FVFGV LLIPGLFGVL 

351 WFTVFGNTAI WLNDGVAGGM LEKMTSSPET LLFKFFNYLP LPE LTSIVSL 

401 LVISLFFVTS ADSGIYVLNN ITSRDKGLSA PRW QAVMWGV LMSAVAVLLM . 

4 51 RSGGLGNLQS M TLIVSLPFA LLMLIMCF SL WKGLSADKKY FETRVNPTSV 

501 FWTGGKWKER LVQIMSQTQE QDILKFLKQT ASPAMHELQR ELSEEYGLSV 

551 RVDKMFHRDE PAIEFVIRKE TMRDFMYGIK SVGQDVSDQL INDGKLPHIR 

601 HQTTYKPYAY FFDGRVGYDV QYMNKDELIA DILKNYERYL MLLDDVGQEL 

651 MAHEQVELAE * 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 



ml63 / gl63 98.6% identity in 660 aa overlap 

10 20 30 40 50 60 

ml 63. pep MVILTTLFFVCVLWLVLTVPDQVQMWLDRAKEVIFTEFSWFYVLTFSIFLGFLLILSVS 
i I I I I i H It I I i I H I I I I 1 I I i II II M i t S I f I i t I I t I I I i I i I I I I t I t i I I I 1) 
gl63 MVILTTLFFVCVLWLVLTVPDQVQMWLDRAKEVIFTEFSWFYVLTFSIFLGFLLILSVS 

10 20 30 40 50 60 
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ml 63 . pep SLGNIRLGRDEDVPEFGFLSWLAMLFAAGMGVGLMFFGVAEPLMHYFSDITAGTPEHRQQ 
• f I I I II I I I I I I t i I I i I I I i t I I 1 1 I I I M f I I ill I i I I I I I || | f M : f : | M I | | 
9163 GLGNIRLGRDEDVPEFGFLSWLAMLFAAGMGVGLMFFGVAEPLMHYFSDITVGAPEHRQQ 
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201 


CGGACGGGAT 


GAAGATGTGC 


251 


TGCTGTTTGC 


GGCCGGGATG 


301 


GAGCCGTTGA 


TGCATTATTT 


351 


CAGGCAGCAG 


CAGGCATTGC 


401 


CTTGGTCGGT 


GTACGGTACG 


451 


CGCTACAAGC 


TGCCGCTTGC 


501 


AGAAAAAATT 


TCCGGAAGGT 


551 


TTGCTACTTT 


TTTCGGCATC 


601 


CTGGGCGCCG 


GATTGCAGGA 


651 


CGTGCAGGTT 


TTGATTATCG 


701 


CAATATCCGG 


CGTGGGGAAG 


751 


GGTCTTGCGT 


TTTTGCTGCT 


801 


TTACCTGTTG 


TCGGCATTCG 


851 


TGGTGCGCCT 


CAGTTTTAAA 


901 


TGGTTTGAAT 


CTTGGACGGT 


951 


GCCGTTTGTG 


GGTTTGTTTA 


1001 


GCGAGTTTGT 


CTTCGGGGTT 


1051 


TGGTTTACCG 


TCTTCGGCAA 


1101 


GGGGGGAGTG 


CTCGAAAAGA 


1151 


AATTCTTTAA 


TTACCTCCCC 


1201 


CTGGTCATTT 


CTCTGTTTTT 


1251 


CCTGAACAAT 


ATTACCTCTC 


1301 


AGGCGGTTAT 


GTGGGGCGTG 


1351 


CGCTCGGGCG 


GACTCGGCAA 


1401 


GCCGTTTGCC 


CTGCTGATGC 


1451 


TGAGTGCGGA 


TAAGAAATAT 


1501 


TTTTGGACGG 


GCGGCAAGTG 


1551 


GACGCAGGAG 


CAGGATATTT 


1601 


CTATGCACGA 


GTTACAACGG 


1651 


CGGGTCGATA 


AGATGTTTCA 


1701 


TCGGAAAGAG 


ACGATGCGCG 


1751 


AGGATGTATC 


CGACCAGTTG 


1801 


CATC AG AC AA 


CTTACAAACC 


1851 


GTACGATGTG 


CAGTATATGA 


1901 


AAAACTACGA 


ACGTTATTTG 


1951 


ATGGCGCACG 


AGCAGGTGGA 



CGGAATTCGG CTTCCTGTCG TGGCTGGCGA 
GGCGTGGGTC TGATGTTTTT CGGCGTGGCA 
TTCGGACATT ACGGCCGGCA CGCCGGAACA 
TGCACACGGT GTTCCATTGG GGCGTTCACG 
ATTGCATTGG CTTTGGCTTA TTTCGGTTTC 
CCTGCGTTCT TGTTTTTACC CCCTGTTGAA 
TCGGCGATGC CATTGATATT ATGGCGTTGC 
ATCACCACAT TGGGGTTCGG GGCTTCGCAA 
AATAGGCTGG ATTGCCGAAA ACAGCTTCAG 
CCGCCGTCAT GTCCCTCGCC GTCGTTTCGG 
GGTGTGAAGG TGTTGAGCGA GTTGAACCTG 
GTTTTTTGTT TTGGCGGCGG GTCCCACTGT 
GCGACAACAT AGGGAACTAC CTCGGAAATC 
ACTTATGCGT ACGAACGGGA ACACAAGCCG 
GCTTTATTGG GCGTGGTGGT GTTCTTGGGC 
TCGCGCGCAT TTCAAAGGGG CGCACCATCC 
TTGCTCATCC CCGGCCTGTT CGGCGTTTTG 
TACGGCGATT TGGCTGAATG ACGGGGTTGC 
TGACCTCCTC TCCGGAAACG CTGCTTTTTA 
CTGCCCGAAT TGACGAGCAT CGTCAGCCTG 
TGTAACTTCT GCCGATTCCG GGATTTATGT 
GGGACAAAGG CTTGAGCGCG CCACGGTGGC 
CTGATGTCTG CCGTTGCCGT TTTGCTGATG 
CCTGCAGTCT ATGACCCTGA TTGTTTCCCT 
TGATAATGTG TTTCAGCCTG TGGAAAGGAT 
TTTGAGACCC GGGTTAACCC TACCAGTGTA 
GAAAGAACGG CTGGTGCAGA TAATGAGCCA 
TAAAATTCCT CAAACATACC GCATCGCCCG 
GAGCTTTCGG AAGAATACGG CTTGAGCGTC 
TCAGGACGAG CCCGCAATCG AGTTCGTCAT 
ATTTTATGTA CGGGATTAAG TCTGTCGGGC 
ATTAACGACG GCAAGCTGCC GCATATCCGG 
CTACGCTTAT TTTTTCGACG GGCGCGTCGG 
ACAAGGACGA GCTGATTGCC GACATTTTGA 
ATGTTGTTGG ATGATGTCGG TCAGGAACTG 
ATTGGCAGAG TAA 



This corresponds to the amino acid sequence <SEQ ID 678; ORF 163.a>: 

a 163 . pep 

1 M VILTTLFFV CViWLVL TV PDQVQMWLDR AKEVIFTEFS WFY VLTFSIF 

51 LGFLLIL5V5 SLGNIRLGRD EDVPEFGFLS WL AMLFAAGM GVGLMFFGV A 

101 EPLMHYFSDI TAGTPEHRQQ QALLHTVFHW GVHAWSVYGT IALALAYFGF 

151 RYKLPLALRS CFYPLLKEKI SGRFGDAID I MALLATFFGI ITTLGF GASQ 

201 LGAGLQEIGW IAENSFSVQ V LIIAAVMSLA WSAIS GVGK GVKVLSELNL 

251 GLAFLLLFFV LAAG PTVYLL SAFGDNIGNY LGNLVRLSFK TYAYEREHKP 

301 WFESWTVLYW AWWCSWAPFV GLFIARISKG RTIRE FVFGV LLIPGLFGVL 

351 WFTVFGNTAI WLNDGVAGGV LEKMT3SPET LLFKFFNYLP LPELTSIVSL 

^01 LVXSLFFVTS ADSGIYVLNN ITSRDKGLSA PRW QAVMWGV LMSAVAVLLM 

4 51 RSGGLGNLQS M TLIVSLFFA LLMLIMCF SL WKGLSADKKY FETRVNPTSV 

501 FWTGGKWKER LVQIMSQTQE QDILKFLKHT ASPAMHELQR ELSEEYGLSV 

551 RVDKMFHQDE PAIEFVIRKE TMRDFMYGIK SVGQDVSDQL INDGKLPHIR 

601 HQTTYKPYAY FFDGRVGYDV QYMNKDELIA DILKNYERYL MLLDDVGQEL 

651 MAHEQVELAE * 



ml63/al63 99.4% identity in 660 aa overlap 



10 20 30 40 50 60 

ml 63. pep MVILTTLFFVCVLVVLVLTVPDQVQMWLDRAKEVIFTEFSWFYVLTFSIFLGFLLILSVS 
I I M I \ I ! II I I I I I I I i I I 1 I I t I I t I I I 1 I I I i t I I I I I M | | | | | M | | M | || || | 
al63 MVILTTLFFVCVLWLVLTVPDQVQMWLDRAKEVIFTEFSWFYVLTFSIFLGFLLILSVS 
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ml 63 . pep SLGNIRLGRDEDVPEFGFLSWLAMLFAAGMGVGLMFFGVAEPLMHYFSDITAGTPEHRQQ 
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The following partial DNA sequence was identified in A ; . gonorrhoeae <SEQ ID 679>: 

gl64.seq {partial} 

1 . . ATGAACACAT TTTTGAAAAA CAGCGAATAC GCGTATATCC TGAACGACTG 

51 CAAGGCGCGC TTCCTGTTCG CCTCGGCCGG CCTGTCAAAA GAATTGGCGG 

101 GCCTGAAGGC GCAAACGCCC GTCGAAAAAA TCATTTGGAC GGACAAAAGC 

151 CGGCCGGCCG GCGAAACGGC GGAAGGCGAT GCCTTTTTTG AAAACGTGCG 

201 CCGCTTCCCC GAAAAACCCG ACTTGGGCCG CCAACCCCGG ATAAATGATT 

251 TGGCACACAT CATCTACACC TCCGGCACGA CGGGGCATCC CAAAGGCGCG 

301 CTAATCAGTT ACGCCAACCT GTTCGCCAAC CTGAACGGCA TCGAACGCAT 

351 CTTtaaAATT TCCAAACGCG ACCGCTTTAT CGTTTTCctg ccgatgTTCC 

401 ACAGCTTCAC GCTGACGGCT ATGGTGCTGC TGCCGATTTA TATGGCGTGT 

4 51 TCGATTATTT TGGTCAAAtc cgttttCCCc ttttccaacG TTTTGAAACA 

501 GGCCCTGCTC AAACGCGCAA CCGTGTTTTT GGGCGTACCC GCGATTTACA 

551 CCGCGATGAG CAAGGCAAAA ATCCCTTGGT ATTTCAGATG GTTCAACCGC 

601 ATCCGCCTGT TTATCAGCGG CGGCGCGCCT TTGGCGGAAC AAACCATCCT 

651 CGATTTTAAA GCCAAGTTCC CCCGCGCCAA ATTGCTGGAA GGCTACGGAC 

701 TGAGCGAAGC CTCGCCCGTC GTCGCCGTCA * ^ACGCCCGA ACGGCAAAAA 

7 51 GCCCGCAGCG TCGGCATCCC CCTGCCCGGT TTGGAAGCCA AAGCCGTCGA 

801 TGAAGAATTG GTCGAAGTGC CGCGCGGCGA AGTGGGCGAA CTGATCGTCA 

851 GGGGCGGTTC GGTGATGCGG GGCTACCTCA ATATGCCTGC CGCCACCGAT 

901 GAAACCATCG TCAACGGCTG GTTGAAAACG GGCGATTTCG TTACCATAGA 

951 CGAGGACGGC TTTATCTTTA TCGTCGACCG CAAAAAAGAT TTGATTATTT 

1001 CCAAAGGTCA AAACGTCTAT CCGCGCGAGA TCGAAGAAGA AATCCACAAA 

1051 CTCGATGCCG TCGAAGCCGC CGCCGTCATC GGCGTGAAAG ACCGTTATGC 

1101 CGACGAGGAA ATCGTCGCCT TCGTCCAATT GAAGGAAGGT ATGGATTTGG 

1151 GCGAGGACGA aatccgccgc caccTGCGTA CCGTGCTGGC AAATTTCAAA 

1201 ATCCCCAAAC AGATCCACTT TAAAGACGGG CTGCCGCGCA ACGCTACGGG 

1251 CAAAGTATTG AAACGGGTGC TGAAGGAGCA GTTTGAAGGA AACAAATGA 

This corresponds to the amino acid sequence <SEQ ID 680; ORF 164.ng>: 

gl64.pep (partial) 

1 . . MNTFLKNSEY AYILNDCKAR FLFASAGLSK ELAGLKAQTP VEKIIWTDKS 

51 RPAGETAEGD AFFENVRRFP EKPDLGRQPR INDLAHIIYT SGTTGHPKGA 

101 LISYANLFAN LNGIERIFKI SKRDRFIVFL PMFHSFTL TA MVLLPIYMAC 

151 SIILV KSVFP FSNVLKQALL KRATVFLGVP AIYTAMSKAK IPWYFRWFNR 

201 IRLFISGGAP LAEQTILDFK AKFPRAKLLE GYGLSEASPV VAVNTPERQK 

251 ARSVGIPLPG LEAKAVDEEL VEVPRGEVGE LIVRGGSVMR GYLNMPAATD 

301 ETIVNGWLKT GDFVTIDEDG FIFIVDRKKD LII SKGQNVY PREIEEEIHK 

351 LDAVEAAAVI GVKDRYADEE IVAFVQLKEG MDLGEDEIRR HLRTVLANFK 

4 01 IPKQIHFKDG LPRNATGKVL KRVLKEQFEG NK* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 68 1>: 

ml64 . seq 

1 ATGAACCGGA CTTATGCCAA TTTCTACGAA ATGCTCGCCG CCGCCTGCCG 

51 CAAAAACGGA AACGGCACGG CAGTGTTCGA CGGCAAGGAA AAAACCGCCT 

101 ACCGCGCGCT CAAGCAGGAG GCCGAAGCCG TCGCGGCGTA TCTGCAAAAT 

151 ATCGGCGTGA AGTTCGGCGA CACGGTCGCG CTGGCGGTTT CCAATTCCAC 

201 AGAATTTATT ACCGCCTATT TCGCCATCTC CGCCATCGGC GCGGTCGCCG 

251 TACCGATGAA CACATTTTTG AAAAACAGCG AATACGCGTA TATCCTGAAC 

301 GACTGCAAGG CGCGCTTCCT GTTCGCCTCG GCCGGCCTGT CAAAAGAATT 

351 GGCGGGCTTG AAGGCGCAAA CGCCCGTCGA AAAAATCATT TGGACGGACA 

401 AAAGCCGTCC GACCGGCGAA ACGGCGGAAG GCGATGCCTT TTTTGAAGAC 

4 51 GTGCGCCGCT TCCCCGAAAA ACCCGACTTG GGCCGCCAAC CCCGGATAAA 

501 TGATTTGGCA CACATCATCT ACACCTCCGG CACGACGGGG CAT CCC AAA G 

551 GCGCGCTAAT CAGTTACGCC AACCTGTTCG CCAACCTGAA CGGCATCGAA 

601 CGCATCTTTA AAATTTCCAA GCGCGACCGC TTTATCGTTT TCCTGCCGAT 

651 GTTCCACAGC TTCACGCTGA CGGCTATGGT GCTGCTGCCG ATTTATATGG 

7 01 CGTGTTCGAT TATTTTGGTC AAATCCGTTT TTCCGTTTTC CAACGTTTTG 

7 51 AAACAGACAC TGCTCAAACG CGCGACCGTG TTTTTGGGCG TACCCGCGAT 

801 TTACACCGCG ATGAGCAAGG CGAAAATCCC TTGGTATTTC AGATGGTTCA 

851 ACCGCATTCG CCTGTTTATC AGCGGCGGCG CGCCTTTGGC GGAACAAACC 

901 ATCCTCGATT TCAAAGCCAA GTTCCCCCGC GCCAAATTGC TGGAAGGCTA 
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ml 64 .pep VEVPRGEVGEL.IVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTIDEDGFIFIVDRKKD 
I I f I I I I I I 1 I I I I I I I I I I M I I I | | | | | | J | | j | | | | | | | | | | | | | | | | | | | | | } | | J 
gl64 VEVPRGEVGELIVRGGSVMRGYLNMPAATDETIVNGWLKTGDFVTIDEDGFIFIVDRKKD 
280 290 300 310 320 330 

420 430 440 450 4 60 470 

ml 64 . pep LIISKGQNVYPREIEEEIYKLDAVEAAAVIGVKDRYADEEIVAFVQLKEGMDLGENEIRR 
i ! f 1 I I I ! 1 I M I I I I ! I : I M i I 1 I II I I I I I I II M I I 1 I I t I I I t I I I j M I : I I | | 
9164 LIISKGQNVypREIEEEIHKLDAVEAAAVIGVKDRYADEEIVAFVQLKEGMDLGEDEIRR 
340 350 360 370 380 390 

480 490 500 510 

ml64 .pep HLRTVLANFKIPKQIHFKDGLPRNATGKVLKRVLKECFDGNKX 
I I ( I I [ I I I I I I I I I I I I I I ! f I I i M I I I ! ( I II ! 1 ! : i ! I i 
gl64 HLRTVLANFKIPKQIHFKDGLPRNATGKVLKRVLKEQFEGNKX 
400 410 420 430 

The following partial DNA sequence was identified in A r . meningitidis <SEQ ID 683>: 

al 64 . seq 

1 ATGAACCGGA CTTATGCCAA TTTCTACGAA ATGCTGACCG CCGCCTGCCG 

51 CAAAAACGGC AACGGCACGG CAGTGTTCGA CGGCAAGGAA AAAACCGCCT 

101 ACCGCGCGCT CAAGCAGGAA GCCGAAGCCG TTGCGGCGTA TCTGCAAAAT 

151 ATCGGCGTGA AGTTCGGCGA CACGGTCGCG CTGGCGGTTT CCAATTCCAC 

201 GGAATTTATT ACCGCCTATT TCGCCGTATC CGCCATCGGC GCGGTTGCCG 

251 TACCGATGAA CACATTTTTG AAAAACAGCG AATACGCGTA TATCCTGAAC 

301 GACTGCAAGG CGCGCTTCCT GTTCGCCTCG GCCGGCCTGT CAAAAGAATT 

351 GGCGGGCTTG AAGGCGCAAA CGCCCGTCGA AAAAATCATT TGGACGGGCC 

401 AAAGCCGTCC GGACGGCGAA ATGGCGGAAG GCGATGCCTT TTTTGAAGAC 

4 51 GTGCGCCGCT TCCCCGAAAA ACCCGACTTG GGCCGCCAAC CCCGGATAAA 

501 TGATTTGGCA CACATCATCT ACACCTCCGG CACGACGGGG CATCCCAAAG 

551 GTGCGCTAAT CAGCTACGCC AACCTGTTCG CCAACCTGAA CGGCATCGAA 

601 CGCATCTTTA AAATCTCCAA GCGCGACCGC TTTATCGTTT TCCTGCCGAT 

651 GTTCCACAGC TTCACGCTGA CZGCTLIOG'i GCTGCTGCCG ATTTATATGG 

701 CGTGTTCGAT TATTTTGGTC AAATCCGTTT TCCCCTTTTC CAACGTTTTG 

751 AAACAGGCAC TGCTCAAACG CGCGACCGTG TTTTTGGGCG TGCCCGCGAT 

801 TTACACCGCG ATGAGCAAGA CGAAAATCCC TTGGTATTTC AGATGGTTCA 

851 ACCGCATCCG CCTGTTTATC AGCGGCGGAG CACCTTTGGC GGAACAAACC 

901 ATCCTCGATT TCAAAGCCAA GTTCCCCCGC GCCAAATTGC TGGAAGGCTA 

951 CGGACTGAGC GAAGCCTCGC CCGTCGTCGC CGTCAATACG CCCGAGAGGC 

1001 AAAAAGCCCG CAGCGTCGGC ATCCCCCTGC CCGGTTTGGA AGTCAAAGCC 

1051 GTCGATGAAG AATTGGTCGA AGTGCCGCGC GGCGAAGTGG GCGAACTGAT 

1101 CGTCAGGGGC GGTTCGGTGA TGCGGGGCTA CCTCAATATG CCTGCCGCCA 

1151 CCGATGAAAC CATCGTCAAC GGCTGGTTGA AAACGGGCGA TTTCGTTACC 

1201 ATAGACGAAG ACGGCTTTAT CTTTATCGTC GACCGCAAAA AAGATTTGAT 

1251 TATTTCCAAA GGTCAAAATG TCTATCCGCG CGAAATCGAA GAAGAAATCT 

1301 ACAAACTCGA TGCCGTCGAA GCCGCCGCCG TCATCGGCGT GAAAGACCGT 

1351 TATGCCGACG AGGAAATCGT CGCCTTCGTC CAATTGAAGG AAGGTATGGA 

14 01 TTTGGGCGAG AACGAAATCC GCCGCCACCT GCGTACCGTG CTGGCAAATT 

14 51 TCAAAATCCC CAAACAAATC CACTTTAAAG ACGGGCTGCC GCGCAACGCT 

1501 ACGGGCAAGG TATTGAAACG GGTGTTGAAG GAGCAGTTTG ACGGAAACAA 

1551 ATGA 

This corresponds to the amino acid sequence <SEQ ID 684; ORF 164.a>: 

al64 -pep 

1 MNRTYANFYE MLTAACRKNG NGTAVFDGKE KTAYRALKQE AEAVAAYLQN 

51 IGVKFGDTVA LAVSNSTE FI TAYFAVSAIG AVAVP MNTFL KNSEYAYILN 

101 DCKARFLFAS AGLSKELAGL KAQTPVEKII WTGQSRPDGE MAEGDAFFED 

151 VRRFPEKPDL GRQPRINDLA HIIYTSGTTG HPKGALISYA NLFANLNGIE 

2 01 RIFKISKRDR FIVFLPMFHS FTL TAMVLL? IYMACSIILV KSVFPFSNVL 
251 KQALLKRATV FLGVPAIYTA MSKTKIPWYF RWFNRIRLFI SGGAPLAEQT 
301 ILDFKAKFPR AKLLEGYGLS EASPWAVNT PERQKARSVG IPLPGLEVKA 

3 51 VDEELVEVPR GEVGELIVRG GSVMRGYLNM PAAT DETIVN GWLKTGDFVT 
401 IDEDGFI FIV DRKKDLIISK GQNVYPREIE EEIYKLDAVE AAAVIGVKDR 

4 51 YADEEIVAFV QLKEGMDLGE NEIRRHLRTV LANFKIPKQI HFKDGLPRNA 
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51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAagat gTGGCGTTGG AATCGTCAAA cGCGTGGAAC 

151 AACGcCGgca CGGGGCATTC CGcGCTGTGc GAATTGAACT AtgcgccGCT 

201 GGGtgcggac ggcgtcatCA ATCCGGCGCg cgCCCTGAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGcga cgctgGTCGC GGAAGGCAAG 

301 TTGGAagaCA ATTCCTTCAT CAATGCcgtg ccgcatatGT Ctttggtgat 

351 gAacgaagac cacrgCCgtt acCTGCAAAA ACGCTATGAT GTGTTTAAAA 

4 01 CGCAGAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

4 51 TCCGATTGGG CtccgCTGAT TATGCGCGGC CGGgacgaaA ACCAACCCGT 

501 CGCCGCCAAC TATTCCGCCG Aaggcacgga tgtcgATTTC GGACGGCTGA 

551 CGCGCCAGAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCC-ACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCAGACTG GCAGCTCACC CTCCGCACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CACTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTACCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCG7C GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTAGACGG CAAACGACAC CTTATGTTCG GTCCTTACGC 

951 AGGTTTCCGT TCCAACTTCC TCAAGCAAGG CTCGTTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGCGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CcTGCTGGgC gAaTTGCgtt aa 

This corresponds to the amino acid sequence <SEQ ID 686; ORF 1 65. ng>: 

g 165 .pep 

1 MAEATD WLV GGG1MSATLG VLL KELEPSW EITLIERLED VALES SNAWN 

51 NAGTGHSALC ELNYAPLGAD GVINPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCRYLQKRYD VFKTQKLFEN MEFSTDRNKI 

151 SDWAFLIMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKRE SDGAWVLKTA DTRNPDWQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GLPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRR LMFGPYAGFR SNFLKQGSFM DLPLSIHMDN LYPMLRAGWA 

3 51 NMPLTKYLLG ELR* 

i I*w following partial DNA sequence was identified in N. meningitidis <SEQ ID 687>; 

ml65.seq (partial) 

1 ATGGCTGAAG CGACAGACGT TGTCTTGGTG GGCGGCGGCA TTATGAGCGC 

51 GACTTTGGGC GTTTTGCTCA AAGAACTCGA ACCGTCTTGG GAAATCACCC 

101 TGATTGAACG CTTGGAAGAT GTGGCGTTGG AATCGTCAAA CGCGTGGAAC 

151 AACGCCGGCA CGGGGCATTC CGCGCTGTGC GAATTGAACT ATGCGCCGTT 

201 GGGTGCAAAT GGGATTATCG ATCCGGCGCG CGCCCTCAAT ATTGCCGAAC 

251 AGTTTCATGT CAGCCGCCAG TTTTGGGCGA CGCTGGTCGC GGAAGGCAAG 

301 TTGGAAGACA ATTCCTTCAT CAATGCCGTG CCGCATATGT CTTTGGTGAT 

351 GAATGAAGAC CATTGTTCTT ATCTTCAAAA ACGTTATGAC GCGTTTAAAA 

401 CCCAAAAACT TTTTGAAAAT ATGGAATTTT CCACCGATCG GAACAAAATT 

451 TCCGATTGGG CTCCGCTGAT GATGCGCGGC CGGGACGAAA ACCAACCCGT 

501 CGCCGCCAAC TACTCCGCCG AAGgTACGGA TGTCGATTTC GGACGGCTGA 

551 CGCGCCAAAT GGTGAAATAT TTGCAGGGCA AGGGCGTAAA AACCGAGTTC 

601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCCGACGG GCAGCTCACC CTCCGTACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CGCTGACCCT GCTGCAAAAA 

751 TCCGGCATCC CCGAAGGCAA AGGCTACGGC GGCTTCCCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTGGACGG CAAACGCCAC CTTATGTTCG GCCCTTACGC 

951 AGGCTTCCGT TCCAACTTCC TCAAGCAAGG CTCGCTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGTGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAA . . . 

This corresponds to the amino acid sequence <SEQ ID 688; ORF 165>: 

ml65.pep (partial) 

1 MAEATD WLV GGGIMSATLG VLL KELEPSW EITLIERLED VALES SNAWN 

51 NAGTGHSALC ELNYAPLGAN GIIDPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQKLFEN MEFSTDRNKI 

151 SDWAPLMMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 
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601 AACCGCCACG TCGAAGACAT CAAACGCGAA TCCGACGGCG CGTGGGTGCT 

651 CAAAACCGCC GATACCCGCA ACCCCGACGG GCAGCTCACC CTCCGTACCC 

701 GCTTCCTCTT CCTCGGCGCG GGCGGCGGCG CGCTGACCCT GCTGCAAAAA 

7 51 TCCGGCATCC CCGAAGGCAA AGGCTACGGT GGCTTTCCCG TGTCCGGCCT 

801 GTTCTTCCGC AACAGCAACC CCGAAACCGC CGAACAACAC AACGCCAAAG 

851 TGTACGGGCA GGCTTCCGTC GGCGCGCCGC CGATGTCCGT CCCGCACCTC 

901 GACACACGCA ACGTGGACGG CAAACGCCAC CTTATGTTCG GCGCTTACGC 

951 AGGCTTCCGT TCCAACTTCC TCAAGCAAGG CTCACTTATG GATTTGCCGC 

1001 TGTCCATCCA TATGGACAAC CTCTATCCTA TGCTGCGCGC CGGCTGGGCG 

1051 AATATGCCGC TGACCAAATA CCTGCTGGGC GAATTGCGTA AAACCAAAGA 

1101 AGAACGCTTC GCCTCCCTGC TGGAATACTA CCCCGAGGCA AACCCCGACG 

1151 ACTGGGAACT CATCACCGCA GGGCAACGCG TTCAAATCAT TAAAAAAGAC 

1201 TCCGAAAAAG GCGGCGTGTT GCAGTTTGGT ACGGAGATTG TCGCACACGC 

12 51 CGACGGCTCG CTCGCCGCAT TGCTGGGCGC GTCGCCGGGC GCATCGACCG 

1301 CCGTGCCGCT GATGATCCGG CTGATGCACC AATGCTTCCC CGAACGCACC 

1351 CCGTCTTGGG AAGGCCGTCT GAAAGAGCTG GTACCGGGTT ACGGCATCAA 

14 01 GTTGAACGAA AACCCCGAAA GGGCGGATGA AATTATCGCC TATACCGCGA 

14 51 AAGTGTTGGA TATTTAA 

This corresponds to the amino acid sequence <SEQ ID 690; ORF 165. a>; 

a 165 .pep 

1 MAEATD WLV GGG IMSATLG VLL KELEPSW EITLIERLED VALES SNAWN 

51 NAGTGHSALC ELNYAPLGAN GIIDPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQKLFEN MEFSTDRNKI 

151 SDWAPLMMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKKE SDGAWVLKTA DTRNPDGQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GFPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 LTRNVDGKRH LMFGPYAGFR 5NFLKQGSLM DLPLSIHMDN LYPMLRAGWA 

351 NMPLTKYLLG ELRKTKEERF ASLLEYYPEA NPDDWELITA GQRVQIIKKD 

4 01 SEKGGVLQFG TEIVAHADGS LAALLGASPG ASTAVPLMIR LMHQCFPERT 

4 51 PSWEGRLKEL VPGYGIKLNE NPERADEIIA YTAKVLDI * 

ml65/al65 99.7% identity in 356 aa overlap 

10 20 30 40 50 60 

m!65.pep MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 
I M I I I I I I I I I I I I II I I M M I I ! M I i ! I J M ! I M I I I I I ! I I I i I I M I i f I I I I 
a 165 MAEATDWLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 

10 20 30 40 50 60 

70 80 90 100 110 120 

ml 65. pep ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 
M i I M It I II I I II i I I I I M I I I I I I I I II i i 1 i I I I I I U \ I I I I I S I M 1 t I M I I 
a!65 ELNYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

70 80 90 100 110 120 

130 140 150 160 170 180 

ml 65 -pep HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAAN YSAEGTDVDF 

I II I I I i I I 1 I M I I I I I I 1 I I I If I t M I I II I M I I i ( I U I I I I I M II M ! I M I f 
a 165 HCSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAAN YSAEGTDVDF 

130 140 150 160 170 180 

190 200 210 220 230 240 

ml 65 . pep GRLTRQ^5VKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 
i I I I I I I M I I I II I I M I I I I I I I I I I I I I I i I I I | | M M | | | !| I | | I | | ! M | M I 
al 65 GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 

190 200 210 220 230 240 

250 260 270 280 290 300 

ml 65. pep GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
1 I I I I I 1 M II M I I I I I II I I I I I I I i M I M I (I | | ! | | [ | ( | || i i | I i ( | | | ! I (( 
a 165 GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSMPETAEQHNAKVYGQASVGAPPMSVPHL 

250 260 270 280 290 300 
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351 TGTACGGGCA GGCTTCCGTC 

901 GACACACGCA ACGTGGACGG 

951 AGGCTTCCGT TCCAACTTCC 

1001 TGTCCATCCA TATGGACAAC 

1051 AATATGCCGC TGACCAAATA 

1101 AGAACGCTTC GCCTCCCTGC 

1151 ACTGGGAACT CATCACCGCA 

1201 TCCGAAAAAG GCGGCGTGCT 

1251 CGACGGCTCA CTCGCCGCAT 

1301 CTGTGCCGCT GATGATCCGG 

1351 CCGTCTTGGG AAGACCGTCT 

14 01 GTTGAACGAA AACCCTGAAA 

14 51 AAGTATTGGA TATTTAA 



GGCGCGCCGC CGATGTCCGT CCCGCACCTC 
CAAACGCCAC CTTATGTTCG GCCCTTACGC 
TCAAGCAAGG CTCGCTTATG GAT7TGCCGC 
CTCTATCCTA TGCTGTGCGC CGGCTGGGCG 
CCTGCTGGGC GAATTGCGTA AAACCAAAGA 
TGGAATACTA CCCCGAGGCA AACCCCGACG 
GGGCAACGCG TCCAAATCAT 7AAAAAAGAC 
CCAGTTTGGT ACGGAGATTG TCGCCCACGC 
TGCTGGGCGC GTCGCCGGGC GCATCGACCG 
CTGATGCACC AATGCTTCCC CGAGCGCGCC 
GAAAGAGCTG GTACCGGGTT ACGGCATCAA 
GGGCGGATGA AATTATCGCC TATACCGCGA 



This corresponds to the amino acid sequence <SEQ ID 694; ORF 165- 1>: 

ml 65-1 .pep 

1 MAEATD VVLV GGGIMSATLG VLL KELEPSW EITLIERLED VALESSNAWN 

51 NAGTGHSALC ELNYAPLGAN GIIDPARALN IAEQFHVSRQ FWATLVAEGK 

101 LEDNSFINAV PHMSLVMNED HCSYLQKRYD AFKTQKLFEN MEFSTDRNKI 

151 SDWAPLMMRG RDENQPVAAN YSAEGTDVDF GRLTRQMVKY LQGKGVKTEF 

201 NRHVEDIKRE SDGAWVLKTA DTRNPDGQLT LRTRFLFLGA GGGALTLLQK 

251 SGIPEGKGYG GFPVSGLFFR NSNPETAEQH NAKVYGQASV GAPPMSVPHL 

301 DTRNVDGKRH LMFGPYAGFR SNFLKQGSLM DLPLSIh.-IDN LYPMLCAGVfA 

351 NMPLTKYLLG ELRKTKEERF ASLLEYYPEA NPDDWELITA GQRVQIIKKD 

401 SEKGGVLCFG TEIVAHADGS LAALLGASPG ASTAVPLMIR LMP * ~~?£RA 

4 51 PSWEDRLKEL VPGYGIKLNE NPERADEII A YTAKVLDI * 



ml€5-l/gl65-l 89.7% identity in 428 aa overlap 



10 20 30 40 50 60 

ml65-l .pep MAEATDVVLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 
I I I I I I I ! I I t I I I I I II I I I ! I I I I i I I I I I I I I I I I I I I ! ! ! I M ! I I I I M I ! I ! I 1 
g 165-1 MAEATDVVLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALC 

10 20 30 40 50 60 



70 B0 90 100 110 120 

ml65-l .pep ELNYAPLGANGIIDPARALNIAEQFHVSRQFVATLVAEGKLEDNSFINAVPHMSLVMNED 
( I I I I ! f I I : I : I : I I I I ( I U f I I I I I I t I ! I I I ! I [ ! i ! I I I I II ! I I I 1 I I i t I ( I I 
g 165-1 ELNYAPLGADGVINPARALNIAEOFHVSRQFWATLVAEGKLEDNSFINAVPHMSLVMNED 

70 80 90 100 110 120 



130 140 150 160 170 180 

ml 65-1 .pep HCSYLQKRYDAFKTQKLFENMEFSTDRNK I 5DWAPLMMRGRDENQPVAAN YSAEGTDVDF 
I I II i I I I I : I I I I I M I I i I I i M I I I II i i I 1 i : I i M I i I i I M I S I 1 M I I I i I I 
g 165-1 HCRYLQKRYDVFKTQKLFENMEFSTDRNKISDWAPLIMRGRDENOPVAAN YSAEGTDVDF 

130 140 150 160 170 180 



190 200 210 220 230 240 

B1165-1 .pep GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 
I 1 1 I I M t i I I 11 I I t I i 1 ) 1 I I )! 1 i 1 I 1 1 M t M 1 ! 1 I I i I 1 i t I I II M t 1 ) 1 M I 
gl65-l GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDWQLTLRTRFLFLGA 

190 200 210 220 230 240 



250 260 270 280 290 300 

ml 65-1. pep GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
M I I I I I I I M I 1 I ! I ! M M : ! I II I ! I i I I II I I f I i If I I I I I I I I I I I I i I 1 i I ! I 
g!65-l GGGALTLLQKSGIPEGKGYGGLPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 

250 260 270 280 290 300 



310 320 330 340 350 360 

ml 65-1 .pep DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLCAGWANMPLTKYLLG 
I I I I I II i I I I I I I I I I I I M I If I I I I : I I I II I || I I || I [ I I | I 1 | | I I || I I | I I 
g!65-l DTRNVDGKRHLMFGPYAGFRSNFLKQGSFMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 

310 320 330 340 350 360 

370 380 390 400 410 420 

ml 65-1 .pep ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQIIKKDSEKGGVLQFGTEIVAHADGS 

I I I I ! I I ! I I I I i t I III : : III I : I I ill Ml: : | : 

gl6S-l ELRKTKEERFASLLEYYPR-QTRRLVLITQXTR-HIIXYDS-KLRVLQLYEIVPRDARSR 

370 380 390 400 410 

430 440 450 460 470 480 
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a 1 65 - 1 . pep GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 
I S I 1 I i I I I I I I I I I I I I I I I i I I I t I I i I I I II I I i I t II I II I I I i I I I ! I I I I ! I 1 1 
ml 65-1 GRLTRQMVKYLQGKGVKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTLRTRFLFLGA 
190 200 210 220 230 240 

250 260 270 230 290 300 

aI65-l .pep GGGALTLLOKSGIPEGKGYGGFPVSGLFF RNSNPETAEQHNAKVYGQASVGAPPMSVPHL 
I ! I M i I 1 1 I I I 1 I t ) I I ) 11 I ) 1 ) i I ! I I t I I M I I I I I I i i i M i ! I I I I I I ! I I I I I 
ml 65-1 GGGALTLLQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVFHL 

250 260 270 280 290 300 



310 320 330 340 350 360 

a 1 65 - 1 . pep DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLS IKMDNL YPMLRAGWANMPLTKYLLG 
f I I I I I! I I I I M t I II I I i I I 1 I I I I J I I I I i ! ! I I f d I I j I I I II I ( I I I I ! I I ! I 
ml65-l DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPHIiCAGWANMPLTKYLLG 

310 320 330 340 350 360 



370 380 390 400 410 420 

al 65-1 .pep ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQIIKKDSEKGGVLQFGTEIVAHADGS 
I I i I I i 1 I II I I I i I I I I H ! I 1 i I i I I I I 1 I I M I I I I I I I I I I I I I I I I I I I i I I i I I 
m!65-l ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQIIKKDSEKGGVLQFGTEIVAHADGS 

370 380 390 400 410 420 



430 440 450 460 470 480 

al65-l .pep LAALLGASPGASTAVPLMIRLMHQCFPERTPSWEGRLKELVPGYGIKLNENPERADEIIA 
II 1 M ! I 1 ! t I I I I ] 1 I 1 I ! 11 i 1 I I I 1 1 : I I I I I I I [ I I i ! M I ! I i I ! I I I I I I I I I 
ml 65-1 LAALLGASPGASTAVPLMIRLMHQCFPERAPSWEDRLKELVPGYGIKLNENPERADEIIA 

430 440 45Q 460 470 480 



al65-l .pep 
ml 65-1 



489 

YTAKVLDIX 
I I I ! i I I I 1 
YTAKVLDIX 



al65-l/p33940 

Sp|P3394 0| YOJH_ECOLI HYPOTHETICAL 60.2 KD PROTEIN IN ECO-ALKB INTERGENIC REGION 
>gi I 1736851 Ignll PIDjdl016718 (D90850) ORF_ID:o37 2*t5; similar to [SwissProt Accession Number 
P33940] (Escherichia colij >gi| 1788539 (AE000310) f548; This 548 aa ORF is 100 pet identical 
to 490 residues of YOJH_ECOLI SW: P33940 (492 aa) but contains 56 additional N-ter aa; 100 pet 
identical to GB : ECOHU49_33 

ACCESSION: U00008 {490 aa) but contains 53 aditional N-term resi... Length = 548 
Score = 458 bits (1167), Expect = e-128 

Identities * 233/490 (47%), Positives = 303/490 (61%), Gaps = 5/490 (1%) 



Query: 3 EATDVVLVGGGIMSATLGVLLKELEPSWEITLIERLEDVALESSNAWNNAGTGHSALCEL 62 

+ TDV+L+GGGIMSATLG L+ELEP W +T++ERLE VA ESSN WNNAGTGHSAL EL 

Sbjct: 30 QETDVLLIGGGIMSATLGTYLRELEPEWSMTMVERLEGVAOESSNGV3NKAGTGH5ALMEL 8 9 

Query: 63 NYAPLGANGIIDPARALNIAEQFHVSRQFWATLVAEGKLED-NSFINAVPHMSLVMNEDH 121 

NY P A+G I +A+ I E F +SRQFWA V G L SFIN VPHMS V ED+ 

Sbjct: 90 NYTPQNADGSISIEKAVAINEAFQISRQFWAHQVERGVLRTPRSFINTVPHMSFVWGEDN 14 9 

Query: 122 CSYLQKRYDAFKTQKLFENMEFSTDRNKISDWAPLMMRGRDENQPVAANYSAEGTDVDFG 181 

++L+ RY A + LF M +S D +1 +WAPL+M GRD Q VAA + GTDV++G 

Sbjct: 150 VNFLRARYAALQQSSLFRGMRYSEDHAQIKEWAPLVMEGRDPQQKVAATRTEIGTDVNYG 209 

Query; 182 RLTRQMVKYLQGKG-VKTEFNRHVEDIKRESDGAWVLKTADTRNPDGQLTXXXXXXXXXX 24 0 

+TRQ++ LQ K + + V +KR D W + AD +N Q 

Sbjct: 210 EITRQLIASLQKKSNFSLQLSSEVRALKRNDDNTWTVTVADLKNGTAQ-NIRAKFVFIGA 268 

Query: 241 XXXXXXXXQKSGIPEGKGYGGFPVSGLFFRNSNPETAEQHNAKVYGQASVGAPPMSVPHL 300 

Q+SGIPE K Y GFPV G F + NP+ H AKVYG+ASVGAPPMSVPH+ 

Sbjct: 269 GGAALKLLQESGIPEAKDYAGFPVGGQFLVSENPDVVNHHLAKVYGKASVGAPPMSVPHI 328 

Query: 301 DTRNVDGKRHLMFGPYAGFRSNFLKQGSLMDLPLSIHMDNLYPMLRAGWANMPLTKYLLG 360 

DTR +DGKR ++FGP+A F + FLK GSL DL S N-J- PM+ G N L KYL + 

Sbjct: 329 DTRVLDGKRWLFGPFATFSTKFLKNGSLWDLMSSTTTSNVMPMMHVGLDNFDLVKYLVS 388 

Query: 361 ELRKTKEERFASLLEYYPEANPDDWELITAGQRVQIIKKDSEKGGVLQFGTEIVXXXXXX 420 

+ + + + E+RF +L EYYP+A L AGQRVQI IK+D+EKGGVL+ GTE+V 

Sbjct: 389 QVMLSEEDRFEALKEYYPQAKKEDWRLWQAGQRVQI I KRDAEKGGVLRLGTEVVS DQQGT 448 

Query: 421 XXXXXXXXXXXXXXVPLMIRLMHQCFPER— TPSWEGRLKELVPGYGIKLNENPERADEI 478 
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Homology with a predicted ORF from N. gonorrhoeae 

ORF 204 shows 82.0% identity over a 250 aa overlap with a predicted ORF (ORF 204.ng) 
from N. gonorrhoeae: 

m204/g204 

10 20 30 40 50 60 

MAAAE IKRP FAVD FQH I AS VLHGG I AA FACL I GLQGGMRNX V I RQ FAAV FGD I AHQ FGKQ 
I I I I I I I I I : I I I I I I I I I I i I I I I i I I I I 1 I I I I I I I I I II I I I I I i I I I I I I I I i I 
MAAAEIKRPLAVDFQHIASVLHGGIAAFACLIGLOGGMRNOVISQFAAVFGDIAHOFGKO 
10 20 30 40 50 60 

70 80 90 100 110 120 

GMAHAVFCPTCRTVLI I GFHTFAADGFQVAGMLADQSDNI LFRQAFNRITDLFFAWGFA 

I IMI I I h I II =111111 INIhllN I |::|| nun II II I II I I III 
GMAHAV FR P ARRRVLS VG FHT F ADDG FQ WGMLS GQ PDG VL F RQ AFNR I TD L F F AWG FA 
70 80 90 100 110 120 



m2 04 . pep 
9204 

m204 . pep 
g204 



130 140 150 160 170 180 

m204 .pep FAALSQ IQTGNRRI VD I YDFENRFRRALYRVLRLYRRI XGFAATAXQQAAAQYGKXXXQH 

11 = 111 i mm mm mum mmm mi : i mil: ii 11 

g2 04 FATLSQSQTGNRRIVDVFDFENRFRRALCRILRLFRRI FG F AAGGKQQ AAAQHG KR Y FQH 

130 140 150 160 170 180 



190 200 210 220 230 

m204 .pep STXLMVSKCRLK RGRRRFGRHRVHFNGRMPTASGTLSNNSRASLRAFAAPACKISS 

i= iii iii m iiiiiiiii imimm mmmmi mum 

g204 SALIjMVSKCRLKCRLKRGRRRFGRHWVYFNGRMPTASRTLSNNSRASLRAFCAPACKISS 
190 200 210 220 230 240 



240 

m2 0 4 . pep I CEGSAVSSLX 

mi mi 

g2 04 ICEGLEVNAL 
250 



The following partial DNA sequence was identified in N. meningitidis <SEQ ED 701 >: 

a204 .seq 

1 ATGGCGGCGG CGGAAATAAA ACGCCCCCTC GCTGTCGATT TCCAGCACAT 

51 AGCGTCCGTT CTGCACGGCG GCATAGCCGC TTTTGCCTGC CTGATAGGGT 

101 TGCAGGGCGG AATGCGAAAT CAGGTAATCC GTCAGTTTGC CGCCGTCTTC 

151 GGCGATATTG CCCACCAGTT TGGCAAACAA GGTATGGCAC ACGCCGTTTG 

201 CCGCCCAGCC CGAAGGCGCG CCCTTTCCGT CGGTTTCCAT ACATTTGCCG 

251 ACGACGGCTT CCAAGTCGTT GGGATGCTTG CCGGTCAGCC GGACGACGTT 

301 TTGTTCCGGC AAGCCTTT 

351 

401 

451 

501 

551 AAGAG GTTCGGACGG 

601 CATTGGGTTT ATTTCAACGG GCGGATACCG ACCGCATCAC GTACTTTGCC 

651 CAATAATTCG CGTGCTTCTT TACGCGCTTT TTGCGCGCCT GCCTGCAAAA 

701 TCTCTTCGAT TTGCGAAGGG TCGGCGGTCA GCTCGTTGTA G 



This corresponds to the amino acid sequence <SEQ ID 702; ORF 204.a>: 

a204 .pep 

1 MAAAEIKRPL avdfqh iasv LHGGIAAFAC lig lqggmrw qvirqfaavf 

51 GDIAHQFGKQ gmahavcrpa rrralI>.vgfh tfaddgfqw gmlagqpddv 

101 lfrqaf 

151 KRFGR 

201 HWVYFNGRIP TASRTLPNNS RASLRAr CAP ACKISSICEG SAVSSL* 
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This corresponds to the amino acid sequence <SEQ ID 706; ORF 205>: 

m2 05 .pep 

1 MLXTXFAVLG GCLLX CRCGK SXNTAVQPQN AVQSAPKPVF KVIYIDNTAI 

51 AGLDLGQSSE GKTNDGKKQI SYPIKGLPEQ NVIRLIGKKP GDLEAVSGKC 

101 METDDKDSPA GWAENGVCHT LFAKLVGNIA EDGGKLTDYL VSHAALQPYQ 

151 AGKSGYAAVQ NGRYVLEIDS EGAFYFRRRH Y* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 205 shows 88.4% identity over a 181 aa overlap with a predicted ORF (ORF 205. ng) 
from N. gonorrhoeae: 

m205/g205 



10 20 30 40 50 60 

m205 . pep MLXTX FAVLGGCLLXCRCGKSXNT AVQPQNAVQS AP KPVFKV I Y I DNTAI AGLDLGQSSE 

ii iiiiiiiii mi in iiiihiiiiiiiiM minim mill 

g205 MLKIPFAVLGGCLLLAACGKSENTAEQPQNAAQSAPKPVFKVKYIDNTAIAGLALGQSSE 
10 20 30 40 50 60 



70 80 90 100 110 120 

m2 05 . oep GKTNDGKKQISYPIKGLPEQKVIRLIGKHPGDLEAVSGKCMETDDKDSPAGWAENGVCHT 

immmmmimmii mi mm mini immimim 

g2 05 GKTNDGKKOISYPIKGLPEQNAVRLTGKHPNDLEAWGKCMETDGKDAPSGWAENGVCHT 
70 80 90 100 110 120 



130 140 150 160 170 180 

m205 . pep LFAICLVGNIAEDGGKLTDYLVSHAALQPYQAGKSGYAAVQNGRYVLEIDSEGAFYFRRRH 

iiiMiiiMiiiiiiiMhiimimmiimiiimiiiiiiimiimi 

g205 LFAKLVGNIAEDGGKLTDYLISHSALQPYQAGKSGYA^VQNGRYVLEIDSEGAFYFRRRH 
130 140 150 16C 170 180 



m205.pep YX 

I 

g205 Y 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 707>: 

a205.seq (partial) 

1 TCCGAACCTC TTAAAGGCTT GCCGGAACAA AACGTCGTCC GGCTGACCGG 

51 CAAGCATCCC AACGACTTGG AAGCCGTCGT CGGCAAATGT ATGGAAACCG 

101 ACGGAAAGGG CGCGCCTTCG GGCTGGGCGG CAAACGGCGT GTGCCATACC 

151 TTGTTTGCCA AACTGGTGGG CAATATCGCC GAAGACGGCG GCAAACTGAC 

201 GGATTACCTG ATTTCGCATT CCGCCCTGCA ACCCTATCAG GCAGGCAAAA 

2 51 GCGGCTATGC CGCCGTGCAG AACGGACGCT ATGTGCTGGA AATCGACAGC 

301 GAGGGGGCGT TTTATTTCCG CCGCCGCCAT TATTGA 



This corresponds to the amino acid sequence <SEQ ID 708; ORF 205.a>: 

a205.pep (partial) 

1 SEPLKGLPEQ NVVRLTGKHP NDLEAVVGKC METDGKGAPS GWAANGVCHT 

51 LFAKLVGNIA EDGGKLTDYL ISHSALQPYQ AGKSGYAAVQ NGRYVLEIDS 

101 EGAFYFRRRH Y* 



m205/a205 88.3% identity in 1 1 1 aa overlap 

50 60 70 80 90 100 

m205.pep KVI YI DNTAI AGLDLGQSSEGKTNDGKKQISYPIKGLPEQNVIRLIGKHPGDLEAVSGKC 

I 1:11111)11:11 1111:11111 11 I 
a205 SEPLKGLPEQNWRLTGKHPNDLEAWGKC 

10 20 30 

110 120 130 140 150 160 

m205 . pep METDDKDSPAGWAENGVCHTLFAKLVGNIAEDGGKLTDYLVSHAALQPYQAGKSGYAAVQ 
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70 80 90 100 110 120 

130 140 150 160 170 

g205-l .pep LF AKLVGN I AEDGG KLTD Y L I S H S ALQ P YQAG KSG YAA VQNGR YVLE I D S EG A F 

Mltli!ltlitll!!!ilt:ihlli!llltl!lljttlt;iltMtM!il! 

m20 5 - 1 LFAKLVGNI AEDGG KLTD YLVSHAALQPYQAGKSGYAA VQNGR YVLEIDSEGAFYFRRRH 

130 140 150 160 170 180 

m205-l YX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 71 3^ 

a205-l.seq (partial) 

1 CCTCTTAAAG GCTTGCCGGA ACAAAACGTC GTCCGGCTGA CCGGCAAGCA 

51 TCCCAACGAC TTGGAAGCCG TCGTCGGCAA ATGTATGGAA ACCGACGGAA 

101 AGGGCGCGCC TTCGGGCTGG GCGGCAAACG GCGTGTGCCA TACCTTGTTT 

151 GCCAAACTGG TGGGCAATAT CGCCGAAGAC GGCGGCAAAC TGACGGATTA 

201 CCTGATTTCG CATTCCGCCC TGCAACCCTA TCAGGCAGGC AAAAGCGGCT 

251 ATGCCGCCGT GCAGAACGGA CGCTATGTGC TGGAAATCGA CAGCGAGGGG 

3 01 GCG TTTTATT TCCGCCGCCG CCATTATTGA 

This corresponds to the amino acid sequence <SEQ ID 714; ORF 205-1. a>: 

a205-l.pep (partial) 

1 PLKGLPEQNV VRLTGKHPND LEAWGKCME TDGKGAPSGW AANGVCHTLF 
51 AKLVGNIAED GGKLTDYLIS HSALQPYQAG KSGYAAVQNG RYVLEIDSEG 
101 AFYFRRRHY* 

m205-l/a205-l 89.0% identity in 109 aa overlap 

50 €0 70 80 90 100 

m205-l . pep KYIDNTAIAGLDLGQSSEGKTNDGKKQISYPIKGLPEQNVIRLIGKHPGDLEAVSGKCME 

MMiliiHI illplMII Kill 

a205-l PLKGLPEQNWRLTGKHPND LEAWGKCME 

10 20 30 

110 120 130 140 150 160 

m2 0 5 - 1 . pep TDDKDS PAG WAENG VCHT L FAKL VG N I AEDGGKLTD YL VS HAALQ P YQAG K SG YAA VQNG 

I I I :hlM MlillllllllllillllllllllhlMlllllflllMllllil 
a2G5 - 1 TDGKGAPSGWAANGVCHTLFAKLVGNIAEDGGKLTDYLISHSALQPYQAGKSGYAAVQNG 
40 50 60 70 80 90 

170 180 
tn205-l.pep RYVLEIDSEGAFYFRRRHYX 

liimiimmmti! 

a205-l RYVLEIDSEGAFYFRRRHYX 
100 110 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 71 5>: 

g206 -seq 

1 atgttttccc ccgacaaaac ccttttcctc tgtctcggcg cactgctcct 

51 cgcctcatgc ggcacgacct ccggcaaaca ccgccaaccg aaacccaaac 

101 agacagtccg gcaaatccaa gccgtccgca tcagccacat cggccgcaca 

151 caaggctcgc aggaactcat gctccacagc ctcggactca ccggcacgcc 

201 ctacaaatgg ggcggcagca gcaccgcaac cggcttcgac tgcagcggca 

251 tgattcaatt ggtttacaaa aacgccctca acgtcaagct gccgcgcacc 

3 01 gcccgcgaca tggcggcggc aagccgcaaa atccccgaca gccgcctcaa 

3 51 ggccggcgac atcgtattct tcaacaccgg cggcgcacac cgctactcac 

4 01 acgtcggact ctacatcggc aacggcgaat tcatccatgc ccccggcagc 
4 51 ggcaaaacca tcaaaaccga aaaactctcc acaccgtttc acgccaaaaa 
501 ctaccttgga gcgcacacgt ttcttacaga atga 

This corresponds to the amino acid sequence <SEQ ID 716; ORF 206. ng>: 

g206 .pep 

1 MFSPDKTLFL CLGAXLLASC GTTSG KHRQP KPKQTVRQIO AVRISKIGRT 
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This corresponds to the amino acid sequence <SEQ ID 720; ORF 206.a>: 

a206. pep 

1 MFPPDK TLFL CL5ALLLASC GTT SGKHRQP KPKQTVRQIQ AVRISHIDRT 

51 QGSQELMLHS LGLIGTPYKW GGSSTATGFD CSGMIQFVYK NALNVKLPRT 

101 ARDMAAASRK I PDSRLKAGD LVFFNTGGAH RYSHVGLYIG NGEFIHAPSS 

151 GKTIKTEKLS TPFYAKNYLG AHTFFTE* 

m206/a206 99.4% identity in 1 77 aa overlap 

10 20 30 40 50 60 

m206 . pep MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVRQIQAVRISHIDRTQGSQELMLHS 
I I 1 I I t I I I M 1 I I { I t 1 I I I I I I i i I ( I ! ! i ! M I I I ( I I I M I I I I t I I i i I I 1 I I ( ! 
a20 6 MFPPDKTLFLCLSALLLASCGTTSGKHRQPKPKQTVKQIQAVRISHIDRTQGSQELMLHS 

10 20 30 40 50 60 

70 80 90 100 110 120 

m206.pep LGLIGTPYKWGGSSTATGFl^^5GMIQFVYKNALNVKLPRTARDMAAASRKI PDSRXKAGD 

I I I I I I I II I I i I I I I I I I I t I I I I i I I I I I I I I ! I I I I I I I I I I M I f j I i I I I till 
a206 LGLIGTPYKWGGSSTATGFDCSGMIQFVYKNALNVKLPRTAKDMAAASRKIPDSRLKAGD 
70 80 90 100 110 120 

130 140 150 160 170 

m20 6.pep LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 
I I I I I I I I I I I I I I I I ! II I I I I I I I ! I I I I I I I II ! J I I I I I! I II I I ( I I I I I I I I 
a206 LVFFNTGGAHRYSHVGLYIGNGEFIHAPSSGKTIKTEKLSTPFYAKNYLGAHTFFTEX 

130 140 150 160 170 



The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 72 1>: 

q209 . seq 

1 atgctgcggc atttagge^s cgrctt~~r.- ttgggcgcgt tgtttttcg- 

51 tgctgcggtt gatgtgccac tgctgggcga tggtcaggag gtrgttgacc 

101 acccagtaga gaaccaaacc ggcagggaag aagaagaaca tgacggagaa 

151 aaccaacggc atgattttca tcattttcgc ctgcatcggg tcggtcggcg 

201 gcgggttcag ataggtttgg gcgaacatcg ttgccgccat aatgatgggc 

251 aggatgtagt aggggtcggc gcggctgagg tcggtaatcc agcccagcca 

301 aggtgcctgg cgcaattcta cggaggcgaa caatgcccag tacaagccga 

351 tgaagacggg gatttgcaac agcataggca gacagccgcc cagcgggttg 

401 atttcctcgt cttcgaaaag ctgcarcatc gcttgctgtt gcgccatacg 

4 51 gtcgtcgccg tatttttctt tgatggtctg cagttcgggt gcggcggcac 

501 gcattttcgc catcgaacgg taggaggcgt tggtcaatgg atacagtacg 

551 gctttgacga tgatggtcaa aacgacgatt gcccagcccc agttgccgat 

601 aatgttgtgc agttggttca ggagccagaa gagcggcgat gcgaaccagt 

651 gtactttacc gtagtctttt gccagttgca ggttgtcggc gatgtttgcg 

701 ataacggatg tggtttgcgg accggcatac aggttgaccg ccattttcgg 

7 51 ttttggcccc cgggttggga tagcggttaa 

This corresponds to the amino acid sequence <SEQ ID 722; ORF 209.ng>: 

g209 . pep 

1 MLRHLGNDFA LGALFFDAAV DVPLLGDGQE WDH PVENQT GREEEEHDGE 

51 NQRHDFHHFR LHRVGRRRVQ IGLGEHRCRH NDGQDVVGVG AAEVGWPAQP 

101 RCLAQFYGGE QCPVQADEDG DLQQHRQTAA QRVDFLVFEK LHHRLLLRHT 

151 VVAVFFFDGL QFGCGGTHFR HRTVGGVGQW IQYGFDDDGQ NDDCPAPVAD 

2 01 NWQLVQEPE ERRCEPVYFT WFCQLQWG DVCDMGCGLR TGIQVDRHFR 

251 FWPPGWDSG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 723>; 

m209. seq 

1 ATGCTGCGGC ATTTAGGAAA CGACTTCGCC TTGGgGCGTT GTTTTTCGAT 

51 GCTGCGGTTG ATGTGCCATT GCTGGGCGAT GGTCAGGAGG TTGTTGACTA 

101 CCCAGTACAA TACCAGACCG GCAGGGAAGA AGAAGAACAT GACGGAGAAA 

151 ACCAACGGCA TGATTTTCAT CATTTTCGCC TGCATCGGGT CGGTCGGCGG 

201 CGGGTTCAGA TAAGTTTGGG CGAACATCGT TGCCGCCATA ATGATGGGCA 

251 GGATGTAGTA GGGGTCGGCG CGGCTGAGGT CGGTAATCCA ACCCAGCCAA 



WO 99/57280 



PCT/US99/09346 



485 



51 TGCTGCGGTT GATGTGCCAT TGCTGGGCGA TGGTCACGAG GTTGTTGATC 

101 ACCCAGTACA ATACCAGACC GGCAGGGAAG AAGAAGAACA TGACGGAGAA 

151 AACCAAAGGC ATGATTTTCA TCATTTTCGC CTGCATCGGG TCGGTCGGCG 

201 GCGGGTTCAG ATAGGTTTGG GCGAACATCG TTGCCGCCAT AATGATGGGC 

251 AGGATGTAGT AGGGGTCGGC GCGGCTGAGG TCGGTAATCC AACCCAGCCA 

301 AGGTGCCTGG CGCAATTCTA CGGAGGCGAA CAATGCCCAA TACAATCCGA 

351 TGAAGACGGG GATTTGCAAC AGCATAGGCA GGC^GCCGCC CAGCGGGTTG 

4 01 ATTTTCTCGT CTGTGTAAAG CTGCATCATG GCTTGTTGCT GCGCCATACG 

4 51 GTCGTCGCCG TATTTCTCTT TGATGGCTTG CAGTTTGGGC GCGGCGGCAC 

501 GCATTTTCGC CATCGAACGG TAAGAGGCGT TGGTCAATGG ATACAGTACG 

551 GCTTTGACGA TGATGGTTAA AACGATAATC GCCCAGCCCC AGTTGCCGAT 

601 GATGTTGTGC AGTTGGTTCA AAAGCCAAAA GAGGGGGGAG GCGAACCAGT 

651 GTACTTTGCC GTAGTCTTTG GCCAGTTGCA GGTTGTCGGC GATGTTTGCG 

701 ATAACGGATG TGGTCTGTGG GCCGGCGTAG AGGTTGATGG AGGCTTCGGT 

7 51 TTCGCACCGT TTTGGATAGC GGCTAAAGGC ACGCTGACGC TGGTGCTGTA 

801 CAGCTTGTCG TTGCGGCGTT TGATGTCGAT ACGGCAGTCG CCAGCGGCGC 

851 AAACGCTTTG TCCGCCTTTC GGTTGGAGGA TCCAGGTGGA CATGAAGTGG 

901 TGTTCAATCA TGCCGAGCCA GCCGGTCGGG GTTTTGCGGA TGTATTCGGC 

951 CTCGGATTTG CCGGATTTGG CATCGTCGTC CAAGTCGGAG AAGCTGACTT 

1001 TTTGGAAGTT GCCTTCAGGG GTATAA 

This corresponds to the amino acid sequence <SEQ ID 726; ORF 209. a>: 

a209.pep 

1 MLRHLGND Fa LGALFFDAAV DVPLL GDGQE VVDHPVQYQT GREESEHDGE 

51 NQRHDFHHFR LHRVGRRRVQ IGLGEHRCRH NDGQDWGVG AAEVGNPTQP 

101 RCLAQFYGGE QCPIQSDEDG DLQQHRQAAA QRVDFLVCVK LHHGLLLRHT 

151 VVAVFLFDGL QFGRGGTHFR HRTVRGVGQW IQYGFDDDG*" NDNRFAPVAD 

201 DVVQLVQKPK EGGGEPVYFA WFGQLQWG DVCDNGCGLW AGVEVDGGFG 

251 FAPFWIAAKG TLTLVLYSLS LRRLMSIRQS PAAQTLCPPL GWRIQVDMKW 

301 CSIMPSQPVG VLRMYSASDL PDLASSSKSE KLTFWKLPSG V* 

m209/a209 95.6% identity in 341 aa overlap 

10 20 30 40 50 60 

m20 9 . pep MLRHLGNDFALGALFFDAAVDVPLLGDGQEWDYPVQYQTGREEEEHDGENQRHDFHKFR 
[ I I II I [ I I I I I 1 I I i I I I II ! i ! It 1 i 1 I I i i : I I t I M I I I I It I I M II M I 1 I i t i 
a209 MLRHLGNDFALGALFFDAAVDVPLLGDGQEWDHPVQYQTGREEEEHDGENQRHDFHHFR 

10 20 30 40 50 60 

70 80 90 100 110 120 

m20 9.pep LHRVGRRRVQISLGEHRCRHNDGQDVVGVGAAEVGNPTQPRCLAQFYGGEQCPIQSDEDG 
I I I I I I I \\ I i : I I I I M I I I I II I t 1 I 11 I I i II I I I i ! ( I I I ( I I I t ! I I i i I I 1 1 I I 
a209 LHRVGRRRVQIGLGEHRCRHNDGQDWGVGAAEVGNPTQPRCLAQFYGGEQCPIQSDEDG 

70 80 90 100 110 120 

130 140 150 160 170 180 

m209 . pep DLQQHRQAAAQRVDFLVCVKLHHRLLLRHTVVAVFLFDGLQFGCGGTHFRHRAVRGVGQW 
I I I I I I I f I I I f I I I I M i I I I I I II I I I I I I I If I I I I I I I I I I i I I I I : I I I I i I I 
a209 DLQQHRQAAAQRVDFLVCVKLHHGLLLRHTVVAVFLFDGLQFGRGGTHFRHRTVRGVGQW 

130 140 150 160 170 180 

190 200 210 220 230 240 

m209.pep IQYGFDDDGXNDNRPAPVADDWQLVQE PEERGGEPVYFAVVFGQLQWGDVCDDGCGLR 

I I If I i I I I II I I M f M I I II I I I I I : I : I I I I I I I I If I I I M I I I I I I I I : I I I I 
a209 IQYGFDDDGXNDNRPAPVADDWQLVQKPKEGGGEPVYFAWFGQLQVVGDVCDNGCGLW 

190 200 210 220 230 240 

250 260 270 280 290 300 

m2 0 9 . pep AGVEVDGG FG FAPFWMAAKGTLTLVLY SLS LRR LMSMLK S PAAQT LCLP LGWRIQV DMKW 

M I I I I I I ! I I ! It i : I 1 I I I I I II II II 1 I i I I ! 1 : : I II 1! 1 II I I I I I II I II I I 
a209 AGVEVDGGFGFAPFWIAAKGTLTLVLYSLSLRRLMSIRQSPAAQTLCPPLGWRIQVDMKW 

250 260 270 280 290 300 



m209.pep 



310 320 330 340 

CSIMPSQPVGVLRMYSASDLPDLASSSKSEKLTFWKLPSGVX 
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I i I h I i I i i I 1 I 1 I 1 I I I I I I I I I ! I I 1 i I I i I I M - I i I = I ! ! ! 1 I I I I : : I I I : I 
g2 1 1 AFWLQREVTFFGEDDWAAVFVLFAQEQVFHDAG FGVKGFDEINPAVALAQVIELAGFH 

70 80 90 100 110 120 



m2 1 1 . pep 
g211 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 73 1 >: 

a21 1 . seq 

1 ATGTTGCGGG TTGCTGCTGC CAATCAGTTG GGCGGTCGGA ATGGTACGGC 

51 GGTGGGAAAC GGGGTCGATG AGTTTGGGCG TGGTGCTGAT AATCAGGTTG 

101 AGTTTTTGGA AGGAAACCTG ATTGTAGTCG GCGCGTCCGG GCGTGCCGCT 

151 GTAACGGTAG CCGTGGCGCA ATTCGAGCGT GCGTTTGTTG TCGTTCAGCG 

201 AGAAGTTACT TTCTTTGGCG AAGATGATGT TGTCGCCGCC GTTTTTGTCC 

2 51 TGTTCGCGCA GGAACAGGTT TTTCATGATG CCGGATTCGG TATCGAAGGT 

301 TTCGACAAAA TAAACCCTGC CGTTGCGCTT GCCCAAACTG TTGAACCCGC 

351 CTGCCTCCAC CAAAGACAAT TCCTGCTTCT GCTTCAGGAT TTCAGCGTAT 

401 TCGCGGCTGC GTAGCTCTGC CCACGGTATC ACCCAAAGCT GCATGACGGC 

4 51 AACCAAAACG GCAAACGGCA CGGCAAACTG CATCACCGGG CGTATCCATT 

501 GTTTCAATGC CAATCCGCAG GAT AG 

This corresponds to the amino acid sequence <SEQ ID 732; ORF 21 1 .a>: 

a211 .pep 

1 MLRVAAANQL GGRNGTAVGN GVDE FGRGAD NQVEFLEGNL IWGASGRAA 

51 VTVAVA QFER AFVWQRE VT FFGEDDWAA VFVLF AQEQV FHDAGFGIEG 

101 FDKINPAVAL AQTVEPACLH QRQ FLLLLQD FSVFAAA*LC PRYHPKLHDG 

151 NQNGKRHGKL KHRAYPLFQC QSAG* 

m21 l/a21 1 99.4% identity in 1 74 aa overlap 

10 20 30 40 50 60 

m2 1 1 . pep MLRVAAANQLGGRNGTAVGNGVDEFGRGADNQVEFLEGNLIVVGASGRAAVTVAVAQFER 
1 I I i I I I I I I M I I f I I I I I t M M I I I II I I I ! i I ! I I II II I ! I I I I I ! ! I I I I I I t ! 
a21i MLRVAAANQLGGRNGTAVGNGVDEFGRGADNQVEFLEGNLIWGASGRAAVTVAVAQFER 

10 20 30 40 50 60 

70 80 90 100 110 120 

m211 . pep AFVVVQREVTFFGEDDVVAAVFVLFAQEQVFHDAGFGIEGFDKINPAVALAQTVELACLH 
I I I I I I I I I I I I I I I M I II I I I I I I i I I I I I I I I I I I I II I I I M I I I I I I I I I MM 
a211 AFWVQREVTFFGEDDWAAVFVLFAQEQVFHDAGFGIEGFDKINPAVALAQTVEPACLH 

70 80 90 100 110 120 

130 140 150 160 170 

m2 1 1 . pep QRQFLLLLQDFSVFAAAXLCPRYHPKLHDGNQNGKRHGKLHHRAYPLFQCQSAGX 
I I I M I I M I M M M II I I II M I I I I I M M II I M I II I I I I M M M 11 M 
a2U QRQFLLLLQDFSVFAAAXLCPRYHPKLHDGNQNGKRHGKLHHRAYPLFQCQSAGX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 733>: 

g212.seq (partial) 

1 atggacaatc tcgtatggga cggcattccc gacatccgca cactcgacca 

51 aaccatccgc aaacacgcac acccgctcaa cctgaccgtc tgcctccccg 

101 ataatcagat tcccgatttt caaaccgcac aagatgcttc ggactcggaa 

151 tgccgtctga agcaccgttt ggatcaggca acccagtgcc tccagttcga 

201 cagcatcaac ctcatcgaac acatcctgcc cgatgtccgc ttctggctgg 

251 ttcccccttc acgcacccgc cgcctgcacg aacacttcca ccacatttcc 

301 tggcagaccg aagccatccc gcaaaccgaa agcaagtccg acaaaccctg 

351 gtttgcactt ccacaaacat ccgaacggaa aaaaccggaa cacgtcctcg 



130 140 150 160 170 

QRQFLLLLODFSVFAAAXLCPRYHPKLHDGNQNGK^HGKLHHRAYPLFQCOSAGX 

1 1 1 1 1 1 1 1 1 ■ I - 1 1 1 1 1 1 1 1 1 1 1 i I ! 1 1 1 M I i I ! 1 1 1 i 1 1 1 1 ! ! 1 1 M 

QRQFLLLLQNFGIFAAAQLCPRYKPKLHDGNODGKRHGKLKDGAYPLFQRQSAG 
130 140 150 160 170 
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1501 GmCAwTGCAG CCCAAATCsT AGGCyTGCCC CATCCCTTTT yAcAAcGCCT 
1551 gCGCCACGCC cTAcACCCCA ACCGCACCAT CATCCGCGCC ATCGTCAGAA 
16 01 GGAAGGATCT AACCCCTTAA 

This corresponds to the amino acid sequence <SEQ ID 736; ORF 2I2>: 

m212 .pep 

1 MDNLWDGIP DIRTLDQAIR KHAPPLNLI I CLPDNQIPDF QTAQDASDAE 

51 CRLKHRLDQA MQCLQFDSIN LIEHILPDVR FWLVPPSRTH HLHEHFHHIS 

101 WQTEAIPQTE SKPDKPWFAL PQTSERQKPE H ILVIGAGIS GAATAHAL AS 

151 HGISVTVLEA RKAAQAASGN RQGLLYAKIS PHDTEQTELL LAGYGYTKRL 

201 LGKILPESET WGGNGI IHLN YSRTEQQRNH ELGLQKHHNH LYRSITSAEA 

251 EKIAGIPLSV PYDHPSCGLY WQHGVWLNPP AFVRTLLNHP LIGLHEDTPL 

3 01 TDISHDGEKW IASTPNGTFT ATHIIYCTGA NSPYLPETNL AALPLRQIRG 

3 51 QTGLTPSTPF SEQLRCAVSG ESYISPSWHG LHCYGASFIP NSSHTGWNEA 
401 EEASNTRQALA HLNPALSESL FAANPNPQKH QGKAAIRCDS PDHLPLVGAL 

4 51 GDIAAMRQTY TKLALDKNYR IDTPCPYLPN AYVNTAHGTR GLATAPICAA 
501 XXAAQIXGLP HPFXQRLRHA LHPNRTI IRA IVRRKDLTP* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 212 shows 92.9% identity over a 421 aa overlap with a predicted ORF (ORF 212. ng) 
from N. gonorrhoeae: 

m212/g212 

10 20 30 40 50 60 

MDNLVWDGI PD I RTLDQ A I RKHAP PLNL 1 1 CLPDNQI PDFQTAQDAS DAE CRLKHRLDQA 

iiimiiimiMiMiiii iiMhMiiiimiiimuMimiHiii 

MDNLWDGIPDIRTLDQTIRKHAHPLNLIVCLPDNQI PDFOTAQDASDSECRLKKRLDQA 
10 20 30 40 50 60 

70 80 90 100 110 120 

MQCLQFDS I NL I EH I LPDVRFWLVPPSRTHHLHEHFHHI SWQTEAI PQTES KPDKPWFAL 

IIIIIIIIIMIIIIIMIMIlllMhMMIIIIMIIIIMIIIIM lllllll 

TQCLQFDSINLIEHILPDVRFWLVPPSRTRRLKEHFHHISWQTEAIPQTESKSDKPWFAL 
70 80 90 100 110 120 

130 140 150 160 170 180 

PQTSERQKPEH I LVI GAGI S GAATAHALAS HG I S VTVLEARKAAQAASGNRQGLL YAKI S 

l!!(lhli!h!!llll|:||:!!lll!!lllllll[IMIIMII!ll(lllll!lll 

PQTSERKKPEHVLVIGAG I AGASTAHALASHG I S VTVLEARKAAQAASGNRQGLL YAKI S 
130 140 150 160 170 180 

190 200 210 220 230 240 

PHDTEQTELLLAGYGYTKRLLGHILPESETWGGNGIIHLNYSRTEQQRNHELGLQKHHNH 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 •!-- 1 1 ! I II M 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 ! I M 1 1 1 

PHDTGQTELLLAGYGYTKRLLGHILPDSDTWGGNGIIHLNYSRTEQQRNHELGLQKHHNH 
190 200 210 220 230 240 

250 260 270 280 290 300 

LYRS ITSAEAEKIAGI PLSVPYDHPSCGLYWQHGVWLNPPAFVRTLLNHPLIGLKEDTPL 

i iii in ii ii ii ii 1 1 m m m mm mm mi m mi mm imi i 

LYRS ITSAEAEKIAGI PLNTPYAEPLCGLYWQHGVWLNPPAFVRTLLSHPLIELYENTTL 
250 260 270 280 290 300 

310 320 330 340 350 360 

TDISHDGEKW I ASTPNGTFTATHI I YCTGANSPYLPETNLAALPLRQIRGQTGLTPSTPF 

I 1 1 1 1 1 1 1 M 1 1 M 1 1 1 ! 1 1 1 II 1 1 ) 1 1 h 1 1 i 1 1 1 1 II 1 1 i 1 1 ) M 1 1 1 1 1 li 1 1 1 1 

TGISHDGEKWI ASTPNGTFTATHI I YCTGAHSPCLPETNLAALPLRQIRGQTGLTPSTPF 
310 320 330 340 350 360 

370 380 390 400 410 420 

SEQLRCAVSGESYISPSWHGLHCYGASFIPNSSHTGWKr.^EEASNRQALAKLNPALSESL 



m212 .pep 
g212 

m212 .pep 
g212 

m212 .pep 
g212 

m212 .pep 
g212 

m212 .pep 
g212 

m212 .pep 
g212 

m212 .pep 
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10 20 30 AO 50 60 

70 80 90 100 110 120 

m212 .pep MQCLQFDSINLIEKILPDVRFWLVPPSRTHHLHEHFHHISWQTEAIPQTESKPDKPWFAL 
I i I I I M I 1 II ! I t I i I I I I M I I I II I : : M i i I I I M M I I I I I t I I 11 i I I I M ! I 
a212 TQCLQFDSINLIEHILPDVRFWLVPPSRTRRLHEKFHHISWQTEAIPOTESKPDKPWFAL 

70 80 90 100 110 120 

130 140 150 160 170 180 

m212 .pep PQTSERQKPEHILVIGAGISGAATAHALASHGISVTVLEARKAAOAASGNRQGLLVAKIS 
I I I I I I I M I I 1 I I N I I t I I M I ! ! I 1 I I : I I I I I I I ! t I I I I I ! I 1 i i ! I I i I I I ! M 
3212 PQTSERQKPEHILVIGAGISGAATAHALASYGISVTVLEARKAAOAASGNROGLLYAKIS 

130 140 150 160 170 180 

190 200 210 220 230 240 

m212 .pep PHDTEQTELLLAGYGYTKRLLGHILPESETWGGNGIIHLNYSRTEQQRNKELGLQKHHNH 
I I I I I I I I I I I M I I I I I I I i i I I I I I I I ! I I I I i I 1 I I I I I I II I I I H I I I ! I I I I I I 
a2 1 2 PHDTEQTELLLAGYGYTKRLLGH I LPESETWGGNG 1 1 HLNYSRTEQQRNHELGLQKKHNH 

190 200 210 220 230 240 

250 260 270 280 290 300 

m212 .pep LYRSITSAEAEKIAGIPLSVPYDHPSCGLYWQHGVWLNPPAFVRTLLNHPLIGLHEDTPL 
I 1 I M I : I I I I I I M ' 1 , : : ! I : i I ) I : ! I : I I I I 1 I I : I I i : I I : j I I 1 I II 11 I I i 
a212 LYRSITQAEAEKIAGI PLNTPYAEPLCGLFWQYGVWLNPPTFVRALLSHPLIGLHEDTPL 

250 260 270 280 290 300 

310 320 330 340 350 360 

m212 . pep TDISHDGEKWIASTPNGTFTATHII YCTGANSPYLPETNLAALPLRQIRGQTGLTPSTPF 

I I I I I II I II I I i M I I I 1 I I I I I ! I I 1 ! I I II II I I I I I I : I I I I I I I 1 ! I I I I ( 1 I i i 
a212 TDISHDGEKWIASTPNGTFTATHI IYCTGANSPYLPETNLATLPLRQIRGQTGLTPSTPF 

310 320 330 340 350 360 

370 360 390 400 410 420 

m2 12 . pep SEQLRCAVSGESYISPSWHGLHCYGASFIPNSSHTGWNEAEEASNRQA1AHLNPALSESL 

II I I I I I I M I I I I I I I I I I I I 1 I I I I 1 I I 1 i I M I 1 M M ' i I 1 I I I I 1 I 1 I 1 I 1 1 I I 1 
a212 SEQLRCAVSGESY1SPSWHGLHCYGASFIPNSSHTGWNEAEEASNRQALAHLNPALSESL 

370 360 390 400 410 420 

430 440 450 460 470 480 

m212 . pep FAANPNPQKHQGHAAIRCDSPDHLPLVGALGDIAAMRQTYTKLALDKNYRIDTPCPYLPN 
I I II II ( I I I I I I I I i I I I I I I I I I I I I II I I i I I I : I I I : I I I I I I I I I ! I : I I I I I II 
a212 FAANPNPQKHQGHAAIRCDSPDHLPLVGALGDIAAMQOTYAKLALDKNYRIDAPCPYLPN 

430 440 450 460 470 480 

490 500 510 520 530 540 

m212 .pep AYVNTAHGTRGLATAPICAAXXAAQIXGLPHPFXQRLRHALKPNRTIIRAIVRRKDLTPX 
li : I I I I I I t I II \\ I I I I i I I "■ I Mill: : I I I II! Ill I : I HI I I i I III il! 
a212 AYANTAHGTRGLATAPICAAAVAAEILGLPHPLSKRLRHALHPNRAIIRAIVRRKDLTPX 

490 500 510 520 530 540 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 739>; 

g2I4 . seq 

1 atgatacaaa agatatgtaa gctatttgtt 

51 ttctcccgct tttgcccttc aaagcgacag 

101 aagccgacca aggttcgctc gatcaagcca 

151 ggcaatgtca ccatcagaca gggtacgctc 

201 caacgtcaca cgcggcaggc aaaggcggcg 

251 tcgcccgtcc gcttcagcca aacgttggac 

301 cggtcaggca aacaacgtta cccattcctc 

3 51 tgaccggcaa tgccaaagtg cagcgcggcg 

4 01 gtcattacct acaacaccaa aaccgaagtc 
451 gaaatcgggt gcgaaatccg cttccaaaac 
501 tccagccttc aagcacacaa aaaaccgaat 
551 cggaaacgca gttcagacgg catttgccga 
601 tattga 

This corresponds to the amino acid sequence <SEQ ID 740; ORF 214.ng>: 

g214 .pep 



ttaattgtaa 
csgacggccc 
accaaaggac 
aacattcccg 
aatccgtgag 
gggggcaaag 
cgcaggaagc 
gcgacgttgc 
tataccatca 
cggcagggtc 
aaccccgacg 
ccgaaatgcc 



tttt cgcaac 
atccaaatcg 
cacattcagc 
cctcgtgtgt 
ggcggaaggt 
ggacggcgcg 
actgtcgttc 
cgaaggtgcg 
acggcagcac 
agcgtcgtca 
ccgtctgaaa 
gagaagagat 
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3 51 TAACCGGTAA TGCCAAAGTA CAGCGCGGCG GCGATGTCGC CGAAGGTGCG 

4 01 GTGATTACAT ACAACACCAA AACCGAAGTC TATACCATCA GCGGCAGCAC 
4 51 AAAATCCGGC GCAAAATCCG CTTCCAAATC CGGCAGGGTC AGCGTCGTTA 
501 TCCAGCCTTC GAGTACGCAA AAATCCGAAT AATCCCAATG CCGTCTGAAA 
551 CATAAACCTG GTT CGGACGG CATTTGCCGA CCGAAATATT GAAGAGATAT 
601 TTATGA 

This corresponds to the amino acid sequence <SEQ ID 744; ORF 214.a>: 

a214 . pep 

1 MIQKICKLFV LIAFFSASPA FA LQSDSRQP IQIEADQGSL DQANQSTTFS 
51 GNWIRQGTL MISAARVNVT RGXQRRRXRE GGRFASPLQP DIGRRQRHGA 
101 RTGKQRCLFJ CRQHRSLNR* CQSTARRRCR RRCGrYIQHQ NRSLYHQRQH 
151 KIRRKIRFQI RQGQRKVPAF EYAKIRIIPM PSETTWFGR HLPTEILKRY 
201 L* 

m2 1 4/a2 1 4 99.3% identity in 1 52 aa overlap 

10 20 30 40 50 60 

MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNVVIRQGTL 
11 M ) t i I i I ! I I I I I I i I I i I I II M t I It I I I t I I II i t I 1 I I I It I t I I I t I t t II ! 
MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADQGSLDQANQSTTFSGNWIRQGTL 
10 20 30 40 50 60 

70 80 90 100 110 120 

NISAARVNVTRGRQRRRIREGGRFASPLQPDIGRRQRHGARTGKQRCLFICRQHRSLNRX 
I I t I I I 1 I I i II I 1 1 1 1 I I 1 I 1 I I II I t I I I I 1 I I I I I I M I I I I 1 I i I t I 1 f I I M M 
NISAARVNVTRGXQRRRIREGGRFASPLQPDIGRRQRHGARTGKQRCLFICRQKRSLNRX 
70 80 90 100 110 120 

130 140 150 

CQSTARRRCRRRCGDYIQHQNRSLYHQRQHKI 
I II I I I I I I If I M I I I i I II I I If I I I I I I I 

CQSTARRRCRRRCGDYIQHQNRSLYHQRQHKIRRKIRFQIRQGQRRYPAFE YAKIRI I PM 
130 140 150 160 170 180 

PSETXTWFGRHLPTE ILKRYLX 
190 200 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 745>: 

g214-l . seq 

1 ATGATACAAA AGATATGTAA GCTATTTGTT TTAATTGTAA TTTTTGCAAC 

51 TTCTCCCGCT TTTGCCCTTC AAAGCGACAG CAGACGGCCC ATCCAAATCG 

101 AAGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGTAC CACATTTAGC 

151 GGC AATGTCA TCATCAGACA GGGTACGCTC AACATTTCCG CCTCGCGCGT 

201 CAACGTCACA CGCGGCGGCA AAGGCGGCGA ATCCGTGAGG GCGGAAGGTT 

251 CGCCCGTCCG CTTCAGCCAA ACGTTGGACG GGGGCAAAGG GACGGTGCGC 

301 GGTCAGGCAA ACAACGTTAC CTATTCCTCC GCAGGAAGCA CCGTCGTTCT 

351 GACCGGCAAT GCCAAAGTGC AGCGCGGCGG CGACGTTGCC GAAGGTGCGG 

4 01 TCATTACCTA CAACACCAAA ACCGAAGTCT ATACCATCAA CGGCAGCACG 

4 51 AAATCGGGTG CGAAATCCGC TTCCAAAACC GGCAGGGTCA GCGTCGTCAT 

501 CCAGCCTTCA AGCACACAAA AAACCGAATA A 

This corresponds to the amino acid sequence <SEQ ID 746; ORF 214-1. ng>: 

g214-l .pep 

1 MIQKICKLFV LI VI FATS PA FA LQSDSRRP IQIEADQGSL DQANQSTTFS 

51 GNVI IRQGTL NISASRVNVT RGGKGGESVR AEGSPVRFSQ TLDGGKGTVR 

101 GQANNVTYSS AGSTVVLTGN AKVQRGGDVA EGAVITYNTK TEVYTINGST 

151 KSGAKSASKT GRVSVVIQPS STQKTE* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 747>: 

m214-l . seq 

1 ATGATACAAA AGATATGTAA GCTATTTGTT TTAATAGCAT TTTTTTCGGC 

51 GTCCCCCGCT TTTGCCCTTC AAAGCGACAG CAGGCAGCCT ATTCAGATTG 

101 AGGCCGACCA AGGTTCGCTC GATCAAGCCA ACCAAAGCAC CACATTCAGC 

151 GGAAACGTCG TCATCAGACA GGGTACGCTC AATATTTCCG CCGCCCGCGT 



ra2 14 . pep 
a214 

m214 . pep 
a2l4 

m214 . pep 
a214 

a214 
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51 GNVVIROGTL NISAARVNVT RGGKGGESVR AEGSPVRFSQ TLDGGKG7VR 
101 GQANNVAYSS AGSTVVLTGN AKVQRGGDVA EGAVITYHTK TEVYT ISGST 
151 KSGAKSASKS GRVSVVIQPS STQKSE* 

a214-l/m214-l 100.0$ identity in 176 aa overlap 

10 20 30 40 50 60 

a214-l -pep miqkicklfvliaffsaspafalosdsropiqieadcjsldqanqsttfsgnvvirqgtl 

I I II M II I I I ! I I I I I I I I I II II I I I I I II I I I I I I I f I I I I I I I I M I I I I I I I I I I 
m214-l MIQKICKLFVLIAFFSASPAFALQSDSRQPIQIEADOGSLDQANQSTTFSGNVVIRQGTL 
10 20 30 40 50 60 

70 80 90 100 110 120 

a2 1 4 - 1 . pep NI SAARVNVTRGGKGGESVRAEGSPVRFSQTLDGGKGTVRGQANNVA YSSAGSTVVLTGN 
I I I I I I ! I ( I I I I I 1 t I I I I I ! I II I I I I I I I I I M I I I I I 1 I ( I t I . ! I i I I I 1 I I I I I 
m2 1 4 - 1 N I S AARVNVTRGGKGGESVRAEGS PVRFSQTLDGGKGTVRGQANN VAYSS AGSTVVLTGN 

70 80 90 100 110 120 

130 140 150 160 170 

a214-l -pep AKVQRGGDVAEGAVITYNTKTEVYTISGSTKSGAKSASKSGRVSVVIQPSSTQKSEX 
I I 1 1 M M I I II 1 I I I I I I 1 I I t I I t 1 1 II I II I ft t I I M ) I I M I 1 I I M I I t I I 
m21 4 - 1 AKVQRGGDVAEGAVITYNTKTEVYTI SGSTKSGAKSASKSGRVSVVIQPSSTQKSEX 

130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 75 1>: 

g215 . seq 

1 atgaaagtaa gatggcggta cggaattgcg ttcccattga tattggcggt 

51 tgccttgggc agcctgtcgg catggttggg ccgtatcagc gaagtcgaaa 

101 tcgaggaagt caggctcaat cccgacgaac ctcaatacac aatggacggc 

151 ttggacggaa ggcggtttga cgaacaggga tacttgaaag aacatttgag 

201 cgcgaaaggt gcgaaacagt ttcccgaaaa cagcgacatc cattttgatt 

251 cgccgcatct cgtgttcttc caagaaggca ggctgttgta cgaagtcggc 

301 agcgatgaag ccgtttacca caccgaaaac aaacaggttc tttttaaaaa 

351 caacgttgtg ctgaccaaaa ccgccgacgg caggcggcag gcgggtaaag 

401 tcgaaaccga aaaaccgcac gtcgataccg aatctcaata tgcccaaacc 

4 51 gatacgcctg tcagtttcca atatggcgcg tcgcacggtc aggcgggcgg 

501 tatgacccac aaccacaaaa caggcatgtt gaacctctca tctaaagtga 

551 aagccgcgat ttatgataca aaagatatgt aa 

This corresponds to the amino acid sequence <SEQ ID 752; ORF 215.ng>: 

g215 -pep 

1 MKVRWRYGIA FPLILAVALG SLS AWLGRIS EVEIEEVRLN PDEPQYTMDG 

51 LDGRRFDEQG YLKEHLSAKG AKQFPENSDI HFDSPHLVFF QEGRLLYEVG 

101 SDEAVYHTEN KQVLFKNNW LTKTADGRRQ AGKVETEKLH VDTESQYAQT 

151 DTPVSFQYGA SHGQAGGMTY NHKTGMLNFS SKVKAAIYDT KDM* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 753>: 

m215.seq (partial) 

1 . .AGCCTGTCGG CATGGTTGGG TCGTATCAGC GAAGTCGAGA TTGAAGAAGT 

SI CAGGCTCAAT CCCGACGAAC CGCAATACAC AATGGACAGC TTGGACGGCA 

101 GGCGGTTTGA CGAACAGGGA TACTTGAAAG AACATTTGAG CGCGAAGGGC 

151 GCGAAACAGT TTCCGGAAAG CAGCGACATC CATTTTGATT CGCCGCATCT 

201 CGTGTTCTTC CAAGAAGGCA GGTTGTTGTA CGAAGTCGGC AGCGACGAAG 

251 CCGTTTACCA TACCGAAAAC AAACAGGTTC TTTTTAAAAA CAACGTTGTG 

3 01 CTGACCAAAA CCGCCGACGG CAAACGGCAG GCGGGTAAAG TTGAAGCCGA 

3 51 AAAGCTGCAC GTCGATACCG AATCTCAATA TGCCCAAACC GATACGCCTG 
401 CAGTTTCCA ATATGGTGCA TCGCACGGTC AGGCGGGCGG CATGACTTAC 

4 51 GACCACAwwA CAGGCATGTT GAACTTCTCA TCTAAAGTGA AAGCCACGAT 
501 TTATGATACA AAAGATATGT AA 

This corresponds to the amino acid sequence <SEQ ID 754; ORF 21 5>: 

m215.pep (partial) 

1 . . SLSAWLGRIS EVEIEEVRLN PDEPQYTMDS LDGRRFDEQG YLKEHLSAKG 

51 AKQFPESSDI HFDSPHLVFF QEGRLLYEVG SDEAVYHTEN KQVLFKNNW 

101 LTKTADGKRQ AGKVEAEKLH VDTESQYAQT DTPVSFQYGA SHGQAGGMTY 
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I l I I I I I : I j I ! I ! I I I I ( I I t I I 1 ! ! i ! I I I I I I I M I I 1 ( I I [ 1 I I I t 1 I i t I II I M 
a215 YLKEHLSSKGAKQFPESSDIHFDSPHLVFFQEGRLLYEVGSDEAVYHTENKQVLFKl^NVV 

70 80 90 100 110 120 

110 120 130 140 150 160 

m215 . pep LTKTADGKRQAGKVEAEKLHVDTESQYAQTDTPVSFQYGASHGQAGGMTYDKXTGMLNFS 
I ( I I II ! i I I i I ! i I I ! I I ! I ! I I i II II i I i i I i t I i I I I I I ! I I I I t II I i I I II I I 
a215 LTKTADGKRQAGKVEAEKLHVDTESQYAQTDTPVSFQYGASHGQAGGMTYDHKTGMLNFS 
130 140 150 160 170 180 

170 

m215.pep SKVKATIYDTKDMX 

II I ! I I I I I II I I I 
a215 SKVKATIYDTKDMX 

190 



The following partial DNA sequence was identified in N gonorrhoeae <SEQ ED 757>: 

g2l6,seq (partial) 

1 . . atgatatcga tttcgagctc ggtacccagc gacgaaatca ccgccaccat 

51 ccccgcactc aaacgcaaag acattaccct cgtctgcatc accgcccgcc 

101 ccgattcaac catggcgcgc catgccgata tccacaccac cgcatcggtt 

151 tcgcaagaag cctgcccgtt ggggcttgcc ccgaccacca gcaccaccgc 

201 cgctatggct ttgggcgacg cgttggcggt cgtcctgctg cgcgcccgcg 

251 cgttcacgcc cgacgacttc gccttgatcc accctgccgg cagcctcggc 

3 01 aaacgcctgc tcttgcgcgt tgccgacatt acgcacaaag gcggcggcct 

3 51 gcccgccgtc cgactcggca cgcccttgaa aggagccatc gccagcatga 

4 01 gcgagaaagg tttgggcatg tgggcgggaa cggacgggca aaggccgtct 
451 gaaaggcctt tttactga 

This corresponds to the ammo acid sequence <SEQ ID 758; OP.F 216.ng>: 

g216.pep (partial) 

1 . .MISISSSVPS DEITAIIPAL KRKDITLVCI TARPDSTMAR HADIHITASV 
51 SQEACPLGLA PTT STTAVMA LGDALAWLL RARAFTPDDF ALIHPAGSLG 
101 KRLLLRVADI MHKGGGLPAV RLGTPLKGAI VSMSEKGLGM WAGTDGQRLS 
151 ERPFY* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 759>: 



m216 . seq 












1 


ATGGCAATGG 


CAGAAAACGG 


AAAATATCTC 


GACTGGGCAC 


GCGAAGTGTT 


51 


GCACGCCGAA 


GCGGAAGGCT 


TGCGCGAAAT 


TGCAGCGGAA 


TTGsACAAAA 


101 


ACTTCGTCCT 


TGCGGCAGAC 


GCGTTGTTGC 


ACTGCAAGGG 


CAGGGTCGTT 


151 


ATCACGGGCA 


TGGTCAAGTC 


GGGACATATC 


GGGCGCAAAA 


TGGCGGCAAC 


201 


TATGGCCTCG 


ACCGGCACGC 


CTGCGTTTTT 


CGTCCACCCT 


GCGGAAGCGG 


251 


CACACGgCGA 


TTTGGGTATG 


ATTGTGGACA 


rCGACGTGGT 


CGTCGCGATT 


301 


TCCAATTCCG 


GCGAAAGCGA 


CGAAATCGCC 


GCCATCATCC 


CCGCACTCAA 


351 


ACGCAAAGAC 


ATCACGCTTG 


TCTGCATCAC 


CGCCCGCCCC 


GATTCAACCA 


401 


TGGCGCGCCA 


TGCCGACATC 


CACATCACGG 


CGTCGGTTTC 


CAAAGAAGCC 


451 


TGCCCGCTGG 


GGCTTGCCCC 


GACCACCAGC 


ACCACCGCCG 


TCATGGCTTT 


501 


GGGCGATGCG 


TTGGCGGTCG 


TCCtGCTGCG 


CgcACGCGCG 


TTCACGCCCG 


551 


ACGATTTCGC 


CTTGAGCCAT 


CCTGCCGGCA 


GCCTCGGCAA 


ACGCCTACTT 


601 


TTGCGCGTTG 


CCGACATTAT 


GCACAAAGGC 


GGCGGCCTGC 


CTGCCGTCCG 


651 


ACTCGGCACG 


CCCTTGAAAG 


AAGCCATCGT 


CAGCATGAGT 


GAAAAAGGGC 


701 


TGGGCATGTT 


GGCGGTAACG 


GACGGGCAAG 


GCCGTCTGAA 


AGGCGTATTC 


751 


ACCGACGGCG 


ATTTGCGCCG 


CCTGTTTCAA 


GAATGCGACA 


ATTTTACCGG 


801 


TCTTTCGATA 


GACGAAGTCA 


TGCATACGCA 


TCCTAAAACC 


ATCTCCGCCG 


851 


AACGTCTCGC 


CACCGAAGCC 


CTGAAAGTCA 


TGCAGGCAAA 


CCATGTGAAC 


901 


GGGCTTCTGG 


TTACCGATGC 


AGATGGCGTG 


CTGATCGGCG 


CGCTGAATAT 


951 


GCACGACCTG 


CTGGCGGCAC 


GGATTGTATA 


G 





This corresponds to the amino acid sequence <SEQ ID 760; ORF 216>: 

n\216 . pep 

1 MAMAENGKYL DWAREVLHAE AEGLREIAAE LXKNFVLAAD ALLHCKGRW 
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301 GLLVTDADGV LIGALNMHDL LAARIV* 

m216/a216 97.2% identity in 326 aa overlap 

10 20 30 40 50 60 

m21 6 . pep MAMAENGKYLDWAREVLHAEAEGLREIAAELXKNFVLAADALLHCKGRWITGMVKSGKI 
till I ! I I I 1 ! I ! t I I : ! I I I I ! f I I I : I : I ! : II I i I I ! ! ! M I I II ! ! ! i ! I I i 
a216 MAMAGNEKYLDWAREVLHTEAEGLREIAADLDENFALAADALLHCKGRWITGMGKSGHI 

10 20 30 40 50 60 

70 80 90 100 110 120 

m2 1 6 . pep GRKMAATMASTGTPAFFVHPAEAAHGDLGMI VDXDVVVAI SNSGESDE I AAI I PALKRKD 

f It I I I [ I I M I I II ! ! I I I I I I I I I I I I I I 1 I I I I I I I I M I I I I I I I I I I f I I I j If 
a 2 1 6 GRKMAATMASTGTPAFFVHPAEAAHGDLGMI VDNDV.^AI SNSGES DEI AAI I PALKRKD 

70 80 90 100 110 120 

130 140 150 160 170 180 

m2 1 6 . pep ITLVCITARPDSTMARHADT^TTASVSKEACPLGLAPTTSTTAVMALGDALAVVLLRARA. 

M ] I i ) I I I I I I I I I 1 I I f I I i I I I I I I 1 I I N I I I I I I I II I I i i I I I I ! II I I I i i I I 
a21 6 ITLVCITARPDSTMARHADIHITASVSKEACPLGLAPTTSTTAVMALGDALAWLLRARA 

130 140 150 160 170 180 

190 200 210 220 230 240 

m216.pep FT PDDFALSHPAGSLGKRLLLRV AD IMHKGGGLPAVRLGTPLKEAI VSMSEKGLGMLAVT 

! I f M 1 I I t I II II II i I I ! I i II I t I! I It 11 i I t II I I i It i ! I I I ) I i i II ! I i 1 i I 
a216 FT PDDFALSHPAGSLGKRLLLRVADIMHKGGGLPAVRLGTPLKEAI VSMSEKGLGMLAVT 

190 200 210 220 230 240 

250 260 270 280 290 300 

m216.pep DGQGRLKGVFTDGDLRRLFQECDNFTGLSIDEVMHTHPKTISAERLATEALKVMQANHVN 
I H I I I M I i I I I I I i I i I i I I I M I II II I I I I I i I I II I I M I I I i II M ! i ! I M I S 
a216 DGQGRLKGVFT DGDLRRLFQECDNFTGLS I DEVMHTHFKT I SAERLATEALKVMQANHVN 

250 260 270 280 290 300 

310 320 
m2 1 6 . pep GLLVTDADGVLIGALNMHDLLAARIVX 
I I I I I t I I I i ( I ! I ( M I I I i ! i I I I I 
a216 GLLVTDADGVLIGALNMHDLLAARIVX 

310 320 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 763- 



g217 . seq 




1 


atggcggatg 


51 


tgctctcttc 


101 


ttttggccat 


151 


acgcgccacc 


201 


tatccgacag 


251 


tgattcacgt 


301 


gccttcgggc 


351 


aggcaaacag 


401 


tggacgttca 


451 


ggtcagcccg 


501 


ctttgccgac 


551 


gcataaaaaa 


601 


cggcaatgcc 


651 


aaggcgtata 



jttcgata ttcggggtta a 

This corresponds to the amino acid sequence <SEQ ID 764; ORF 21 7.ng>: 

g217 .pep. . 

1 MADDGLLRQL SEKPSQSALF LPFDPFVFEV LDCLLVI GPG LKQCFKQIPA 
51 TRHPFADRRR LPPYPGNIRQ GFEEGGKTSE HGGLIHVGIP RADILPHRVA 
101 AFGQHPAQYH TVCRLLPGKQ LIRAAAHFSV QSPMDVQIGN HIVQKRQIVP 
151 GQSETAQHGR GFQKREHFAD FKTAFQQVGH ALQRIKKRLP AADFHVRNGI 
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201 TATCCGTCAG GGTTTCGAGG AAGGCGGCAA AACGTCCGAA CAGGGCGGTT 

251 TGGTTCACGT CGGCATACCA CGCGCTGACC CCCTGCCACA TCGGATTGCC 

301 GCCTTCGGGC AGCATCCAGC CCAATATCAT GCGTTCTACC GCCTGCTTCC 

351 AGGTGAACAG CTGATCCGTG CCGCCGCGCA TTTCTCCGTC CAAACCCCAG 

4 01 CGGACGTTCA AATCGGCAAC CATGTCGTGC AAAAGCGGCA AATCGTCCTC 

4 51 AGTCAGTCCG AAATGGCGCA ACACGGGCGC GGTTTCTAAA AGCACAAGCA 

501 CTTTATCGAC TTCAAATCGG CTTTCCAACA AGTCGAACAG GCATGACAAA 

551 GCATGAAACA GCGGTTGTCG GCGGCTGATT TTCACATCCG AAACGGAATA 

601 CGGCAATGCC TGCGCGCCGG GCTGCGCCTG TCCGAACACG GCTTCGATAA 

651 AAGGCGTATA GGATTCGATA TTCGGGGTTA A 



This corresponds to the amino acid sequence <SEQ ID 768; ORP 217.a>: 

a217 .pep 

1 VADDGVQRQL SGKLRQFGFR LPFD PFVFEA LDCLLVIAFD L EQCFKQIPA 

51 TRHPFVNRRR LPPYPYNIRQ GFEEGGKTSE QGGLVHVGIP RADPLPHRIA 

101 AFGQHPAQYH AFYRLLPGEQ LIRAAAHFSV QTPADVQIGN HVVQKRQIVL 

151 SQSEMAQHGR GF*KHKHFID FKSAFQQVEQ A'QSMKQRLS AADFHIRNGI 
201 RQCLRAGLRL SEHGFDKRRI GFDIRG* 



m217/a217 90.3% identity in 226 aa overlap 

10 20 30 40 bO 60 

m217 . pep MADDGVRRQLSGKLRQFGFRLPFDPFVFKVLDXLLVIGFSLEQCFKQIPATRHPFADRCG 

: I It ! I : t M I t I I I i 1 i It I t I I II I I : : I I I I I I : I ; I i I I I ! I I I i I 1 i ) I : : I 
a217 VADDGVQRQLSGKLRQFGFRLFFDPFVFEALDCLLVIAFDLEQCFKQT PATRHPFVNRRR 

10 20 30 40 50 60 



70 80 90 100 110 120 

m217 .pep LPPYPYNIRQGFEEGGKTSEHGGLIHVGIPRADILPHRIAAFGQHPAQYHhFYRLLPGEQ 
t I I ! i I ( I I ( I I ! I ! ! I I I I : I ! I : t I I t I I I I I I I I I ! I I i I I t I I ! i I II I I I ! I I ! 
a217 LPPYPYNIRQGFEEGGKTSEQGGLVHVGIPRADPLPHRIAAFGQHPAQYHAFYRLLPGEQ 

70 80 90 100 110 120 



130 140 150 160 170 180 

m217.pep LIRAAAHFSVQTPVDVQIGNHWQKRXIVLSQSETAQHGRGFXKHKHFIDFKSAFQQVEQ 
I I I i i I I I I M I I : I I I I ! I I I I I I I I I I I I II I I I I I I I i I I I I I I I I I I I 1 I I I I I 
a217 LIRAAAHFSVQTPADVQIGNHVVOKROIVLSOSEMAQHGRGFXKHKHFIDFKSAFQQVEQ 

130 140 150 160 170 1&0 



190 200 210 220 

m217 .pep AXQSMKQRLAAADFHVXHGIRQCLRTGLRLSEHGFDKRRIGFDIRGX 
I I I I I ) I 1 I : I I I I I : : I 1 I I I I I : I I I I I I I I I M M ! I I ! I I ! ! 
a217 AXQSMKQRLSAADFHIRNGIRQCLRAGLRLSEHGFDKRRIGFDIRGX 

190 200 210 220 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 769>: 
g218 . seq 

1 atggttgcgg tggatcctta tacggcaaaa gtggtcaaca ccatgccgcg 

51 caatcagggt tggtatcaca ctatggatga aatccacggc gatatgatgc 

101 tcggtgcggc aggcgattat cttttggaaa cggcagctcc actgaccatc 

151 attatggttg tcagcggctc gcacctttgg tgggcgaaac agcgcggcac 

201 taaagcgatg ctgctgccgc caaaaagcag ggcgcgttct tggtggcgga 

251 atctgcacgg cgcgtttgga acttgggtgt cgttgatttt actgttgttc 

301 tgcctgtcgg gtattgcttg ggcaggtatt tggggcggca aattcgtgca 

3 51 ggcttggaat cagttcccgg ccggcaaatg gggtgtcgaa ccgaaccccg 

401 tttcaatcgt gccgacccac ggcgaggtat tgaatgacgg caaggttaag 

451 gaagtgccgt ggattttgga gcttatgcct atgcctgtct cagggacgac 

501 tgtgggtgaa aacggcatca accccaccga gcccaataac attggaaacc 

551 gtrcgaccgtt tcgcgcggga aatcggtttc aaagggcgtt atcagtcgaa 

601 tttgcccaaa ggcgaggacg gggtatggac tttgtcgcag gactctatga 

651 gttatga 

This corresponds to the amino acid sequence <SEQ ID 770; ORF 218.ng>: 

g213 .pep 

1 MVAVDPYTAK WNTMPRNQG WYHTMDEIHG DMMLGAAGDY LLETAA2LTI 
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a21B . seq 




1 


ATGGTCGCGG 


51 


CAATCAGGGT 


101 


TCGGTTCGAC 


151 


ATCATGATAA 


201 


CAAGGCG AT G 


251 


ATCTGCACGG 


301 


TGCCTGTCGG 


351 


GGCTTGGAGT 


401 


TTTCAGTCGT 


451 


GAAGTGCCGT 


501 


TGTGGGCAAA 


551 


TCGACCGTTT 


601 


TTTGCCCAAA 


651 


GTTA 



TCGATCCTTA TACGGCAAAA 
TGGTATTACG CGATGGATGA 
AGGTGATTAT CTTTTGGAAA 
TCAGCGGTTT GTACCTTTGG 
CTGCTGCCGC CAAAAGGCAG 
CGCGTTTGGA ACTTGGGTGT 
GTATTGCTTG GGCAGGTATT 
CAGTTCCCGG CAGGCAAATG 
GCCGACCCAC GGCGAGGTAT 
GGGTTTTGGA GCTTACGCCT 
GACGGTATTA ACCCTGACGA 
TGCGCGG . GA AATCGGTTTC 
GGCGAGGACG GCGTATGGAC 



GTGGTCAGTA CCATGCCGCG 
AATCCACAGC GATATGATGC 
CGGCTGCATC GCTGACGATT 
TGGGTGAAAC GGCGCGGCAT 
GGCGCGTTCT TGGTGGCGGA 
CGTTGATTTT ACTGTTGTTC 
TGGGGCGGCA AGTTCGTGCA 
GGGTGTCGAA CCGAACCCTG 
TGAATGACGG CAAGGTTAAG 
ATGCCTGTTT CAGGGACGAC 
GCCGATGACA TTGGAAACCG 
AAAGGGCGTT ATCAGCTGAA 
TTTGTCGCAG GATTCTATGA 



T his corresponds to the amino acid sequence <SEQ ID 774; ORF 218.a>: 

a218 .pep 

1 MVAVDPYTAK VVSTMPRNQG WYYAMDEIHS DMMLGSTGDY LLETAASLTI 

51 IMIISGL YLW WVKRRGIKAM LLPPKGRARS WWRNLHGAFG TWV SLILLLF 

101 CLSGIAWAGI WGGKFVQAWS QFPAGKWGVE PNPVSWPTH GEVLNDGKVK 

151 EVPWVLELTP MPVSGTTVGK DGINPDEPMT LETVDRFARX NRFQRALSAE 

201 FAQRRGRRMD FVAGFYEL 



m218/a218 95.9% identity in 218 aa overlap 

10 20 30 40 50 60 

m218.pep MVAVDPYTAKVVSTMPRNQGWYYTMDEIHSDMMLGAAGDYLLETAASLTIIMVVSGLYLW 
i I I ! M I i I ! ! I I I I I I ! I I I 1 1 : if I ( i I I I t I I : : I I 1 I f I I I ( I I I ! I t : : ( I I I i I 
a218 MVAVDPYTAKWSTMPRNQGWYYAMDEIHSDMMLGSTGDYLLETAASLTI IMIISGLYLW 

10 20 30 40 50 60 



70 80 90 100 110 120 

m218.pep WVKRRGIKAMLLPSKGXARSWWRNLHGTFGTWVSLILLLFCLSGIAWAGIWGGKFVQAWS 
1 I ! 1 ! I I I M I i I t I 1 I ) i M I j i 1 : I i I I I I I ) 1 ) i II I t I I) I i I I M i 1 1 I t i I ! 
a218 WVKRRGIKAMLLPPKGRARSWWRNLHGAFGTWVSLILLLFCLSGIAWAGIWGGKFVQAWS 

70 80 90 100 110 120 



130 140 150 160 170 180 

m2 1 8 . pep QFPAGKWGVEPNPVS VVPTHGEVLNDGKVKEVPWVLELTPMPVSGTTVGKDGINPDEPMT 

I I I ! I I I I I II I I I I ( I I I I I f I I I I I I II I I I I I ! ! I I II I ! I I i I I I I I I I I I I I I I I 
a218 QFPAGKWGVEPNPVS WPTHGEVLNDGKVKEVPWVLELTPMPVSGTTVGKDGINPDEPMT 

130 140 150 160 170 180 



190 2Q0 210 

m2 1 8 . pep LET VDRFARXNRFQRALSVEFAQRRGRRMDFVAGFYEL 

! II I I 1 I I I i t I I I II I i : I I I I II i I I I II M II i I I 
a 2 1 8 LETVDRFARXNRFQRALSAEFAQRRGRRMDFVAGF YEL 

190 200 210 
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70 80 90 100 110 120 

130 140 150 160 170 180 

LGWWSVLANVLFCLAVIFIGISGCVMW 

l!lllllMhilllllllllllll!lill!hl = :!ll!ll!h!!llll MUM I 

LGWWSVLAmA/'FCIAVIFIGISGCVl^KRRPSGVAGIVPPAQKIKLPVVWAMALPLLLI 
130 140 150 160 170 180 

190 200 210 

m219 -pep ALLFPTSLLAIAVIWLLDTLLLSRIPVLRRWFKX 

MMIMIilMIMIilllllllllihiil 
g219 ALLFPTALLAIAVIWLLDTLLLSRIPVLRKWFK 
190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 779>: 

a219 . seq 

1 ATGACGGCAA GGTTAAGGAA GTGCCGTGGG TTTTGGAGCT TACGCCTATG 

51 CCTGTTTCAG GGACGACTGT GGGCAAAGAC GGTATTAACC CTGACGAGCC 

101 GATGACATTG GAAACCGTCG ACCGTTTTGC GCGG . GAAAT CGGTTTCAAA 

151 GGGCGTTATC AGCTGAATTT GCCCAAAGGC GAGGACGGCG TATGGACTTT 

201 GTCGCAGGAT TCTATGAGTT ACGACAT^AT CAGCCCGTTT GCTGACCGCA 

2 51 CGGTGCATAT CGACCAGTAC AGCGGCAAGA TTCTTGCCGA CATCCGTTTT 

301 GACGATTACA ACCCGTTCGG CAAATTTATG GCr~^\AGCA TTGCGCTGCA 

351 TATGGGGACT TTGGGCTGGT GGAGCGTGTT GGC^AAOGTT TTGTTCTGCC 

401 TTGCCGTGAT TTTTATCGGC ATCAGCGGCT GCGTGATGTG GTGGAAACGC 

4 51 CGTCCGTCCG GCGCGGTGGG CATGGTTCCG CCGGCGCAAA AAATCAAGCT 

501 GCCCGTCTGG 7GGGCAATGG CGGTGCCGCT GCTGCTGATT GCATTGCTTT 

551 TCCCGACCGC GTTGCTTGCC ATTGCCGTGA TTTGGCTGTT GGATACGCTG 

601 CTGTTGTCGC GGATTCCTGT TTTGAGGAGA TGGTTTAAAT GA 

This corresponds to the amino acid sequence <SEQ ID 780; ORF 219.a>: 

a219.pep 

1 MTARLRKCRG FW3LRLCLFQ GRLWAKTVLT LTSR*HWKPS TVLRXEIGFK 

51 GRYQLNLPKG EDGVWTLSQD SMSYDMISPF ADRTVHIDQY SGKILADIRF 

101 DDYNPFGKFM AASIALHMGT LGWWSVLAN V LFCLAVIFIG ISGCVM WWKR 

151 RPSGAVGMVP PAQKIKLPVW WAMAVPL LLI ALLFPTALLA IAVI WLLDTL 

201 LLSRIPVLRR WFK* 

m219/a219 94.8% identity in 213 aa overlap 

10 20 30 40 50 60 

m21 9 . pep MTARLRKCRGFWSLRLCLFQGRXWAKTALTLTSRXHWKPSTALRGEIGFKGRYQLNLPKG 
I I II I I I i I I I I I I I I M I I I I 1111:1111111111111:11 I I I I I ( II I I I I I II 
a21 9 MTARLRKCRGFWSLRLCLFQGRLWAKTVLTLTSRXKWKPSTVLRXEIGFKGRYQLNLPKG 

10 20 30 40 50 60 

70 80 90 100 110 120 

m219.pep EDGVWTLSQDSMSYDMISPFADRTVHIDQYSGKILADIRFDDYNPFGKFMAASIALHMGT 
I I I I I I I I I ! I I I I I I I I I If I I I I I I ! II I I I 1 I I I I I I I I ] M I I ! II I I I I I I I M I 
a219 EDGVWTL3QDSMSYDMISPFADRTVHIDQYSGKILADIRFDDYNPFGKFMAASIALHMGT 
70 80 90 100 110 120 

130 140 150 160 170 180 

m219.pep LGWWSVLANVLFCLAVIFIGISGCVMWWKRRPTGAVGIVPPAQKVKLPVWWMMALPLLAI 
I 1 I I I I I I I I I I I I I I I I M I I II I I I I II I i : i I I I : I I I II I : I I I I I I i I : I I I I 
a219 LGWWSVLANVLFCLAVIFIGISGCVMWWKRRPSGAVGMVPPAQKIKLPVWWAMAVPLLLI 

130 140 150 160 170 180 

190 200 210 

m219.pep ALLFPTSLLAIAVIWLLDTLLLSRIPVLRRWFKX 
1 I M I 1 : i I I I i I I I I ! I ! I I ! I I 11 I I I II I I I 
a219 ALLFPTALLAIAVIWLLDTLLLSRIPV1RRWFKX 

190 200 210 



m219 -pep 
g219 
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m221 .pep CPAEVQI^KLVPSVOMWSEOFHFFFKIFDVGIGAAFVFGDDFVAAAWAIXSVAERtr^VK 

MI!>ll!llllllll!lllMMIIillltlt!illlllllllMllll|:!IMII 

g2 2 1 AAAEVQLGKLVPSVQMWSEOFHFFFKI FDVG I GAA FV FGDD FVAAA WADGVAKRNVNVK 

130 140 150 ISO 170 180 



m221.pep GKRFVX 

IIMK 

g221 GKRFVX 



The following partial DNA sequence was identified in N. meningitidis <SEQ ED 785>: 

a221 . seq 

1 ATGGTGGTTT TGATGCTCCG AAGTCTGGTG CGGCAGGCCG TAAATCAAAT 

51 CGACGCTGAC GGATTTGAAC CCCGCTTCGC GCGCCGCATC GATGACTTCT 

101 TTGGTTTCTT CGTAACTTTG GATGCGGTTG ACCGCCGCCT GCACTTTGGG 

151 GTCGAAATCC TGAATGCCGA TGCTCATGCG GTTGAAGCCG AGTCTGCCGA 

201 GCATGAGGAC GGTGTCGCGG CTGACTTTGC GCGGGTCGAT TTCGATGGAG 

251 TATTCGCCGG TGGGGATTAA CTCGAAATGT TTGCGTATCA TGCGGAAGAC 

301 ACGTTCGATT TGGTCGTCGC TCAAAAAGGT CGGCGTGCCG CCGCCGAAGT 

351 GCAGTTGGGC AAGCTGGTGC CGTCCGTTCA GATGTGGAGC GAGCAGTTCC 

4 01 ATTTCTTTTT CAAGAAATTC GATGTAGGCA TCGGCGCGGC TTTTGTCTTT 

4 51 GGTGATGATT TTGTTGCAGC CGCAGTAGTA GCAGATGGTG TTGCAGAACG 

501 GAATGTGAAT GTAAAGGGAA AGCGGTTTGT TTAA 



This corresponds to the amino acid sequence <SEQ ID 786; ORF 221.a>; 

a221 .pep 

1 MVVLMLRSLV RQAVNQIDAD GFEPRFARRI DDFFGFFVTL DAVDRRLHFG 

51 VEILHADAHA VEAESAEHED GVAADFARVD FDGVFAGGD* LEMFAYHAED 

101 TFDLWAQKG RRAAAEVQLG KLVPSVQMWS EQFHFFFKKF DV GIGAAFVF 

151 GDDFVAAAVV ADGVAERNVN VKGKRFV* 



m221/a221 95.5% identity in 177 aa overlap 

10 20 30 40 50 60 

m221 . pep MXVLMXRSLVRQAVNQIDADGFEFRFARRIDDFFGFFVTLDAVDRRLHFGVEILNADAHA 
S Ml I i I I II I I I I I I i II ! II 1 I I! II t I (I i i I i I I I I I I I II II I I II I ! I t I I I 
a221 MWLMLRSLVRQAVNQIDADGFEPRFARRIDDFFGFFVTLDAVDRRLHFGVEILNADAHA 

10 20 30 40 50 60 



70 80 90 100 110 119 

m221 . pep VEAESAEHEDGVAADFARVDFDGVFAGGDXLEMFAYHAEDTFDLFVAQKGA-CPAEVQLG 
I t M 1 I II I I I I li I t I I 1 I I 1 I I I I 1 I I I I I I i i I I 1 1 I II I 1 lilt) I I II 1 i 

a 2 2 1 VEAE SAEHEDGVAADFARVDFDGVFAGGDXLEM FAYHAE DT FDL WAQKGRRAAAEVQLG 

70 80 90 100 110 120 



120 130 140 150 160 170 

m221 . pep KLVPSVQMWSEQFHFFFKIFDVGIGAAFVFGDDFVAAAVVADGVAERNVNVKGKRFVX 
I I I I I i I I I i I I I i I I I ' I ! I I I II I I I If f I I I I I I I I i I I I I I I I I I M ! I I I I I 
a221 KLVPSVQMWSEQFHFFFKKFDVGIGAAFVFGDDFVAAAVVADGVAERNVNVKGKRrVX 
130 140 150 160 170 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 787>: 

g223 . seq 

1 atggaattca ggcaccaggt agtggtagtt ggtgtcgaac catttggtca 

51 tttcgatggc gaattggtct ttgttgccgc gcgccagttg gaagaattgt 

101 tccaaaggca ggttttggct atcgaagccg aaacgggcgg gaatcgcgcc 

151 cgtggatact tgcaggtcga ggatgtgatg gtagaaagtg aaatcacgta 

201 cagcaacgta atcagcgtta ggagcagctt ggtgtttcca gtttttcccg 

251 cgcaggtcct tggcaacgtc gagcagctct tgttcactga tctctttgcg 

301 ccagtatttt tcttgggcga atttcaattc acggaaggcg ccgacacgcg 

351 ggaagcctga 

This corresponds to the amino acid sequence <SEQ ID 788; ORF 223. ng>: 

g223 . pep. . 
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The following partial DNA sequence was identified in TV. gonorrhoeae <SEQ ID 793>: 



g225 . seq 






1 


acggattctt 


t t t tcaaacc 


51 


tgccgtccgc 


cccgcccttg 


101 


gcgagcagat 


tctcagacag 


151 


gtcaaccgag 


cccccgcccg 


201 


cggcgcgatg 


gggcttaacg 


251 


ccgcccggcg 


ggcgggcaat 


301 


cttttgggta 


ttgcctaccg 


351 


tgactgcagc 


ggatt^atgc 


401 


acctgccgcg 


cacgtcggcg 


451 


cgaagcgaat 


tgcagcccgg 


501 


cagccgcatt 


tcccatgtcg 


551 


acgcgccgcg 


cacggggaaa 


601 


tattggagcg 


gcaaatatgc 


651 


gtcacgcttt 


ctgaactga 



gaccaacctg ctcagcagcc 



This corresponds to the amino acid sequence <SEQ ID 794; ORF 225.ng>: 

g22 5 .pep 

1 MDSFFKPAW? AVLWLMFAVR PAIADELTNL LSSREQILRQ FAEDEQPVLP 

51 VNRAPARRAG NADELIGGAM GLNEQPWRV NRAXARRAGN ADKLIGSAMR 

101 LLGIAYRYGG TSVSTGFDCS GFMQHIFKRA MGINLPRTSA EQARMGAPVA 

151 RSELQPGDMV FFRTLGGSRI SHVGLYIGNN RFIHAPRTGK NIEITSLSHK 

201 YWSGKYAFAR RVKKNDPSRF LN* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 795>: 



m225 . seq 


(partial } 










1 


, . TTTTCAAACC 


CGG CAGTTTG 


GGCGGTTTTG 


TGGCTGAwGT 


TTGCCGTCCG 


51 


CCCCGCCCTT 


GCCGACGAGT 


TGACCAACCT 


GCTCAGCAGC 


CGCGAGCAGA 


101 


TTCTCAGACA 


GTTTGCCGAA 


GACGAACAGC 


CCGTTTTACC 


CATCAACCGA 


151 


GCCCCCGCCC 


GGCGGGCGGG 


CAATGCCGAC 


GAACTCATCG 


GCAGCGCGAT 


201 


GGGGCTTAAC 


GAACAGCCCG 


TTTTACCCGT 


CAACCGAGTC 


CCCGCCCGGC 


251 


GGGCGGGCAA 


TGCCGACGAA 


CTCATCGGCA 


ACGCGATGGG 


GCTTAACGAA 


301 


CAGCCCGTTT 


TACCCGTCAA 


CCGAGCCCCC 


GCCCGGCGGG 


CGGGCAATGC 


351 


CGACGAACTC 


ATCGGCAACG 


CGATGGGACT 


TTTGGGTATT 


GCCTACCGCT 


401 


ACGGCGGCAC 


ATCGGTTTCT 


ACCGGTTTTG 


ACTGCAGCGG 


CTTCATGCAG 


451 


CACATCTTCA 


AACGCGCCAT 


GGGCATCAAC 


CTGCCGCGCA 


CGTCGGCAGA 


501 


ACAGGCACGG 


ATGGGTACGC 


CGGTTGCCCG 


AAGCGAATTG 


CAGCCCGGAG 


551 


ATATGGTGTT 


TTTCCGCACG 


CTCGGCGGCA 


GCCGCATTTC 


CCATGTCGGA 


601 


CTTTATATCG 


GCAACAACCG 


CTTCATCCAC 


GCGCCGCGCA 


CGGGGAAAAA 


651 


TATCGAAATC 


ACCAGCCTGA 


GCCACAAATA 


TTGGAGCGGC 


AAATACGCGT 


701 


TCGCCCGCCG 


GGTCAAGAAA 


AACGACCCGT 


CCCGCTTTCT 


GAACTGA 



This corresponds to the amino acid sequence <SEQ ID 796; ORF 225>: 

m225.pep (partial) 

1 . . FSHPAVWAVL WLXFAVRPAL A DELTHLLSS REQIbROFAE DEQPVLPINR 

51 APARRAGNAD ELIGSAMGLN EQPVLPVNRV PARRAGNADE LIGNAMGLKE 

101 QPVLPVNRAP ARRAGNADEL IGNAMGLLGI AYRYGGTSVS TGFDCSGFMQ 

151 HIFKRAMGIN LPRTSAEQAR MGTPVARSEL QPGDMVFFRT LGGSRISHVG 

201 LYIGNNRFIH APRTGKNIEI TSLSHKYWSG KYAFARRVKK NDPSRFLN* 

Computer analysis of this amino acid sequence gave the following results; 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 225 shows 83-5% identity over a 248 aa overlap with a predicted ORF (ORF 225. ng) 
from N. gonorrhoeae: 

m225/g225 

10 20 30 40 50 

m225 ,pep FSNPAWAVLWLXFAWPALADELTNLLSSREQILRQFAEDEQPVLPINRAPARRAG 

I : 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 ! I II 1 1 1 1 1 1 1 1 1 1 1 !• 1 1 1 1 1 1 1 1 1 

g22 5 MDSFFKPAVWAVLWLMFAVRPALADELTNLLSSREQI LROFAEDEQPVLPVNRAPARRAG 
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l I I I ! I ! I I I I I ( I I ! I I I I t ! I I 1 I 1 I I ! [ 

a2 2 5 NADELIGSAt^GLNEQPVLPVNRXPARRAGNADXLIGNAMGLNEQPVLPVNRVPARRAGNA 

70 80 90 100 110 120 

90 100 110 120 130 140 

m22 5. pep DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSVSTGFDCSGF 

I If I I I I f I I I ! I I f I II I I II II ! I I I I I I ! M i I I i I i I I I I I I I I ! I : I I I I I f II I 
a22 5 DELIGNAMGLNEQPVLPVNRAPARRAGNADELIGNAMGLLGIAYRYGGTSISTGFDCSGF 
130 140 150 160 170 180 

150 160 170 180 190 200 

m225.pep MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVFFRTLGGSRISHVGLYIGNNRF 
I I I I I I I I f I I I I II I I I I I I M I i I I I I II I I I II ! I I I I I I I I I II I I I I I i I I I I ( 
a225 MQKIFKRAMGINLPRTSAEQARMGTPVARSELQPGDMVXFRTLGGSRISHVGLYIGNNRF 
190 200 210 220 230 240 

210 220 230 240 249 

m22 5 . pep I HAPRTGKN I E IT S LSHKYWSGKYAFARRVKKNDP SRFLNX 

I I I I I I I I I 1 I I I I II I ! I 1 ! I I II I M I I 1 1 1 1 M 1 I I I i 
a225 IHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLNX 
250 260 270 280 



The following partial DNA sequence was identified in N, gonorrhoeae <SEQ ID 799>: 

g225-l.seq 

1 atggattctt ttttcaaacc ggcagtttgg gcggttttgt ggctgatgtt 

51 tgccgtccgc cccgcccttg ccgacgagtt gaccaacctg ctcagcagcc 

101 gcgagcagat tctcagacag tttgccgaag acgaacagcc cgttttaccc 

151 gtcaaccgag cccccgcccg gcgggcgggc aatgccgacg aactcatcgg 

201 cggcgcgatg gggcttaacg aacagcccgt tgtacgcgtc aaccgagccn 

2 51 ccgcccggcg ggcgggcaat gccgacaaac tcatcggcag cgcgatgcgg 

301 cctttgggta ttgcctaccg ctacggcggc acatcggtgt ctaccggttc 

351 tgactgcagc ggattcatgc agcacatctt caaacgcgcc atgggcatca 

4 01 acctgccgcg cacgtcggcg gaacaggcgc ggatgggcgc acccgttgcc 

4 51 cgaagcgaat tgcagcccgg ggatatggtg tttttccgca cgctcggcgg 

501 cagccgcatt tcccatgtcg gactttatat cggcaacaac cgcttcatcc 

551 acgcgccgcg cacggggaaa aatatcgaaa tcaccagccc gagccacaaa 

601 tatcggagcg gcaaatatgc gttcgcccgc cgggtcaaga aaaacgaccc 

651 gtcacgcttt ctgaactga 

This corresponds to the amino acid sequence <SEQ ID 800; ORF 225-l.ng>: 

g225-l.pep 

1 MDSFFKPAVW AVLWLMFAVR PALA PELTNL LSSREQILRQ FAEDEQPVLP 

51 VNRAPARRAG NADELIGGAM GLNEQPWRV NRAXARRAGN ADKLIGSAMR 

101 LLGIAYRYGG TSVSTGFDCS GFMQHIFKRA MGINLPRTSA EQARMGAPVA 

151 RSELQPGDMV FFRTLGGSRI SHVGLYIGNN RFIHAPRTGK NIEITSLSHK 

2 01 YWSGKYAFAR RVKKNDPSRF LN* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 801>: 

m225-l . seq 

1 ATGGATTCTT TTTTCAAACC GGCAGTTTGG GCGGTTTTGT GGCTGATGTT 

51 TGCCGTCCGC CCCGCCCTTG CCGACGAGTT GACCAACyTG CTCAGCAGCC 

101 GCGAGCAGAT TCTCAGACAG TTTGCCGAAG ACGAACAGCC CGTTTTACCC 

151 ATCAACCGAG CCCCCGCCCG GCGGGCGGGC AATGCCGACG AACTCATCGG 

2 01 CAGCGCGATG GGGCTTAACG AACAGCCCGT TTTACCCGTC AACCGAGTCC 

2 51 CCGCCCGGCG GGCGGGCAAT GCCGACGAAC TCATCGGCAA CGCGATGGGG 

3 01 CTTAACGAAC AGCCCGTTTT ACCCGTCAAC CGAGCCCCCG CCCGGCGGGC 

3 51 GGGCAATGCC GACGAACTCA TCGGCAACGC GATGGGACTT TTGGGTATTG 
401 CCTACCGCTA CGGCGGCACA TCGGTTTCTA CCGGTTTTGA CTGCAGCGGC 

4 51 TTCATGCAGC ACATCTTCAA ACGCGCCATG GGCATCAACC TGCCGCGCAC 
501 GTCGGCAGAA CAGGCACGGA TGGGTACGCC GGTTGCCCGA AGCGAATTGC 
551 AGCCCGGAGA TATGGTGTTT TTCCGCACGC TCGGCGGCAG CCGCATTTCC 
601 CATGTCGGAC TTTATATCGG CAACAACCGC TTCATCCACG CGCCGCGCAC 
6 51 GGGGAAAAAT ATCGAAATCA CCAGCCTGAG CCACAAATAT TGGAGCGGCA 
701 AATACGCGTT CGCCCGCCGG GTCAAGAAAA ACGACCCGTC CCGCTTTCTG 
751 AACTGA 
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a225-l/m225-l 88.6% identity in 280 aa overlap 



10 20 30 40 50 60 

a225-l .pep [^SFFKPAVWAVXWLMFAVRPMJU^ELTNLLSSREQILRQFAEDEQPVLPINRXPARRAG 

1 i 1 1 i i I i 1 1 S i ! 1 1 i 1 1 ! I ! 1 1 1 1 i I S ! I i 1 1 1 1 ) i 1 1 1 i S [ I ( i I i 1 1 ! 1 1 Mill 

m225- 1 ^SFFKPAVWAVLWLMFAVRPAIADELTNLLSSREQILRQFAEDEQPVXPINRAPARRAG 
10 20 30 40 50 60 



70 80 90 100 110 120 

a225-l .pep NATELIGSAMGLNEQPVLPVNRXPARRAGNADXLIGNAMGLNEOPVLPVNRVPARRAGNA 

iiiiimMNim i jiiiiimiiM! 

m225-l NADELIGSAMGLNEQP VLPVNR V PARRAGNA 

70 80 90 



130 140 150 160 170 180 

a225-l .pep DEL I GNAMGLNEQ P VL P VNRAPARRAGNADEL I GNAMG LLG I A YRYGGT S I STG FDCSG F 

l!!IU!iMI{[liiil!l((Mlili|{!(lli!!Ml!tllli!i!i:!i!liilll 

tn22 5 - 1 DEL I GNAMGLNEQ PVLP VNRAPARRAGNADE L I GNAMG LLG I A YRYGGT SV STG FDC SG F 

100 110 120 130 140 150 



190 200 210 220 230 240 

a22S-l .pep MQHI FKRAMG INLPRTS AEQARMGTPVARSELQPGDMVXFRTLGGSRI SHVGLY IGNNRF 

i'l liiiilMilllillilMI liliiMIMir ijlnlilHliilMtlitt 

m22 5 - 1 MQHIFKRAMGINLPRTSAEQARMGTPVARSELQPGDbfVFFRTLGGSRISHVGLYIGNNRF 
160 170 180 190 200 210 



250 260 270 280 

a22 5 - 1 . pep IHAPRTGKNI EITSLSHKYWSGKYAFARRVKKNDPSRFLNX 

I ! i I M I i i 1 1 1 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 

m225-l rHAPRTGKNIEITSLSHKYWSGKYAFARRVKKNDPSRFLNX 
220 230 240 250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 805>: 

g226 . seq 

1 ATGAGCGAAA TCCTCAGGCA GCCCAGCGTT CTGCTTTTCC TCACGCTTGC 

51 CGTGTACGCG CTTGCGATTA TCGTGCGCAC GCGCACGGGC AATATCTTCT 

101 GCAACCCCGT ACTCGTCAGC ACTATCGTGC TGATTGCCTA CCTGAAAATC 

151 CTCGGTATCG ATTATGCGGT GTACCACAAC GCCGCGCAAT TCATTGATTT 

201 TCGGCTGAAA CCCGccgtCG TCGTGCTTGC CGTGCCGCTC TACCAAAACC 

251 GCCGTAAAAT CTTCAACCAG TGGCTGCCCG TCATCGTTTC GCAGCTTGCG 

3Q1 GGCAGCGTTA cggGCATTGT tacggggATG TATTTTgccg ctcggctcgg 

351 gccggatacc caattctcct tcccgcctcg tctxcaatat ctgtcatcta 

401 caccctctgg aatcccaatt cacaccctgt atgcgcgggt tcccccgcca 

451 tttctgttgc ctccgcctct cctgccgcgc ctcggcccgc atacattgcg 

501 ccggttcaca atacttccaa aaaaactacg gccgtttaag cccctcctcc 

551 cagttgtggt cctttctcct Ccgggcctcg cccctcccct cttataa 

This corresponds to the amino acid sequence <SEQ ID 806; ORF 226.ng>; 

g226 .pep 

1 MSEILRQPSV LLFLTLAVYA LA IIVRTRTG NIFCNP VLVS TIVLIAYLKI 

51 LGIDYAVYHN AAQFIDFRLK PAVWLAVPL YQNRRKI FNQ WLPVIVSQLA 

101 GSVTGIVTGM YFAAWLGPDT QFSFPPRLQY LLFTPSGIPI HTLYARVLPP 

151 FLLPPPLLPR LGPHTLRRFT ILPKKLRPFK P LLPVWLS P PGLAFPLL* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 807>: 

m226 . seq 

1 ATGAACGAAA TCCTCAGGCA GCCCAGCGTT CTGCTTTTCC TCACGCTTGC 

51 CGTGTACGCG CTTGCGATTA TCGtGCGCAC GCGCACGGGC AATATCTTCT 

101 GCAACCCCGT ACTCGTCAGC ACTATCGTGC TGATTGCCTA CCTGAAAATC 

151 CTCGGTATCG ATTATGCGGT GTACCACAAC GCCGCGCAAT TCATTGATTT 

201 TTGGCTGAAA CCCGCCGTCG TCGTGCTTGC CGTGCCGCTC TACCAAAACC 

251 GCCGTAAAAT CTTCAACCAG TGGCTGCCCG TCATCGTTTC ACAGCTTGCG 

3 01 GGCAGCGTTA CGGGCATTGT TACAGGGATG TATTTTGCCA AATGGCTGGG 

3 51 CGCGGAACGC GAAGTCGTCC TCTCGCTCGC GTCCAAATCT GTTACCAACC 

4 01 CCATCGCTAT TGAAATCACC CGCTCCATCG GCGGCATTCC CGCCATTACC 
4 51 GCCGCCACCG TCATCATTGC CGGTCTGGTC GGACAGATTG CCGGTTACAA 
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m2 2 6 . pep MNEI LRQPSVLLFLTLAVYALAI I VRTRTGN I FCNPVLVST I VLIAYLKI LGI DYAVYHN 

I I! I I M It : I I ! I I I II II I I I I I I I I M I I I II I I I I I 11 I I I I II f I II I II I I I I I 
a22 6 MNEILRQPSILLFLTLAVYALAIIVRTRTGNI FCNPVLVSTIVLIAYLKILGI DYAVYHN 

10 20 30 40 50 60 

70 80 90 100 110 120 

m22 6.pep AAQFIDFWLKPAWVLAVPLYQNRRKIFNQWLPVIVSQLAGSVTGIVTGMYFAKWLGAER 
I I I I I I I I I I I I I I I I I II I I I i I I I I I I I I I I I M II I ! I I I II I I I I 1 I t I! I I I I I I 
a22 6 AAQFIDFWLKPAWVLAVPLYQNRRKIFNQWLPVIVSOLAGSVTGIVTGMYFAKWLGAER 

70 80 90 100 110 120 

130 140 150 160 170 180 

m226.pep E WLSLASKSVTNPIAIEITRS I GGIPAITAATV I IAGLVGQI AG YKNLKNTVVM PS SVG 

I I I I I I I I I I i I I i I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I ! I I I 
a22 6 EVVLSLASKSVTNPIAIEITRSIGGIPAITAATVI IAGLVGQIAGYKMLKNTVVMPSSVG 

130 140 150 160 170 180 

190 200 210 220 230 

m22 6 . pep MSLGTASHAMGIAASLER5RRMAAYAGLGLTFNGVLTALIAPLLIPVLGFX 
I 1 I I I I I I I II I II I I II I II I I I I I M I I I I I I I I I I I I I I i I I I II I I I 
a22 6 MSLGTASHAMGIAASLERSRRMAAYAGLGLTFNGVLTALIAPLLIPVLGFX 

190 200 210 220 230 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 81 1>: 

g227 . seq 

1 atgaacatca tccgcgcgct cctcatcatc ctcggctgcc tcgccgccgg 

51 cgaaaccgcc gttttcctag caggcatcaa actgcccggc agcatcgtcg 

101 gcatgggcgt gctgtttgcg cttttgcagg cgggttggct caaaacgtct 

151 tggctgcaac agcttaccga cgcgctgatg gcaaacctga cgctgttcct 

201 cgtgccgccc tgcgtggcgg tcatcagcta tttggatttg attgccgacg 

251 attggttttc gatactggtt tccgcctccg ccagcacttt gtgcgtactg 

301 ctggttacgg gcaaggt^ca c~gcr~7ata cggagcatta tctga 

This corresponds to the amino acid sequence <SEQ ID 812; ORF 227.ng>: 

g227 .pep 

1 MNIIRALLII LGCLAAGETA VFLAGIKLPG SIVGMGVLFA LLQAGW LKT S 

51 WLQQLTDA LM ANLTLFLVPP CVAVI SYLDL IADDW FSILV SASASTLCVL 

101 LVTGKVHRWI RSII* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 813>: 

m227.seq (partial) 

1 . . ACGTCTTkGC TGCAACAGCT TACCGACGCG CTGATGTCGA ACCTGACGCT 

51 GTtCCTCGTG CCgCC.TGCG TGGCGGTCAT CAGCTATTTG GATTTGATTG 

101 CCGACGATTG GTTTTCGATA CTGGTTTCCG CCTCCGCCAG cACTTTGTGC 

151 GTACTGCTGG TTACGGGCAA AGTCCACCGG TGGATACGGG GTATTATCCG 

201 ATGA 

This corresponds to the amino acid sequence <SEQ ID 814; ORF 227>: 

m227.pep (partial) 

1 . . T5XLQQLTD A LMSNLTLFLV PPCVAVI SYL DLIADDW FSI LVSASASTLC 
51 VLLV TGKVHR WIRGIIR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from TV. gonorrhoeae 

ORF 227 shows 95.5% identity over a 66 aa overlap with a predicted ORF (ORF 227.ng) 
from N. gonorrhoeae: 

m227/g227 

10 20 30 

m227 .pep TSXLQQLTDALMSNLTLFLVPPCVAVISYL 

M I I I I I I I I I : I I I I I I I I I I I I I I II I 
g227 TAVFLAGIKLPGSIVGMGVLFALLQAGWLKTSWLQQLTDALMANLTLFLVPPCVAVISYL 
20 30 40 50 60 70 

40 50 60 
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a228 . pep 

I MKKLLIAAMM AAALAA CSQE AKQEVKEAVQ AVESDVKDTA ASAAESAASA 

51 VEEAKDQVKD AAADAKASAE EAVTEAKEAV TEAAKDTLNK AADATQEAAD 

101 KMKDAAK* 

m228/a228 1 00.0% identity in 1 07 aa overlap 

10 20 30 O 50 60 

m228 . pep MKKLLIAAMMAAALAACSQEAKQEVKEAVQAVESDVKDTAASAAESAASAVEEAKDQVKD 
I I I I I I I II I I I I I I! I I f I I I I I I I I I t I I I I I I I I I I I I I I I M ! i I I 1 i I I I I I i M 
a22 8 MKKLLIAAMMAAALAACSQEAKQEVKEAVOAVESDVKDTAASAAESAASAVEEAKDQVKD 

10 20 30 40 50 60 

70 80 90 100 

m2 2 8 . pep AAADAKASAEEAVTEAKEAVTEAAKDTLNKAALATQEAADKMKDAAKX 
I I I M I I I I II I I II I I II I ! 1 I I II I I I I I I I I II II I M I II I I II 
a228 AAADAKASAEEAVTEAKEAVTEAAKDTLNKAADATQEAADKMKDAAKX 

70 80 90 100 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ED 82 1>: 

g229 . seq 

1 atggctgccg tatcgggcgg cggtgcggtc ttcctgataa tgcttccaca 

51 tattgcccgc gttcagcgtc agccgccagc gttcgcccaa gcgtcgggag 

101 aaaccggcat tgaagccgcc ggcgaaattg tatcggctgc cgcccaagag 

151 gttttgcccg acaaacggca cggtgccgaa cgagcgcgtt accgaacggt 

201 tttgatggcc gaacgacagg cgcaggttct gttcgctgaa atctttgtta 

251 ccccaataat gcacgccgcg gctgatgccg ccgtagagga aatgatgccc 

301 gcccgcattg atttcgcgcg acacgcccaa gccgtagcgc aaaccgtgtg 

351 ccttttgcgg caggctgtcg gcggttttcg tccagcttct gcccgcaaat 

401 tcaatcgttt tttcggacga agcgttgttt atagcggatc aacaaaaatc 

4 51 aggacaaggc ggcgggccgc aggcagtacg gatggtacgg aaccggttcg 

501 cccggtgctt ggacgcctta gggaaccgtt ccctttgagc cggggcgggg 

551 caacccgtac cggtttttgt tcatccgcca tattgtgttg a 

This corresponds to the amino acid sequence <SEQ ID 822; ORF 229.ng>: 

g22 9 .pep 

1 MAAVSGGGAV FLIMLPHIAR VQRQPPAFAQ ASGEIGIEAA GEIVSAAA QE 

51 VLPDKRHGAE RARYRTVLMA ERQ AQVLFAE IFVIPIMHAA ADAAVEEMMP 

101 ARIDFARHAQ AVAQTVCLLR QAVGGFRPAS ARKFNRFFGR SWYSGLTKI 

151 RTRRRAAGST DGTEPVRPVL GRLREPFPLS RGGATRTGFC SSAILC* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 823>: 

m229.seq (partial) 

1 . . GCTCAAGCGT TGGGAGAAAT CGGCATTGAA GCCGCCGACG AAATTGTATC 

51 GGCTGCCGCC TAAGAGGTTT TGCTCGACAA ACGGCACGAT GCCGAACGAG 

101 CGCGTTACCG AACGGTTTTT ATAGCCGAAC GACAGGCGCA GGCTCTGTTC 

151 GCTGAAATCT TTGTTATCCC AATAATGCAC GCCGCCGCCG CTGATGCCGC 

2 01 CGTAGAGGAA ATGATGCCTG CCCGCATTGA TTTCGCGCGA CACGCCTAAG 
251 CCCTAGCGCA AACCGTGTGC CTTTTGCGGC AGGCTGTCGG CGGTTTTCGT 

3 01 CCAGCTTCTG CCCGCAAATT CAATCGTTTT TTCGGACGAA GCGTTGTTTA 

3 51 TAGCGGATTA AC AAAAAT C A GGACAAGGCA ACGAAGCCGC AGACAGTACA 

4 01 AATAGTACGG AACCGATTCA CTTGGTGCTT CAGCACcTTA GAGAATCGTT 
4 51 CTCTTTTTTG TTCATCCGCT ATATTGTGTT GA 

This corresponds to the amino acid sequence <SEQ ID 824; ORF 229>: 

m229.pep (partial) 

1 - . AQALGEIGIE AADEIVSAAA XEVLLDKRHD AERARYRTVF IAERQ AQALF 

51 AEIFVIPIMH AAA ADAAVEE MMPARIDFAR HAXALAQTVC LLRQAVGGFR 

101 PASARKFNRF FGRSWYSGL TKIRTRQRSA DSTNSTEPIH LVLQHLRESR 

151 SLFCSSAILC * 

Computer analysis of this amino acid sequence gave the following results: 
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100 110 120 130 140 149 

m22 9. pep RQAVGGFR PASARKFNRFFGRSWYSGLTKIRTRQRSADSTNSTEPIHLVLQHLRES 

I I i i I I I I I I I I I I I I t I I i I I I I I I I I I I I I i : i II I I I : i I I f I I i | I | | | [ j | 
a22 9 RQAVGGFRPASACKFNRFFGRSWYSGLTKTRTRRRSADSTDSTEPIHLVLQHLRESSLX 
130 140 150 160 170 180 

150 160 

m229. pep RSLFCSSAILCX 

i : I I M : I 

a2 2 9 AKARQRRTGFCSSTI 
190 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 827>: 

g230 . seq 

1 atgttccatt ccatcgaaaa atacagaaca cccgcccaag tcttattagg 

51 cctgattgca tcaacttttg ccggcttcgg cgtcagcacg gtttcccatc 

101 cgggcgccga ctacatcgtc caagtgggcg acgaaaaaat cagcgagcac 

151 tcaaccaaca acgccatgca gaacgagcag gcggacggcg gcagcccttg 

201 gcgcgacgcg gtgttccaat ccctgctgca acgcgcctac ctgaaacagg 

251 gcgcgaagct gatgggcatt tcggtttctt ccgaacaaat caagcagatg 

301 attgtggacg atcccaattt ccacgacgca aacggcaaat tcagtcacgc 

351 gcttttgagt caatacctgt cgcaacgcca tatgtctgaa gaccagtttg 

401 tcgaagaaat ccgcgatcag tttgccttgc agaatttggt aagcctcgtc 

451 caaaacggcg tattggtcgg cgacgcgcag gcggaacagc ugatcaggct 

501 gacgcaggtc aaccgcacca tccgttcgca cactttcaac cccgacgagt 

551 ccaccgccca agtcaaagcg tctgaagccg atttgcagaa attttataat 

601 gcgaacaaaa aagactatct gctgccgcag gcggtcaaat tggaatatgt 

651 cgccttgaat ctgaaggatt ttgcagacaa gcagaccgtc agtgaaacgg 

701 aagtgaaaaa tgcgtttgaa gagcgcgtgg cgcgtttgcc ggcacatgaa 

751 gccaaacctc ctttcgagca ggaaaaagcc gccgtcgaaa acgaattgaa 

8 01 aatgaaaaag gcggttgccg acttcaacaa ggcaaaagaa aagctgggcg 

851 acgatgcgtt caatcatccc tcctcgcttg ccgaagccgc caaaaacagc 

901 ggtttgaaag tggaaaccca agaaacttgg ccgagcaggc aggacgcaca 

951 aatgtccggc atgcccgaaa acctaatcaa tgccgtattc agcgacgacg 

1001 tattgaagaa aaaacacaat tccgaagtgc tgaccatcaa cagcgaaacc 

1051 gcgtgggtcg tccgcgccaa agaagtccgc gaagaaaaaa acctactgtt 

1101 tgaagaagcc aaagatgcgg tgcgtcaggc ctatatccgt accgaagccg 

1151 ccaaactttt gaaaacaatg taa 

This corresponds to the amino acid sequence <SEQ ID 828; ORF 230.ng>: 

g230 . pep 

1 MFHSIEKYRT PAQVLLGLIA LTFVGFGVST VS HPGADYIV QVGDEKISEH 

51 SINNAMQNEQ ADGGSPWRDA VFQSLLQRAY LKQGAKLMGI SVSSEQIKQM 

101 IVDDPNFHDA NGKFS HALLS QYLSQRHMSE DQFVEEIRDQ FALQNLVSLV 

151 QNGVLVGDAQ AEQLIRLTQV NRTIRSHTFN PDEFIAQVKA SEADLQKFYN 

201 ANKKDYLLPQ AVKLEYVALN LKDFADKQTV SETEVKNAFE ERVARLPAHE 

2 51 AKPSFEQEKA AVENELKMKK AVADFNKAKE KLGDDAFNHP SSLAEAAKNS 
301 GLKVETQETW LSRQDAQMSG MPENLINAVF SDDVLKKKHN SEVLT INSET 

3 51 AWWRAKEVR EEKNLLFEEA KDAVRQAYIR TEAAKLLKTM * 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 829>: 

m230.seq (partial) 

1 ATGTTCCATT CCATCGAAAA ATACAGAACG CCCGCCCAAG TCCTTTTGGG 

51 CCTGATTGCA TTAACCTTCG TCGGCTTCGG GGTCAGCACG GTATCCCATC 

101 CGGGTGCCGA CTACATCGTC CAAGTGGGCG ACGAAAAAaT CAGCGACCAC 

151 TCCATCAACA ACGCCATACA GAACGAACAG GCGGACGGCG GCGGCCCTTC 

201 GCc.GACGCG GTGTTCCAAT CCCTGCTGCA ACGCGCCTAC CTGAAACAGG 

2 51 GCGCGAAGCT GATGGGCATT TCGGTTTCTT CCGAACAAAT CAAGCAAATT 
301 ATCGTGGACG ATCCCAATTT CCACGACGCA AACGGCAAAT TCGACCACGC 

3 51 GCTTTTAAAC CGCTACCTTT CCCAACGCCA TATGTCTGAA GACCAGTTTG 

4 01 TCGAAGAAAT CCGCGATCAG TTTGCCTTGC AGAATTTGGT AAACCTCGTC 
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310 320 330 340 350 360 

m2 3 0 .pep GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWWRAKEVR 

IIMIIIIIIIMIMIMIIIIIIIIIIIIIIIIMIIIIIIIIIMIIIIIIIMMI 

g23 0 GLKVETQETWLSRODAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWWRAKEVR 
310 320 330 340 350 360 

370 380 
m2 3 0 . pep EEKTLPFAEAKDAVRQAYIRTEAAKL 

Mhl I IIIMIIIIIIIIIIIII 

g230 EEKNLLFEEAKDAVRQAYIRTEAAKLLKTM 
370 380 390 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 83 1>: 

a230.seq (partial) 

ATGTTCCATT CCATCGAAAA 
CCTGATTGCA TTAACCTTCG 
CGGGTGCCGA CTACATCGTC 
TCCATCAACA ACGCCATACA 
GCGCGACGCG GTGTTCCAAT 
GCGCGAAGCT GATGGGCATT 
ATCGTGGACG ATCCCAATTT 
GCTTTTAAAC CGCTACCT1 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 



TCGAAGAAAT CCGCGATCAG 
CAAAACGGCG TATTGGTCGG 
GACGCAGGTC AACCGCACCA 
TCATCGCCCA' AGTCAAAGTG 
GCAAACAAAA AAGACTACCT 
CGCCTTGAAT CTGAAAGACT 
AAGTGAAAAA TGCGTTTGAA 
GCCAAACCTT CTTTCGAGCA 
AATGAAAAAG GCGGTTGCCG 
ATGACGCGTT CAACCATCCT 
GGTTTGAAAG TCGAAACCCA 
AATGTCCGGT ATGCCCGAAA 
TATTGAAGAA AAAACACAAT 
GCGTGGGTCG TCCGCGCCAA 
TGCCGAAGCC AAAGACGCGG 
CCAAACTT 



ATACAGAACG 
TCGGCTTCGG 
CAAGTGGGCG 
GAACGAACAG 
CCCTGCTACA 
TCGGTTTCTT 
CCAr^ACGCA 
CCCAACGTCA 
TTTGCCTTGC 
CGACGCGCAG 
TCCGTTCGCA 
TCTGAAGCCG 
GCTTCCCAAA 
TTGCAGACAA 
GAGCGCGTGG 
GGAAAAAGCC 
ACTTCAATAA 
TCCTCGCTTG 
AGAAACTTGG 
ACCTGATCAA 
TCCGAAGTGC 
AGAAGTCCGC 
TACGTCAGGC 



CCCGCCCAAG 
GGTCAGCACG 
ACGAAAAAAT 
GCGGACGGCG 
ACGCGCCTAC 
CCGAACAAAT 
AACGGCAAAT 
TATGTCTGAA 
' "AATTTGGT 
JC^GAACAGC 
CACTTTCAAC 
ATTTGCAGAA 
GCGGTCAAAT 
ACAGACCGTC 
CGCGTTTGCC 
GCCGTCGAAA 
GGCAAAAGAA 
CCGAAGCCGC 
CTGAGCAGGC 
TGCCGTATTC 
TGACCATCAA 
GAAGAGAAAA 
TTATATCCGT 



TCCTTTTGGG 
GTATCCCATC 
CAGCGACCAC 
GCGGCCCTTC 
CTGAAACAGG 
CAAGCAGATT 
TCGACCACGC 
GACCAGTTTG 
AAACCTCGTC 
TGATCAGGCT 
CCCGACGAAT 
GTTTTATAAC 
TGGAATATGT 
AGCGAAACAG 
GGCAAATGAA 
ACGAATTGAA 
AAGCTGGGCG 
CAAAAACAGC 
AGGATGCGCA 
AGCGACGACG 
CAGCGAAACC 
CCCTGCCGTT 
ACCGAAGCCG 



This corresponds to the amino acid sequence <SEQ ID 832; ORF 230. a>: 



a230.pep (partial) 

1 MFHSIEKYRT PAQVLLGLIA LT FVGFGVST VS HPGADYIV 

51 SINNAIQNEQ ADGGGPSRDA VFQSLLQRAY LKQGAKLMGI 

101 IVDDPNFHDA NGKFDHALLN RYLSQRHMSE DQFVEEIRDQ 

151 QNGVLVGDAQ AEQLIRLTQV NRTIRSHTFN PDEFIAQVKV 

201 ANKKDYLLPK AVKLEYVALN LKDFADKQTV SETEVKNAFE 

2 51 AKPSFEQEKA AVENELKMKK AVADFNKAKE KLGDDAFNHP 

301 GLKVETQETW LSRQDAQMSG MPENLINAVF SDDVLKKKHK 

351 AWVVRAKEVR EEKTLPFAEA KDAVRQAYIR TEAAKL 



QVGDEKISDH 
SVSSEQIKQI 
FALQN LVNLV 
SEADLQKFYN 
ERVARLPANE 
SSLAEAAKNS 
SEVLTINSET 



m230/a230 99.2% identity in 386 aa overlap 

10 20 



30 



40 



50 



60 



m230.pep MFHSIEKYRTPAQVLLGLIALTFVGFGVSTVSHPGADYIVQVGDEK I SDH SINNAIQNEQ 

! I ! i t I I I I M I I I ! I I I I I M I M I 1 I I I ! I I I I t I M I 11 I I I ! I I I I I I I I I I I M ! 
a230 MFHSIEKYRTPAQVLLGLIALTFVGFGVSTVSHPGADYIVQVGDEKISDHSINNAIQNEQ 
10 20 30 40 50 60 

70 80 90 100 110 120 

m2 30.pep ADGGG PS PDAVFQSLLQRAYLKQGAKLMGI SVSSEQIKQI I VDDPNFHDANGKFDHALLN 

I I I I I I I I i I I ! I I I M j I I I I I I I I I I I I I I M i I I I ! I i I I I I I I I I I t I i I I I I I 1 
a230 ADGGGPSRDAVFQSLLQRAYLKQGAKLMG I SVSSEQIKQI I VDDPNFHDANGKFDHALLN 

70 30 90 100 110 120 



130 



140 



150 



160 



170 



180 
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151 QNGVLVGDAQ AEQLIRLTQV NRTI RSHTFN PDEFIAQVKA SEADLQKFYN 

20 i ANKKDYLLPQ AVKLEYVALN LKDFADKQTV SETEVKNAFE ERVARLPAHE 

251 AKPSFEQEKA AVENELKMKK AVADFNKAKE KLGDDAFNKP SSLAEAAKNS 

301 GLKVETQETW LSRQDAQMSG MPENLINAVF SDDVLKKKHN SEVLTINSET 

351 AWVVRAKEVR EEKNLLFEEA KDAVRQAYIR TEAAKLAENK AKEVLTQLNG 

4 01 GKAVDVKWSE VSVLGAQQAR QSMPPEAYAE LLKAKPANGK PAYVRLTGLP 

4 51 APVI VEAQAV TPPEDIAAQL PPAKQALAQQ QSANTFDLLI RYFNGKI KQT 

501 KGAQSVDNGD GQ* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 835>: 

m230-l.seq 

1 ATGTTCCATT CCATCGAAAA ATACAGAACG CCCGCCCAAG TCCTTTTGGG 

51 CCTGATTGCA TTAACCTTCG TCGGCTTCGG GGTCAGCACG GTATCCCATC 

101 CGGGTGCCGA CTACATCGTC CAAGTGGGCG ACGAAAAAAT CAGCGACCAC 

151 TCCATCAACA ACGCCATACA GAACGAACAG GCGGACGGCG GCGGCCCTTC 

201 GCGCGACGCG GTGTTCCAAT CCCTGCTGCA ACGCGCCTAC CTGAAACAGG 

251 GCGCGAAGCT GATGGGCATT TCGGTTTCTT CCGAACAAAT CAAGCAAATT 

301 ATCGTGGACG ATCCCAATTT CCACGACGCA AACGGCAAAT TCGACCACGC 

351 GCTTTTAAAC CGCTACCTTT CCCAACGCCA TATGTCTGAA GACCAGTTTG 

401 TCGAAGAAAT CCGCGATCAG TTTGCCTTGC AGAATTTGGT AAACCTCGTC 

4 51 CAAAACGGCG TATTGGTCGG CGACGCGCAG GCGGAACAGC TGATCAGGCT 

501 GACACAGGTC AACCGCACCA TCCGTTCGCA CACTTTCAAC CCCGACGAGT 

551 TCATCGCCCA AGTCAAAGTG TCTGAAGCCG ATTTGCAGAA ATTTTATAAT 

601 GCGAACAAAA AAGACTATCT GCTGCCGCAG GCGGTCAAAT TGGAATATGT 

651 CGCCTTGAAT CTGAAGGATT TTGCAGACAA GCAGACCGTC AGTGAAACGG 

701 AAGTGAAAAA TGCATTTGAA GAGCGCGTGG CGCGTTTGCC GGCAAATGAA 

7 51 GCCAAACCTT CTTTCGAGCA GGAAAAAGCC GCCGTCGAAA ACGAATTGAA 

B01 AATGAAAAAG GCGGTTGCCG ACTTCAACAA GGCAAAAGAA AAATTGGGCG 

851 ACGATGCGTT CAACCATCCT TCCTCGCTTG CCGAAGCCGC CAAAAACAGC 

901 GGTTTGAAAG TCGAAACCCA AGAAACTTGG CTGAGTAGGC AGGACGCGCA 

951 AATGTCCGGT ATGCCCGAAA ACCTGATCAA TGCCGTATTC AGCGACGACG 

1001 TATTGAAGAA AAAACACAAT TCCGAAGTGC TGACCATCAA CAGCGAAACC 

1051 GCGTGGGTCG TCCGCGCCAA AGAAGTCCGC GAAGAGAAAA CCCTGCCGTT 

1101 TGCCGAAGCC AAAGACGCGG TACGTCAGGC TTATATCCGT ACCGAAGCCG 

1151 CCAAACTTGC CGAAAACAAG GCAAAAGACG TGCTTACCCA ACTGAACGGC 

1201 GGCAAGGCTG TTGACGTGAA ATGGTCGGAA GTGTCCGTTT TGGGCGCACA 

12 51 GCAGGCAAGG CAGTCCATGC CGCCCGAGGC TTATGCGGAA CTGCTGAAAG 
1301 CAAAACCGGC AAACGGCAAA CCCGCCTACG TCAGGCTGAT CGGTCTGCCG 

13 51 GCACCCGTGA TTGTCGAAGT ACAGGCTGTA ACCCCGCCGG ATGATATCGC 

14 01 CGCACAGCTT CCGCTTGCAA AACAGGCTTT GGCGCAACAG CAGTCTGCCA 
14 51 ATACTTTCGA CTTGTTGATA CGTTATTTCA ACGGCAAAAT CAAACAGACC 
1501 AAAGGAGCGC AATCGGTCGA CAACGGCGAC GGTCAGTAA 

This corresponds to the amino acid sequence <SEQ ID 836; ORF 230- 1>: 

m230-l .pep 

1 MFHSIEKYRT PAQVLLGLI A LTFVGFGVST VS HPGADYIV QVGDEKISDH 

51 SINNAIQNEQ ADGGGPSRDA VFQSLLQRAY LKQGAKLMGI SVSSEQIKQI 

101 IVDDPNFHDA NGKFDHALLN RYLSQRHMSE DQFVEEIRDQ FALQNLVNLV 

151 QNGVLVGDAQ AEQLIRLTQV NRTI RSHTFN PDEFIAQVKV SEADLQKFYN 

201 ANKKDYLLPQ AVKLEYVALN LKDFADKQTV SETEVKNAFE ERVARLPANE 

251 AKPSFEQEKA AVENELKMKK AVADFNKAKE KLGDDAFNHP SSLAEAAKNS 

301 GLKVETQETW LSRQDAQMSG MPENLINAVF SDDVLKKKHN SEVLTINSET 

351 AWVVRAKEVR EEKTLPFAEA KDAVRQAYIR TEAAKLAENK AKDVLTQLNG 

401 GKAVDVKWSE VSVLGAQQAR QSMPPEAYAE LLKAKPANGK PAYVRLIGLP 

451 APVIVEVQAV TPPDDIAAQL PLAKQALAQQ QSANTFDLLI RYFNGKIKQT 

501 KGAQSVDNGD GQ* 



m23Q-l/g230-l 96.3% identity in 512 aa overlap 



10 20 30 40 50 60 

m230-l .pep MFHSIEKYRTPAQVLLGLIALTFVGFGVSTVSHPGADYIVQVGDEKI SDHSINNAIQNEQ 
I ! 1 i I I I I 1 I M I E I I i i I I I I I i I I I I I 1 ! I I I ! I I I I I I I t I 1 I I i : i I 1 M I : I i I I 
g2 30-l MFHSIEKYRTPAQVLLGLIALTFVGFGVSTVSHPGADYIVQVGDEKISEHSINNAMQNEQ 

10 20 30 40 50 60 

70 80 90 100 110 120 

m230-l .pep ADGGGPSRDAVFQSLLQRAYLKQGAKLMGI SVSSEQIKQI I VDDPNFHDANGKFDHALLN 
I I I I : I I ! I I I I I ! f I I t I I M I I I 1 ! I I I i I I I I I I j : I M I I I I I t I i I I t : I I I I : 
g2 3 0 - 1 ADGGS PWRDAVFQSLLQRAYLKQGAKLMGI SVSSEQI KQMI VDDPNFHDANGKFSHALLS 

70 80 90 100 110 120 



m2 30-l .pep 



130 140 150 i60 170 ISO 

RYLSQRHMSEDQFVEEIRDQFALQNLVNLVQNGVLVGDAQAEQLI RLTQVNRTI RSHTFN 
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This corresponds to the amino acid sequence <SEQ ID 838; ORF 230-La>: 

a230-l -pep 

1 MFHS IEKYRT PAQVLLGLIA LTFVGFGVST V5 HPGADYI V QVGDEKISDH 

51 SINNAIQNEQ ADGGGPSRDA VFQSLLQRAY LKQGAKLMGI SVSSEQIKQI 

101 IVDDPNFHDA NGKFDHALLN RYLSQRHMSE DQFVEEIRDQ FALQNLVNLV 

151 ONGVLVGDAQ AEQLIRLTQV NRTIRSHTFN PDEFIAQVKV SEADLQKFYN 

201 ANKKDYLLPK AVKLEYVALN LKDFADKQTV SETEVKNAFE ERVARLPANE 

251 AKPSFEQEKA AVENELKMKK AVADFNKAKE KLGDDAFNHP SSLAEAAKNS 

301 GLKVETQETW LSRQDAQMSG MPENLINAVF SDDVLKKKHN SEVLTINSET 

351 AWVVRAKEVR EEKTLPFAEA KDAVRQAYIR TEAAKLAENK AKDVLTQLNG 

4 01 GKAVDVKWSE VSVLGAQQAR QSMPPEAYAE LLKAKPANGK PAYVRLIGLP 

451 APVIVEVQAV TPPDDIAAQL PLAKQALAOO QSANTFDLLI RYFNGKIKQT 
501 KGAQSVDNGD GQ* 



a230-l/m230-l 99.8% identity in 512 aa overlap 

10 20 30 40 50 60 

a230-l.pep MFHS I EKYRTPAQVLLGL I ALTFVGFGVSTVSHPGADY I VQVGDEKI SDH SINNAIQNEQ 
I I I I I I I I t I 11 i I I I I I I I I i I I t I I I 1 I I I I I I I I I i I I ! I t I t I I I I I I i I t I I I I I 
m230-l MFHS I EKYRTPAQVLLGL I ALTFVGFGVSTVSHPGADY I VQVGDEKI SDHSINNAIQNEQ 

10 20 30 40 50 60 

70 80 90 100 110 120 

a230-l . pep ADGGGPSRDAVFQSLLQRAYLKQGAKLMGISVSSEQIKQIIVDDPNFHDANGKFDHALLN 

i i 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 i 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

m230-l ADGGGPSRDAVFQSLLQRAYLKQGAKLMGISVSSEQIKQIIVDDPNFHDANGKFDHALLN 

70 80 90 100 110 120 

130 140 150 160 170 180 

a2 30-1 .pep RYLSQRHMSEDQFVEEIRDQFALQNLVNLVQNGVLVGDAQAEQLIRLTQVNRTIRSHTFN 
I I I I I I I M I I I I I ! i I I I 1 I I I M I I I I I I I i I I I 1 I I H I I i 1 I I I I I ! I ! I I ! I 1 1 I 
m230-l RYLSQRHMSEDQFVEEIRDQFALQNLVNLVQNGVLVGDAQAEQLIRLTQVNRTIRSHTFN 

130 140 150 160 170 180 

190 200 210 220 230 240 

a2 30- 1 . pep PDEFI AQVKVSEADLQKFYNANKKDYLLPKAVKLEYVALNLKDFADKQTVSETEVKNAFE 
I | | | | | I I | I I i I I I I I t M I I II I 1 I I I : I I I I I I I I I I i i I i I i 1 I I t 1 I I I I ! I I II 
m230-l PDEFI AQVKVSEADLQKFYNANKKDYLLPQAVKLEYVALNLKDFADKQTVSETEVKNAFE 

190 200 210 220 230 240 

250 260 270 280 290 300 

a2 30-1 .pep ERVARLPANEAKPSFEQEKAAVENELKMKKAVADFNKAKEKLGDDAFNHPSSLAEAAKNS 

I i | | | I I I I I I I I I I I I i i I I I I I I I I I t I I I I I I M I I I I I 1 1 I i i I 1 1 1 1 1 1 i 1 1 I 1 1 
m230-l ERVARLPANEAKPSFEQEKAAVENELKMKKAVADFNKAKEKLGDDAFNHPSSLAEAAKNS 

250 260 270 280 290 300 

310 320 330 340 350 360 

a230-l . pep GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWVVRAKEVR 
I I I I I I i I I I I I I I I I I I i I I I I I I I I I I I I I t M I I i I I I i i I I I I I i I I I I I t I I I I I 
m230-l GLKVETQETWLSRQDAQMSGMPENLINAVFSDDVLKKKHNSEVLTINSETAWVVRAKEVR 

310 320 330 340 350 360 

370 380 390 400 410 420 

a2 30-1. pep EEKTLPFAEAKDAVRQAYIRTEAAKLAENKAKDVLTQLNGGKAVDVKWSEVSVLGAQQAR 
I I I I M I I I I t I I I 1 I i I ! i I I t I I i I i S I I E 1 I 1 I I II ! 1 I I I I 1 I I I Si I I I I I I I I I 
m2 30-l EEKTLPFAEAKDAVRQAYIRTEAAKLAENKAKDVLTQLNGGKAVDVKWSEVSVLGAQQAR 

370 380 390 400 410 420 

430 440 450 460 470 480 

a2 30- 1 . pep QSMPPEAYAELLKAKPANGKPAYVRLIGLPAPVI VEVQAVTPPDDI AAQLPLAKQALAQQ 
| I I I 1 i I I I I I 1 E I I I t I I I i i I I I M II I I I I I I 1 I I I I I I I M I I I I I i I I I I M 1 M 
m230-l QSMPPEAYAELLKAKPANGKPAYVRL I GLPAPVI VEVQAVTPPDDI AAQLPLAKQALAQQ 

430 440 450 4 60 470 480 

490 500 510 

a2 30-1. pep QSANTFDLLI RYFNGKIKQTKGAQSVDNGDGQX 
I I I I I I I I I I I I I I t I I I I I ! I I I I t I I I I I I I 
m230- 1 QSANTFDLLI RYFNGKIKQTKGAQSVDNGDGQX 

490 500 51" 
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a231 . seq 


(partial ) 










1 


ATGTCAAAAC 


GAAAATCCAT 


AAACCGTCCG 


TATCAAAAAC 


CGGCGGAACT 


51 


GCCGCCGTTG 


CAA.-.ATAATC 


CGCCATTTTA 


CCGTAAAAAC 


CGCCGCCTGA 


101 


ACTTTTTTAT 


CGNGGCAGAC 


GGCGGTTGCG 


CGTCTCCGCA 


AAAATGCAGG 


151 


GCGCGCGGTT 


TTCAGACGGC 


ATTTGCCGTT 


CAAAGCCGTG 


CGGTGTCTTT 


201 


ACCAAATGCC 


CAACCATTCG 


CCCACGGCAT 


CCATCCAATC 


CTTATTGCCC 


251 


CCGCCGCTCC 


TGCCTGCCCG 


GCGGTACGCC 


CACGGCGCTT 


GCGGATTTTT 


301 


AGC^TTCCAC 


AATCCTTTGC 


GTTCCCTTTC 


CGCCTGAATT 


TGAGCGTCGG 


351 


CATAATCGGC 


AAAATCCGCC 


TTATCCTGCT 


GTTCTTTAGC 


ATAACTTTTA 


401 


TAATGCCACG 


CCCICCCCGTC 




r\ 1 \^ri\j\3 1 1 


/vrt ituuJ X 1 1 


451 


GCCGACAGAA 


ACCTGCGCCA 


CTTCGCGCTG 


GTAGCGGTCG 


GTGTCGAACA 


501 


CGCGGACGCT 


GACTTTCCTG 


CCTTCCGCCG 


CCGCGCGCAG 


GTTGTCGCGC 


551 


GAACGCGTGC 


CGTAAGCCTG 


TTTCATCTCC 


GGCGCGTCGA 


TATACGCCAT 


601 


CCGGATTTTG 


TGTTTCGCGC 


CGTCGCCGTC 


GATAACGTGA 


AGGGTGTCGC 


651 


CGTCATAGAC 


TTTGGACACC 


GTGCCTGTGT 


AGCGGTGGCC 


GGATTTCGCC 


701 


GATGCTCGGC 


GGCGGGCGGG 


CGCGTCGGAA 


CCCGCGTCCC 


CTGCCGCGCC 


751 


GAGTACGTCG 


AGTACGGCAA 


CCGCCGTCCG 


CACCGCCTCG 


CTGCCGTACC 


801 


CCGTATAACC 


CAACGCACCC 


AAAAGCGACA 


AGGCGACGGG 


AAGCCATTTC 


851 


ATGATTTTTT 


TAATCTGCAT 


ATTTTTCAAA 


TGCCGATGCC 


GTCTGAACAT 


901 


ATC 











This corresponds to the amino acid sequence <SEQ ID 844; ORF 217.a>: 

a231.pep (partial) 

1 MSKRKSINRP YQKPAELPPL QNNPPFYRKN RRLNFFIXAD GGCASPQKCR 

51 ARGFQTAFAV QSRAVSLPNA QPFAHGIHPI LIAPAAPACP AVRPRRLRIF 

101 SFPQSFAFPF RLNLSVG IIG KIRLILLFF5 ITFI MPRRPV LHLHQVQIGF 

151 ADRNLRHFAL VAVGVEHADA DFPAFRRRAQ VVARTRAVSL FHLRRVDIRH 

201 PDFVFRAVAV DNVKGVAVID FGHRACVAVA GFRRCSAAGG RVGTRVPCRA 

251 EYVEYGNRRP HRLAAVPRIT QRTQKRQGDG KPFHDFFNLH I FQMPMPSEH 
301 I 

m231/a231 98.6% identity in 73 aa overlap 

10 20 30 40 50 60 

m231 . pep MSKRKSINRPYQKPAELPPLQNNPPFYRKNRRLNFFIAADGGCASPQKCRARG FQTAFAV 

! I I M i I II I I I i I I I I I I I I I I I I I I I I i I I II I i i I i I I i I I 11 I I I 1 I I I I I I I I I 
a231 MSKRKSINRPYQKPAELPPLQNNPPFYRKNRRLNFFIXADGGCASPQKCRARGFQTAFAV 

10 20 30 40 50 60 

70 

ra2 31.pep QSRAVSLPNAQPFG 
I I I I I I I I I I I I I : 

a2 31 QSRAVSLPNAQPFAHGIHPILIAPAAPACPAVRPRRLRIFSFPQSFAFPFRLNLSVGI IG 

70 80 90 100 110 120 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 845>: 

g231-l . seq 

1 ATGTCAAAAC GAAAATCCAT AAACCGTCCG TATCAAAAAC CGGCGGAACT 

51 GCCGCCGTTG CAAAATAATC CGCCATTTTA CCGTAAAAAC CGCCGcCTGA 

101 ACTTTTTTAT CGCGGCAGAC GGCGGTTGCG CGTCTCCGCA AAAATGCAGG 

151 GCGCGCGGTT TTCAGACGGC ATTTGCCGTT CAAGGCCGTG CGGTGTCTTT 

201 ACCAAATGCC CAACCATTCG CCCACGGAAT CCATCCAATC CTTATTGCCC 

251 CCGCCGCTCC TGCCTGCCCG GCGGTACGCC CACGGCGCTT GCGGATTTTT 

301 AGCTTTCCAC AATCCTTTGC GTTCCCTTTC CGCCTGAATT TGAGCGTCGG 

351 CATAGTCGGC AAAATCCGCC TTATCCTGCT GTTCTTTAGC ATAACTTTTA 

401 TAATGCCACG CCGCCCCGTC CTGCACCTGC ATCAGGTTCA AATCGGTTTT 

451 GCCGGCGGAT ACCTGCGCCA CTTCGCGCTG ATAGCGGTCG GTTTCAAACa 

501 CaCgTaCaat gagtttcgtA ccctccGCCG ccgcgcgCAG GTTGtcgcGC 

551 GAACgTGTAC CGTAagcgtg TTtcatctcc GGTGCgtcGA TATACGCCaT 

601 cCgAATTTta tGTttcgcgc cgtcgcCgtc gATGACGTGA AGGGtatcGC 

651 CgtcATAGAC TTTGGACACC Gtgcctgcgt AGctGTGGCC GGATttcgc 

This corresponds to the amino acid sequence <SEQ ID 846; ORF 231-l.ng>: 

g231-l .pep 

1 MSKRKSINRP YQKPAELPPL QNNPPFYRKN RRLNFFIAAD GGCASPQKCR 

51 ARGFQTAFAV QGRAVSLPNA QPFAHGIHPI LIAPAAPACP AVRPRRLRIF 
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151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 



GCGCGCGGTT 
ACCAAATGCC 
CCGCCGCTCC 
AGCTTTCCAC 
CATAATCGGC 
TAATGCCACG 
GCCGACAGAA 
CGCGGACGCT 
GAACGCGTGC 
CCGGATTTTG 
CGTCATAGAC 
GATGCTCGGC 
GAGTACGTCG 
CCGTATAACC 
ATGATTTTTT 
ATCGGAATCG 



TTCAGACGGC 
CAACCATTCG 
TGCCTGCCCG 
AATCCTTTGC 
AAAATCCGCC 
CCGCCCCGTC 
ACCTGCGCCA 
GACTTTCCTG 
CGTAAGCCTG 
TGTTTCGCGC 
TTTGGACACC 
GGCGGGCGGG 
AGTACGGCAA 
CAACGCACCC 
TAATCTGCAT 
GATTTCAGAC 



ATTTGCCGTT 
CCCACGGCAT 
GCGGTACGCC 
GTTCCCTTTC 
TTATCCTGCT 
CTGCACCTGC 
CTTCGCGCTG 
CCTTCCGCCG 
TTTCATCTCC 
CGTCGCCGTC 
GTGCCTGTGT 
CGCGTCGGAA 
CCGCCGTCCG 
AAAAGCGACA 
ATTTTTCAAA 
GGCATCTTAA 



CAAAGCCGTG 
CCATCCAATC 
CACGGCGCTT 
CGCCTGAATT 
GTTCTTTAGC 
ATCAGGTTCA 
GTAGCGGTCG 
CCGCGCGCAG 
GGCGCGTCGA 
GATAACGTGA 
AGCGGTGGCC 
CCCGCGTCCC 
CACCGCCTCG 
AGGCGACGGG 
TGCCGATGCC 



CGGTGTCTTT 
CTTATTGCCC 
GCGGATTTTT 
TGAGCGTCGG 
ATAACTTTTA 
AATCGGTTTT 
GTGTCGAACA 
GTTGTCGCGC 
TATACGCCAT 
AGGGTGTCGC 
GGATTTCGCC 
CTGCCGCGCC 
CTGCCGTACC 
AAGCCATTTC 
GTCTGAACAT 



This corresponds to the amino acid sequence <SEQ ID 850; ORF 23 1-1 .a>: 



a231-l .pep 

1 MSKRKSINRP 
51 ARGFQTAFAV 
SFPQS FAFPF 



101 
151 
201 
251 
301 



ADRNLRHFAL 
PDFVFRAVAV 
EYVEYGNRRP 
IGIGFQTAS * 



YQKPAELPPL 
OSRAVSLPNA 
RLNLSVG IIG 
VAVGVEHADA 
DNVKGVAVID 
HRLAAVPRIT 



QNNPPFYRKN 
QPFAHGIHPI 
KIRLILLFFS 



DFPAFRRRAQ 
FGHRACVAVA 
QRTOKRQGDG 



RRLNFFIAAD 
LIAPAAPACP 
ITFI MPRRPV 
VVARTRAVf - 
GFRRCSAAGG 
KPFHDFFNLH 



GGCASPQKCR 
AVRPRRLRIF 
LHLHQVQIGF 
FHLRRVDIRH 
RVGTRVPCRA 
IFQMPMPSEH 



a231-l/m231-l 99.0% identity in 309 aa overlap 



10 20 30 40 50 60 

a231-l - pep MSKRKSINRPYQKPAELPPLQNNPPFYRKNRRLNFFIAADGGCASPQKCRARGFQTAFAV 
! I I I M I I I I I I I I I I I I I I I t I I I 1 I I I I I I I I I I t I I I I I I 1 I i I I l I - i I I I I I I I I 
m231-l MSKRKSINRPYQKPAELPPLQNNPPFYRKNRRLNFFIAADGGCASPQKCRARGFQTAFAV 
10 20 30 40 50 60 



70 80 90 100 110 120 

a231-l .pep QSRAVSLPNAQPFAHGIHPILIAPAAPACPAVRPRRLRI FSFPQSFAFPFRLNLSVGI IG 
I I I I I I I I I I I t I I I I ! t I ! I I I I [ I I I I II I I I I I II t i I I I I I I I M I I t I I I I ! I I 
m2 31-l QSRAVSLPNAQPFAHGIHPILIAPAAPACSAVRPRRLRI FSFPQSFAFPFRLNLSVGI IG 

70 80 90 100 110 120 



130 140 150 160 170 180 

a2 31-l .pep KIRLILLFFSITFIMPRRPVLHLHQVQIGFADRNLRHFALVAVGVEHADADFPAFRRRAQ 
I M I I I ! I I I I I I I I I I 1 M I I I I I I I I I I I I I I I I I M I I I I I : I ; I I I I I I II I I I I 
m2 31-l KIRLILLFFSITFIMPRRPVLHLHQVQIGFADRNLRHFALVAVGIEHAHADFPAFRRRAO 

130 140 150 160 170 180 



190 200 210 220 230 240 

a2 3 1-1. pep WARTRAVSLFHLRRVDIRHPDFVFRAVAVDNVKGVAVIDFGHRACVAVAGFRRCSAAGG 
I I I I I I I I I I I I I I I I I I I I I 1 ! I I I I I I I i I I M I I I I I I I M I I I I I I I I I I I I I I M 
m231-l VVARTRAVSLFHLRRVDIRHPDFVFRAVAVDNVKGVAVI DFGHRACVAVAGFRRCSAAGG 

190 200 210 220 230 240 



250 260 270 280 290 300 

a231-l .pep RVGTRVPCRAEYVEYGNRRPHRLAAVPRITQRTQKRQGDGKPFHDFFNLHI FQMPMPSEH 
! I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I ! I I I I I I SI I 1 I ! i I I I I I I I i I I II 
m2 3 1 - 1 RVGTRVPCRAEYVEYGNRRPHRLAAVPRITORTOKRQGDGKPFHDFFNLHI FQMPMPSEH 

250 260 270 280 290 300 



310 

a231-l.pep IGIGFQTASX 
I I I I I I I I I I 
m231-l IGIGFQTASX 
310 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 85 1>: 

g232 . seq 

1 atgatgggca acagcctgat tgaatccggt acgtttgtcg ccatcctgtt 

51 tggtcagatt ttgggaacgg cggttgccgg cgcgccgcct tatattgtcg 

101 ggatactggt tttgctggtc gccgtcggag gaacggccgg cagccngttt 
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10 20 30 40 50 60 

70 80 90 100 110 120 

TOIEWNIVRGTKSLLRETVRHKPVFTAIIGISWFWFVGAVYTTQLPTFTQIHLGGNDNVF 

IIIIIMIIMMIIMIIIhlllllllMMIMWIIMIMIIIIIIIIIMIIII 

TOIEWNIVRGTKSLLRETVRHNPVFTAIIGISWFWFVGAVYTTQLPTFTQIHLGGNDNVF 
70 80 90 100 110 120 

130 140 150 160 170 180 

NLMLALFSIGIAAGSVLCAKFSXERIJ^^WVTVGALGLTVCGLVLVWLTHGHRFEGLNGI 

IMIillllllllllimih IIIIIIMIIIIII MIIIMIMIMIIIIMIII 

NLMIALFSIGIAAGSVLCAKFGRERIJ^LAWVTVGALGSTVCGLVLVWLTHGHRFEGLNGI 
130 140 150 160 170 180 

190 200 210 220 230 

FXFLSQGWAYPVMAVMTLIGFFGGFFSVPLYT - VQTAI ARFPRP - AVAANNI VNGI FMVS 

I [MIIIIIIIMIIIIIIMMIIIIIIII : I IIIIMIMIMIII 

FWFLSOGWAYPVMA T ":TLIGFFGGFFSVPLYTWLQTASSETFRARAVAANNIVNGIFMVS 
190 200 210 220 230 240 

240 250 260 270 280 289 

AAVLSAVLLFLFDSISLLYLIVALGNI PLSVFLI KRERRFLGAAAIRKKPX 

IIMIIIIllllllllllMllllllllhllMIIIMIllllllllll 

AAVLSAVLLFLFDSISLLYLIVALGNIPLAVFLIKRERRFLGAAAIRKKP 
250 260 270 280 290 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 855>: 

a232 . seq 

1 ATGTACGCTA AAAAAGGCGG TTTGGGACTG GTTAAAAGCC GCCGTTTCGC 

51 ACCTCTTTTC GCTACGCAGT TTCTCGGCGC GTTCAACGAC AATGTGTTCA 

101 AAACCGCGCT GTTTGTGATG ATTGGGTTTT ACGGTTTGGG GCAAAACGGC 

151 TTCCTGCCTG CCGGACAGAT GTTGAACTTG GGCGCGTTGC TGTTTATTTT 

201 GCCGTATTTC CTG1"_TTC~T r jC TGTCGGG GCAGTTGGGT AACAAAT7CG 

251 ACAAGGCCGT TTTGGCGCGT TGGGCCAAGG TGCTGGAAAT GATCATTATG 

301 GCGGTGGCGG CATACGGGTT TTATATCCGG TCTGCCCCGC TGCTTTTGGC 

351 GTGTCTGTTT TGCATGGGCG CGCAATCGAC GCTGTTCGGG CCGCTGAAAT 

4 01 ACGCCATCCT GCCCGATTAT CTCGACGACA AAGAGTTGAT GATGGGCAAC 

4 51 AGCCTGATTG AATCGGGTAC GTTTGTCGCC ATCCTGTTCG GTCAGATACT 

501 GGGGACTGCG GTGGCAGGTG TACCGCCTTA TATTGTCGGG ATACTGGTTT 

551 TGCTGGTCGC CGTAGGAGGC ACGGTCGGCA GCCTGTTTAT GCCGTCCGTA 

601 CCCGCCAAGG CTGCCGATAC ACAAATTGAG TGGAATATTG TCCGGGGTAC 

651 AAAATCCCTG CTGCGTGAAA CGGTGCGGCA CAAGCCCGTT TTTACCGCCA 

701 TTATCGGTAT TTCGTGGTTT TGGTTTGTCG GCGCGGTTTA TACCACGCAA 

7 51 CTGCCGACCT TTACCCAAAT CCATCTAGGC GGCAACGACA ATGTTTTCAA 

801 CCTGATGCTT GCCCTGTTTT CCATCGGTAT TGCCGCCGGT TCGGTACTGT 

851 GTGCCAAGTT CAGCAGGGAA CGGCTGAGGT TGGCTTGGGT AACGGTTGGT 

901 GCGTTGGGTT TGACGGTTTG CGGCTTGGTT TTGGTGTGGC TGACGCACGG 

951 ACACCGTTTT GAAGGGCTGA ACGGCATTTT TTGGTTTTTA TCGCAAGGAT 

1001 GGGCATATCC CGTGATGGCG GTGATGACGC TGATCGGCTT TTTCGGCGGA 

1051 TTTTTCTCCG TTCCGCTCTA TACCTGGCTG CAAACCGCCA GTAGCGAGAC 

1101 TTTCCGCGCC CGCGCCGTTG CCGCCAACAA TATCGTTAAC GGTATTTTTA 

1151 TGGTTTCCGC TGCCGTTTTG AGCGCGGTGT TGCTGTTTTT GTTTGACAGC 

12 01 ATTTCCTTGT TGTATCTGAT TGTCGCTTTG GGCAATATTC CGTTGTCGGT 

1251 ATTTTTGATT AAGCGCGAAA GGCGGTTTTT AGGCGCGGCG GCAATCAGGA 

1301 AAAAACCTTG A 

This corresponds to the amino acid sequence <SEQ ID 856; ORF 232. a>: 

a232.pep 

1 MYAKKGGLGL VKSRRFAPLF ATQFLGAFND NVFKTALFVM IGFYGLGQNG 

51 FLPAGQ MLNL GALLFILPYF LFS SLSGQLG NKFDK AVLAR WAKVLEMIIM 

101 AVAAYGFYIR SAP LLLACLF CMGAQSTLFG PLKYAILPDY LDDKELMMGN 

151 SLIESGT FVA ILFGQILGTA VAGV PPYIVG ILVLLVAVGG TVGSLFM PSV 

201 PAKAADTQIE WNIVRGTKSL LRETVRHKP V FTAIIGISWF WFVGAV YTTQ 

251 LPTFTQIHLG GNDNVFNLML ALFSIGIAAG SVLCAKFSRE RLRLAWVTVG 



m2 32 . pep 
g232 

m2 32 . pep 
g232 

m2 32 . pep 
g232 

m23 2 . pep 
g232 
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151 QISATVDRSG LWQAQTPQLF QAGLLHRALA AENLGGITDE AS AVE KLGV R 
201 PLLIQGDARN LKLTQPQDAY IVRLLLNAV* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 859>: 

m233.seq (partial) 

1 ATGAAGCGCA AAAATATCGC GCTGATTCCC GCCGCCGGCA TCGGGGCGCG 

51 TTTCGGTGCG GACAAACCCA AGCAATATGT CGAAATCGGA AGCAAAACCG 

101 T TTTAGAACA TACGATTGGG ATTTTTGAAC GGCATGAGGC CGTCGATTTG 

151 ACCGTCGTTG TCGTCTCGCC CGAAGACACG TTTGCCGATA AGGTTCAGAC 

201 GGCATTTCCA CAGGTTCGGG TGTGGAAAAA CGGCGGACAG ACCCGCGCCG 

251 AAACCGTCCG CAACGGTGTG GCAAAACTGT TGGAAACCGG TTTGGCGGCG 

3 01 GAAACCGACA ATATTCTGGT ACACGATGCC GCGCGTTGCT GCCTGCCGTC 

3 51 TGAAGCTTTG ACGCGGTTGA TAGAACAGGC GGGCAACGCC GCCGAAGGCG 

4 01 GGATTTTGGC AATTCCCATT GCCGATACGC TCAAGTGCGC GGACGGTGGG 
451 AACATT. . . . 

This corresponds to the amino acid sequence <SEQ ID 860; ORF 233>: 

m233.pep (partial) 

1 MKRKNIALIP AAGI 3ARFG A DKPKQYVEIG SKTVLEHTIG I FERHEAVDL 
51 TWWSPEDT FADKVQTAFP QVRVWKNGGQ TRAETVRNGV AKLLETGLAA 
101 ETDNILVHDA ARCCLPSEAL TRLIEQAGNA AEGGILAIPI ADTLKCADGG 
151 NI . . . . 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 233 shows 93.4% identity over a 152 aa overlap with a predicted ORF (ORF 233.ng) 
from N. gonorrhoeae: 

m233/g233 



10 20 30 40 50 60 

m23 3 .pep MKRKKIALI PAAGIGARFGADKPKQYVEIGSKTVLEHTIGI FERHEAVDLTWWSPEDT 

IIMMIMIIIII|:|I|||||||||||||||||||::||||||||M||||||||M| 
9 2 3 3 MKRKN I AL I PAAG I GVRFGADKP KQYVE I GS KTVL EHVLG I FE RHE AVDLTWWS PEDT 

10 20 30 40 50 60 



70 80 90 100 110 120 

m23 3 .pep FADKVQTAFPQWWKNGGQTRAETVRNGVAKLLETGIAAETDNIL^DAARCCLPSEAL 

IIIMIIMIIII1IIIIIIMIIIMII1IIIIIIIIIIIMMMIIIIII1IIIIII 

9233 FADKVQTAFPOVRWKNGGQTRAETVRNGVAKLLETGLAAETDNILVHDAARCCLPSEAL 
70 80 90 100 110 120 



130 140 150 

m233 .pep TRL I EQAGNAAEGG I LAI P I ADTLKCADGGN I 

:||IMIIIIIIIIIIh|:||||| |::|:| 
923 3 ARLIEQAGNAAEGGILAVPVADTLKRAESGQISATVDRSGLWQAQTPQLFQAGLLHRALA 
130 140 150 160 170 180 



The following partial DNA sequence was identified in N. meningitidis <SEQ ID 86 1>: 

a233. seq 

1 ATGAAGCGCA AAAATATCGC GCTGATTCCC GCCGCCGGCA TCGGGGCGCG 

51 TTTCGGTGCG GACAAACCCA AGCAATATGT CGAAATCGGA AGCAAAACCG 

101 TTTTAGAACA TACGATTGGG ATTTTTGAAC GGCATGAGGC CGTCGATTTG 

151 ACCGTCGTTG TCGTCTCGCC CGAAGACACG TTTGCCGATA AGGTTCAGAC 

201 GGCATTTCCA CAGGTTCGGG TGTGGAAAAA CGGCGGACAG ACCCGCGCCG 

251 AAACTGTCCG CAACGGTGTG GCAAAATTGT TGGAAACCGG TTTGGCGGCG 

301 GAAACCGACA ATATTCTGGT ACACGATGCC GCGCGTTGCT GCCTGCCGTC 

351 TGAAGCTTTG ACGCGGTTGA TAGAACAGGC GGGCAACGCT GCCGAAGGTG 

4 01 GGATTTTGGC AATTCCCGTT GCCGATACGC TCAAGTGCGC GGACGGTGGG 

4 51 AACATTAGTG CAACCGTCGA GCGGACGAGC CTTTGGCAGG CGCAAACGCC 

501 GCAGCTTTTC CGCGCCGGGC TGCTGCACCG CGCATTGGCT GCGGAAAACT 

551 TGGACGGCAT TACCGATGAA GCGTCCGCCG TGGAAAAATT GGGCATCCGC 

601 CCTTTGCTGG TGCAGGGCGA CGCGCGCAAT TTGAAACTGA CGCAGCCGCA 

651 GGACGCATAC ATCGTCAGGC TGCTGCTCGA TGCCGTCTGA 
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m234.pep (partial) 

1 . . GAGEYALSNR El IGFGGTSG YDATLNGKVL DLAIREAVNS LVQAVDNGAW 
51 QPNR* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 234 shows 94.4% identity over a 54 aa overlap with a predicted ORF (ORF 234.ng) 
from N. gonorrhoeae: 

m234/g234 

10 20 30 

m23 4 . pep G AGS YALSNRE I IGFGGTSG YDATLNGKVL 

MIIIMIIMIMIIIIIMIIIIMIII 

g234 LGRGKSQIAYAKVALNIVNVNTSEIVYSTOGAGEYALSNREIIGFGGTSGYDATLNGKVL 
140 150 160 170 180 190 

40 50 
m2 3 4 . pep D LA I RE A VNS L VQ A VDNG AWQ PNRX 

llllllli-llllillllli II! 
g 2 3 4 DLA1 RE AVDNLVQAVDNGAWQSNRX 

200 210 220 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 867>: 

a234. seq (partial) 

1 AACCGCACCT ATTTGAACGC ATTAAAACAG GAATCCGGCA TTTCCGGCAA 

51 AGCGCATAAC CTGAAAGGCG CAAATTATGT CGNNACCGGC GATGTAACCG 

101 AATTCGGACG CANAGATGTC GGCGATCATC AGCTCTTCGG CATTTTGGGT 

i51 CGCGGCAAAT CGCAAATCGC CTATGCAAAA GTGGCTCTGA ATATCGTCAA 

201 CGTCAATACT TCCGAAATCG TCTATTCCGC ACAGGGCGCG GGCGAATACG 

2 51 CACTTTCCAA CCGTGAAATC ATCGGTTTCG GCGGCACTTC CGGCTACGAT 

301 GCGACTTTGA ACGGCAAAGT TTTAGACTTG GCAATCCGCG AAGCCGTCAA 

351 CAGCCTGGTT CAGGCTGTTG ACAACGGCGC ATGGCAACCC AACCGTTAA 

This corresponds to the amino acid sequence <SEQ ID 868; ORF 234. a>: 

a234.pep (partial) 

1 NRTYLNALKQ ESGISGKAHN LKGANYVXTG DVTEFGRXDV GDHQLFGILG 

51 RGKSQIAYAK VALNI VNVNT SEIVYSAQGA GEYALSNREI IGFGGTSGYD 

101 ATLNGKVLDL AIREAVNSLV QA VDNG AWQ P NR* 



10 



m234/a234 1 00.0% identity in 54 aa overlap 

30 

m234 . pep 

GAGEYALSNREI IGFGGTSGYDATLNGKVL 

I I I I! I II I I 1 1 I I I I M M I I II M M M 
a234 

LGRGKSQIAYAKVALNIVNVNTSEIVYSAQGAGEYALSNREI IGFGGTSGYDATLNGKVL 

50 60 70 80 90 100 

40 50 
rn2 34 . pep DLAIREAVNSLVQAVDNGAWQPNRX 
I I II t I I I I I I I I I I I I I II I I I M 
a2 3 4 DLAI REAVNSLVQAVDNGAWQPNRX 

110 120 130 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 869>: 

g23 5 . seq 
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YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAWNQIANSLT 

IMIIIIIIIIIIIIIIIIIIIIIIIMIIIIMIIIIIMIMIIIhlllllllllll 

YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVGAWNQIANSLT 
130 140 150 160 170 180 

190 200 210 

DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPKX 



190 200 210 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 873>: 

a235 . seq 

1 ATGAAACCTT TGATTTTAGG GCTTGCCGCC GTGTTGGCGC TGTCTGCCTG 

51 CCAAGTTCAA AAAGCGCCCG ATTTCGACTA CACGTCATTC AAGGAAAGCA 

101 AACCGGCTTC AATTTTGGTG GTTCCGCCGC TGAACGAATC GCCCGATGTC 

151 AACGGAACAT GGGGTGTACT GGCTTCGACC GCCGCGCCGC TTTCCGAAGC 

201 CGGCTATTAC GTCTTCCCCG CCGCAGTCGT GGAGGAAACC TTCAAACAAA 

251 ACGGCTTGAC CAATGCCGCC GATATTCACG CCGTCCGGCC GGAAAAACTG 

301 CATCAGATTT TCGGCAATGA TGCGGTTTTG TACATTACGG TTACCGAATA 

351 CGGCACTTCA TATCAAATTT TAGACAGCGT GACGACCGTA TCCGCCAAAG 

4 01 CACGGCTGGT CGATTCCCGC AACGGAAAAG AGTTGTGGTC GGGTTCGGCC 

4 51 AGCATCCGCG AAGGCAGCAA CAACAGCAAC AGCGGCCTGT TGGGGGCTTT 

501 GGTCAGCGCA GTGGTCAATC AGATTGCCA, ..\JCCTGACC GACCGCGGTT 

551 ATCAGGTTTC TAAAACCGCC GC AT ACAACC TGCTGTCGCC CTATTCTCAC 

601 AACGGCATCT TGAAAGGTCC GAGATTCGTC GAAGAGCAGC CCAAATAA 

This corresponds to the amino acid sequence <SEQ ID 874; ORF 235. a>: 

a2 3 5 . pep 

1 MKPLILGLAA VLALSA CQVQ KAPDFDYTSF KESKPASILV VPPLNESPDV 

51 NGTWGVLAST AAPLSEAGYY VFPAAWEET FKQNGLTNAA DIHAVRPEKL 

101 HQ I FGNDAVL YITVTEYGTS YQILDSVTTV SAKARLVDSR NGKELWSGSA 

151 SIREGSNNSN SGLLGALVSA WNQIANSLT DRGYQVSKTA AYNLLSPYSH 

201 NGILKGPRFV EEQPK* 

m235/a235 100.0% identity in 215 aa overlap 

10 20 30 40 50 60 

m235 . pep MKPLILGLAAVLALSACQVQKAPDFDYTSFKESKPASILVVPPLNESPDVNGTWGVLAST 
I > I M I I I I I M I I ! I I I I I | ! | | | I ! ! ! ! ! [j I ! I I I I t I I I M ! I I i I i 1 I I I M i I ! 
a2 35 MKPLI LGLAAVLALSACQVQKAPDFDYTS FKESKPAS I LVVPPLNES PDVNGTWGVLAST 

10 20 30 40 50 60 

70 80 90 100 110 120 

m235.pep AAPLSEAGYYVFPAAVVEETFKQNGLTNAADIHAVRPEKLHQI FGNDAVLYITVTEYGTS 

I I f S I I ! M I t i I I I I I I I I ! t I I I I I I i 1 I I I i I I I i i I I I j I I 1 | | | M I i II I I I I I 
a235 AAPLSEAGYYVFPAAWEETFKQNGLTNAAD I HAVRPEKLHQI FGNDAVLYITVTEYGTS 

70 80 90 100 110 120 

130 140 150 160 170 ISO 

m235. pep YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSN5GLLGALVSAWNQIANSLT 
I I M I ! I 1 I I I ! 1 E i I I I I I I I M I I M I 1 I U ! I I I I 1 I I II I I I i 1 I I 1 I 1 [ i ! M M 
a2 35 YQILDSVTTVSAKARLVDSRNGKELWSGSASIREGSNNSNSGLLGALVSAVVNQIANSLT 

130 140 150 160 170 180 

190 200 210 

m235 . pep DRG YQVS KTAAYNLLS PYSHNG I LKGPR FVEEQPKX 

I I I M I I I I I I ! I I t f I I I I I ! I I | I I | | ! | j M I i 
a235 DRGYQVSKTAAYNLLSPYSHNGILKGPRFVEEQPKX 
190 200 210 



m23 5 . pep 
g235 

m23 5 -pep 
g235 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 875>: 

g236 . seq 
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10 20 30 

m2 3 6 . pep LHGRTDGFVGAQRLDGGGYRFAGFADCRPF 

1 !■ MM: , M MMIII 

g23 6 FRHQQGKAQFFAOSIQIAGHFFRRGNFGFRLQGRTDSFVGAQRLDSGGYCFARFADCRPF 
60 70 80 90 100 110 

40 50 60 70 80 90 

m236 . pep FHQFGFGFFVDGRELVPSMEEDAVXFAAAXDVPRFFAGEAQNRCNQENQTACDVIQGSLC 

MIIIIIIHIIMMIIiMIII llll IIIIIIHIMIIIIIMhl Ihlhl 

923 6 FHOFGFGFFVDGRELVPSMEEDAVFFAAADDVPRFFAGEAQNRCNQENQAARDWQGGLR 
120 130 140 150 160 170 

100 110 120 130 140 150 

m236 . pep AAACMAVCFGGVEAVFQDVEVERTQVFRAERNXVFYGKVEXITRIVIACQTLLQLTCQYH 

IN II I 1 1 M I II I I II 1 1 h 1 1 1 I I 1 1 I M h i hill 1 1 1 1 1 1 1 Ih 

92 3 6 AAAGAAVGFGGVEAVFQDVEVERAQVFRAERNNVFHGEVEGIARI VTACOTLLQPPRQYQ 

180 190 200 210 220 230 

160 170 180 190 200 210 

m2 36 . pep GVAVDFHHIRLLHGIFNRIKVAQVGKQKAQGIADTAVAFGYAFEDFFGNRQFAAVIGRCR 

IIIIIMIMIIIMIIIIIIIhllhlllllhlllll IIIIIIMIIIIIIII II 

923 6 G V A VD FHHIRLLHGIFNRI KV AQ I G KQ E AO G I AD AA V A FGN A F E D F FGNRQ F AA V I GGC R 

240 250 260 270 280 290 

220 230 240 250 259 

m23 6 . pep PQAQDVCAEFVINLLRCNDVADGFRHFFAFAVDNETMGOQLFIRRATHX 

iiiiii : minium i ■ hi 

g23 6 PQAQDVRAEFVIDFLRRDDVADGFRHFAAVLVNKETVGQQLFIRCASHG 
300 310 320 330 340 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 879>: 

a236 . seq 

1 ATGGCGCGTT TCGCCTTCTC CGCCGACATT CTCTGCACAG CGTTTGCAGA 

51 CGGTTTCATG GCCTGCAACC GCGCCCACAT CGCGGGTGTA GTGCCAGCAG 

101 CGTTCGCATT TTTCACCATC ACTGGCTTTA GCGGCAACGG CAAGTTCGCT 

151 GCCTACTTTC ACTTCTGCTT TAGACACCAG CAAAGCAAAG CGCAATTCTT 

201 CGCCCAAAGC ATTCAGATAG CCGGCCATTT CTTCCGGCGC GGTAATTTCG 

251 GCTTCGGCTT GCAAGGACGA ACCGACGGTT TTGTCGGCGC GCAAAGGCTC 

301 GATGGCGGCG GTTACCGCTT CGCGGGCTTC GCGGATTGCC GTCCATTTTT 

351 TCACCAGTTC GGCTTCGGCT TTTTCGTTGA TGGTCGGGAA CTCGTGCCAA 

4 01 GTATGGAAAA GCACGCTGTC TTCTGCGCCG CCGCCGATGA TGTCCCACGC 

4 51 TTCTTCGCCG GTGAAGCACA AAATCGGTGC AATCAAGAGA ACCAGGCTGC 

501 GCGTGATGTG GTACAGGGCG GTTTGCGCGC TGCGGCGGGC GCGGCTGTCG 

551 GCTTTGGTGG TATAGAGGCG GTCTTTCAGG ATATCGAGGT AGAACGCGCC 

601 CAAGTCTTCC GAGCAGAAAG AAACCATTTC TTTCACGGCA AAGTGGAAGG 

651 CATAACGCGG ATAAAAATCA CCGGCAACGC GTTCTTGCAG CCGCCTTGCC 

701 AACACCAAGG CATAGCGGTC GATTTCCACC ATATCCGCCT GTTGCACGGC 

7 51 ATCTTCAATA GGATTGAAGT CGCTCAAGTT GGCAAACAAA AAGCTCAAGG 
801 TATTGCGGAT ACGGCGGTAG CTTTCGGTTA CGCGCTTGAG GATTTCTTTG 

8 51 GAAATCGCCA ATTCGCCGCT GTAATCGGTG GATGCCGCCC ACAGGCGCAG 
901 GATGTCCGCG CCGAACTCGT TATACACTTC TTGCGGCGCG ACGACGTTGC 
951 CGATGGATTT CGACATTTTG CGCCCGTTTT GATCCACCAC GAAACCATGG 

1001 GTCAGCAGCT GTTTGTACGG CGCGCGACCC ATTGA 

This corresponds to the amino acid sequence <SEQ ID 880; ORF 236.a>: 

a236.pep 

1 MARFAFSADI LCTAFADGFM ACNRAH IAGV VPAAFAFFTI TGF SGNGKFA 

51 AYFH FCFRHQ QSKAQFFAQS IQIAGHFFRR GNFG FGLQGR TDGFVGAQRL 

101 DGGGYRFAGF ADCRPFFHQF GFGFFVDGRE LVPSMEKHAV FCAAADDVPR 

151 FFAGEAQNRC NQENQAARDV VQGGLRAAAG AAVGFGGIEA VFQDIEVERA 

201 QVFRAERNHF FHGKVEGITR IKITGNAFLQ PPCQHQGIAV DFHHIRLLHG 

251 I FNRIEVAQV GKQKAQGIAD TAVAFGYALE DFFGNRQFAA VIGGCRPQAQ 

301 DVRAELVIHF LRRDDVADGF RHFAPVLIHH ETMGQQLFVR RATH* 
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101 AFQRPEYRVQ TDISIDGIDH I FTLDAAFGR VNQPTVLIRF DARHRGLLEN 

151 LRAGFGSGAG NTQRVIERMK MPGQGIEL CA LVHI AVKLLF QLGFI PKLIM 

201 TR TVMPLGVF MPLLQLFPML RTDGNRGITA LPIAIDGMFA DAFVHQFDRL 

251 QRLLPKPLRL LQADLFFNFP HTAGWADNL PATPAGRAEA DTCGFQNDGF 

301 MSVFRQRQCG TQTAQTAADN AGFGFQTALE FRINSMRINR TKIIRRQIFL 

351 KIRANHCVCF IGYIFGRNDT DCRAISSKQK IG* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 883>: 

m237 . seq 

1 ATGCGGGACA AGGTTGGCGG TAATGTCGCA CTCCCCGCCC CACGAATATT 

51 CGATTTTGAC ATCGGCAAGC TGCGGAAAAA CTTTAAGCAT ATCTTGGCGG 

101 ACAAGCTCGG TCATaCGCTC AGGATTGTCG ATAAACTCGT TATCCTTACC 

151 GCCGAAAAGC AGTCTGCCGT CCGCGCTGAG GCGGTAATAA TCCAAAATAT 

201 GGCGGTTGTC GCATACTGCC ATATTGTTA Z GGATAAGCCC TTTTGCGCGC 

251 GCCCCCAAGG G1TCGGTCGC AATAATAAAG GTGCTGACAG CAATCGCCTT 

301 GCGTTCCAAA GGCCGGAATA TCGGGTTCAA ACCTGCATAA GTATTGACAG 

351 CATAGACCAC ATTTTTGCAC TCGACGCTGC CTTCGGGCGT GTAAACCAGC 

4 01 CAACCGTTTT GATGCGGTTC GATGCACGTC ATCGGGGATT GCTCGAAAAT 

4 51 CTGCGCACCG GCTTCGGCAG CGGCACGAGC GATGCCCAAA GTGTAAGTGA 

501 GCGGATGCAG GTGTCCGGAT AAGGGGTCGA ATTGTGCCCC TTGGTACATA 

551 TCGCTGTCAA GCTGCTGTTT CAACTCGGCT TTATCCCAAA GTTGATAATG 

601 ACTCGCACCG TAATGCCGTT GGGCGTGTTC ATGCCACTGC TGCAACTCTT 

651 CCCAATGCTG CGGACGGACG GCAACCGTGG CATAACCGCG CTGCCAATCA 

7 01 CAATCGACGG CATGTTTGCG GACGCGTTCG TCCACCAGTT CGACCGCCTG 
751 CAAAGACTGT TGCCAAAACC ATTGCGCCTG CTCCAAGCCG ACCTGTTTTT 

8 01 CAATTTCCCC CATACCGCAG nCGTAATCGC TGATAACCTG CCCGCCACTC 
851 CGTCCCGACG CGCCGAAACC GATACGCGCG GCTTCCAACA CAACCGTTTC 
901 ATGTCCCTGC TCCGCCAAGG GCAATGCAGT GCACAAACCA CCCAATCCGC 
951 CGCCGATGAT ACAGGTATCG GTTTTCAGAC GGCATTGAAG TTtCGGATAA 

1001 ACAGTATGAG GATTAACCGA ACTGAAATAA TAAGAAGGCA GATATTCTTG 

1051 AAAATCAGGG CGAATCATTG TGTTTGCTTT ATCAGGTGTA TTTTCGGACG 

1101 GAATGATACA GGCTGTCGGG CCATATCGTC CAwACAGAAA ATCGGTTGA 

This corresponds to the amino acid sequence <SEQ ED 884; ORF 237>: 

m23 7 . pep 

1 MRDKVGGNVA LPAPRIFDFD IGKLRKNFKH ILADKLGHTL RIVDKLVILT 

51 AEKQSAVRAE AVI IQNMAW AYCHIVT DKP FCARPQGFGR NNKGADSNRL 

101 AFQRPEYRVQ TCISIDSIDH I FALDAAFGR VNQPTVLMRF DARHRGLLEN 

151 LRTGFGSGTS DAQSVSERMQ VSGXGVEL CP LVHIAVKLLF QLGFI PKLIM 

201 TRT VMPLGVF MPLLQLFPML RTDGNRGITA LPITIDGMFA DAFVHQFDRL 

251 QRLLPKPLRL LQADLFFNFP HTAXVIADNL PATPSRRAET DTRGFQHNRF 

301 MSLLRQGQCS AQTTQSAADD TGIGFQTALK FRINSMRINR TEIIRRQIFL 

351 KIRANHCVCF IRCIFGRNDT GCRAISSXQK IG* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from ;V. gonorrhoeae 

ORF 237 shows 86.1% identity over a 382 aa overlap with a predicted ORF (ORF 237.ng) 
from N. gonorrhoeae: 

m237/g237 

10 20 30 40 50 60 

m23 7 .pep MRDKVGGNVALPAPRI FDFDIGKLRKNFKHILADKLGHTLRIVDKLVILTAEKQSAVRAE 

lllllllhlllllllll :!MIIII1IIIII!IIIII INlhlllMIM Hill 

g2 3 7 MRDKVGGNIALPAPRI FDSNIGKLRKNFKHILADKLGHTRRIVDKFVI LTAEKQPAVRAE 

10 20 30 40 50 60 

70 80 90 100 110 120 

m237.pep AVI IQNMAWAYCHI\^TDKPFCARPQGFGRNNKGADSNRLAFQRPEYRVQTCISIDSIDH 

IIIMIIIIMllllhlllllll Mil lllllhlllllllMIIIII lllhlll 
g23 7 AVI IQNMAWAYCHIVADKPFCARAQGFGGNNKGADGNRLAFQRPEYRVQTDISIDGIDH 

70 6. 90 100 110 120 

130 140 150 160 170 180 

m237 . pep IFALDAAFGRVNQPTVLMRFDARHRGLLENLRTGFGSGTSDAQSVSERMQVSGXGVELCP 
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m237/a237 85.6% identity in 382 aa overlap 

10 20 30 40 50 60 

m237 .pep mrdkvggnvalpaprifdfdigklrknfkhiladklghtlrivdklviltaekqsavrae 

I | ! | | ! f I ! I I M I I I I I I t I I ! M M ! I I i I I i I I I I I I E I I I I I I I I I I I I I M I I 
a2 37 MRDKVGGNVALPAPRI FDFDIGKLRKNFKHILADKLGHTRGIVDKLVILTAEKQSAVRAE 

10 20 30 40 50 60 

70 80 90 100 110 120 

m237 -pep AVI IQNMAVVAYCHI VTDKPFCARPQGFGRNNKGADSNRLAFQRPEYRVQTC I S I DSI DH 

II | | | | | : M ! I I I i I : I I I I I : I III i I 1 I M I I I I I : ! I II M I I MM— ■ 
a 237 AVIIQNMTWAYCHIVADKPFCTRAQGFCGNNKGADSNRLALQRLEYRIOTGISIDGVHQ 

70 80 90 100 110 120 

130 140 150 160 170 180 

m237 .pep IFALDAAFGRVNQPTVLMRFDARHRGLLENLRTGFGSGTSDAQSVSERMQVSGXGVELCP 

lll:MM! I M M II : I I M I : I : I I I I : I I I I : : I I I I MM: I Mill 
a 237 iFAFDAAFGGVNQPTVLIRFNAYHGRMLKNLRTSFGSGAGDAQRVIERHEMPGQGIELCA 

130 140 150 160 HO 180 

190 200 210 220 230 240 

m->37 pop LVHIAVKLLFQLGFIPKLIMTRTVMPLGVFMPLLQLFPMLRT DGNRGITALP ITIDGMFA 

1 | | | | I I I I : I : : ||:|||: I M I I M I I I I M M I M II I M M I I I I M M I I II 
a2 37 LVHIAVKLLLQFS VI PELIMSCTV I FLGVLMPLLQFFPMLRT DGNRGITALP I AINGMFA 

190 200 210 220 230 240 

250 260 270 280 290 300 

m237 pep DAFVHQFDRLQRLLPKPLRLLQADLFFNFPKTAXVIADNLPATPSRRAETDTRGFQHNRF 
| | i | | | | || M I I I M I I M I I : I I I M I I I t I I M I! I M I I I I II M I I I M M II 
a2 37 DAFVHQFDRLQRLLPKPLRLLQTDLFFNF LHTAGVIADNLPATPSRRAETDTRGFQHNRF 

250 260 270 280 290 300 

310 .320 330 340 350 360 

m237 pep MSLLRQGQCSAQTTQSAADDTGIGFQTALKFRINSMRINRTEI IRRQIFLKIRANHCVCF 

1 j I I I M I M I I I M I I II M I I M M I I M M I M II II I I M M M M M II M M I I 
a2 37 MSLLRQGQCSAQTTQSAADDTGIGFQTALKFRINSMRINRTEI IRRQIFLKIRANHCVCF 

310 320 330 340 350 360 

370 380 
m2 37 . pep IRCIFGRNDTGCRAISSXQKIGX 
I I M M M M I M II Mill 
a2 37 IGYIFGRNDTGCRAISSKQKIGX 

370 380 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 887>: 

g238 . seq 

1 atgaatttgc ctattcaaaa attcatgatg ctgttggcag cggcaatatc 

51 gatgctgcat acccccatta gtcatgcgaa cggtttggat gcccgcttgc 

101 gcgatgatat gcaggcaaaa cactacgaac cgggtggcaa ataccatctg 

151 tttggtaatg ctcgcggcag tgttaaaaat cgggtttgcg ccgtccaaac 

201 atttgatgca actgcggtcg gccccatacc gcctattaca cacgaacgga 

251 caggatttga aggtgttatc ggctatgaaa cccatttttc aggacacgga 

301 cacgaagtac acagcccgtt cgataatcat gattcaaaaa gcactcctga 

351 tttcagcggc ggcgtagacg gcggttttac cgtttaccaa cctcaccgga 

401 cagggtcgga aatacatccc gcagacggat atgacgggcc tcaaggcggc 

4 51 ggttatccgg aaccacaagg ggcaagggat atatacagct accatatcaa 

501 aggaacttca accaaaacaa agataaacac tgttccgcaa gccccttttt 

551 cagaccgctg gctaaaagaa aatgccggtg ccgcttccgg ttttctcagc 

601 cgtgcggacg aagcaggaaa actgatatgg gaaaacgacc ccgataaaaa 

651 ttggcgggct aaccgtatgg atgatatccg cggcatcgtc caaggtgcgg 

701 ttaatccttt tttaacgggt tttcaagggg tagggattgg ggcaattaca 

751 gacagtgcgg taagcccggt cacagataca gccgctcagc agactccaca 

801 aggtattaat gatttaggaa atttaagtcc ggaagcacaa cttgccgccg 
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151 DYPPPGGARD IYSYYVKGTS TKTKTNIVPQ APFSDRWLKE NAGAASGFFS 

201 RADEAGKLIW ESDPNKNWWA NRMDDVRGIV QGAVNPFLMG FQGVGIGAIT 

251 DSAVSPVTDT AAQQTLQGIN DLGKLSPEAQ LAAASLLQDS AFAVKDGINS 

301 AKQWADAHPN ITATAQTALS AAEAAGTVWR GKKVELNPTK WDWVKNTGYK 

351 KPAARHMQTL DGEMAGGNKP IKSLPNSAAE KRKQNFEKFN SNWSSASFDS 

4 01 VHKTLTPNAP GILSPDKVKT RYTSLDGKIT IIKDNENNYF RIKDNSRKQY 

451 LDSNGNAVKT GNLQGKOAKD YLQQQTHIRN LDK* 

Computer analysis of this amino acid sequence gave the following results: 
Homology with a predicted ORF from N. gonorrhoeae 

ORF 238 shows 86.0% identity over a 401 aa overlap with a predicted ORF (ORF 238.ng) 
from N, gonorrhoeae; 

m238/g238 



10 20 30 40 50 60 

m238 . pep MNLP I QKFMMLFAAAI SLLQ I P I SHANGLDARLRDDMQAKH YEPGGKYHLFGNARGSVKK 

IIIIIIMIIhlllihhlllllllllllllllllllllllllllllllllllillh 

g23 8 MNLPIQKFMMLLAAAISMLKIPISHANGLDAKLRDDMQAKHYEPGGKYHLFGNARGSVKN 
10 20 30 40 50 60 



70 80 90 100 110 120 

m23 8 . pep RVYAVQTFDATAVSPVLPITHERTGFEGVIGYETHFSGHGHEVHSPFDHHDSKSTSDFSG 

II MIIIUIihhlllMillMIIIIUIIMlillllllillMMlNIIUI 

g238 RVCAVQTFDATAVGPILPITHERTGFEGVIGYETHFSGKGHEVHSPFDNHDSKSTSDFSG 

70 80 90 100 110 120 



130 140 150 160 170 180 

m23 8 .pep GVDGGFTVYQLHRTGSEIHPEDGYDGPQGSDYPPPGGARDIYSYYVKGTSTKTKTNIVPQ 

illllllllMIIIIIIIM IIIIIMI: II I lilHllhMlllllli I III 

g23 8 GVDGGFTVYQLHRTGSEIHPADGYDGPQGGGYPEPQGARDIYSYKIKGTSTKTKINTVPQ 
130 140 150 160 170 180 



190 200 210 220 230 240 

m23 8 - pep APFSDRWLKENAGAASGFFSRADEAGKLIWESDPNK2WWANRMDDVRGIVQGAVNPFLMG 

I I i I I I I I I I I I I I I II I : I i I II 1 I I I I II : II : I I I M I I I h I I I I I I I 11 I I I I 
g23 8 APFSDRWLKENAGAASGFLSPJ^EAGKLIWE^PDKN^RANRMDDIRGIVQGAVNPFLTG 
190 200 210 220 230 240 



250 260 270 280 290 300 

m23 8 . pep FQGVGIGAITDSAVSPVTDTAAQQTLOGIJ^DLGKLSPEAQLAAASLLQDSAFAVKDGINS 

IIIMMIIMMIIIMMIIiniMllllhllllllMIIIIIIIIMIIIMIII 

g238 FQGVGIGAITDSAVSPVTDTAAQOTLOGINDLGNLSPEAQLAAASLLODSAFAVKDGINS 
250 260 270 280 290 300 



310 320 330 340 350 360 

m2 3 8 . pep AKQWADAHPNITATAOTALSAAEAAGTWRGKKVELNPTKWDWVK3^TGYKKPAARHMQTL 

I = I I I I I I I I I I II I i II I : I I I I II II I 1 I I I I II I I I I I I I I I II I II I I I I M I I : 
g2 3 8 ARQWADAHPN I TATAQTALAVAEAAGTVWRGKKVEI^PTKWD WKNTG YKKPAARHMQTV 

310 320 330 340 350 360 



370 3B0 390 400 410 420 

m238 . pep DGEMAGGNKP I KSLPNSAAEKRKQNFEKFNSNWSSAS FDS VHKTLTPNAPG I LS PDKVKT 

lllllllhl Ih U - - h - - - - 

g23 8 DGEMAGGNRPPKSI -TSEGKANAATYPKLVNQLNEQNLNNIAAQDPRLSliAIHEGKKNFP 

370 380 390 400 410 

430 440 450 460 470 480 

m23 8 . pep RYTSLDGKI XI I KDNENNYFRIHDNSRKQYLDSNGNAVKTGNLQGKOAXD YLQQQTHIRN 

g23 8 IGTATYEEADRLGKIWVGEGAROTSGGGWLSRDGTRQYRPPTEKKSQFATTGIOANFETY 
420 430 440 450 460 470 
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I I M I I I I i II I I II I I I I I I I I I I M II I I i : I I I I I II f I I : I I I I i I I I I I I I I I 
a2 38 FQGVGIGAITDSAVSPVTDTAAQQTLQGINHLGNLSPEAQLAAATALQDSAFAVKDGINS 
250 260 270 280 290 300 

310 320 330 340 350 360 

m2 38 . pep AKQWADAHPNITATAQTALSAAEAAGTVWRGKKVELN PTKWDWVKNTGYKKPAARHMQTL 

I : | I I I I I I I ! I I ! I II I I : : 1 I I I Ml I I ! I I I I I I I I I I I I I ! I I I I I : ! I : I I 
a 238 ARQWADAH PN I TATAQT ALAVAEAATTVWGGKKVE LN PT KWDWVKNTG YKT PAVRTMHTL 

310 320 330 340 350 360 

370 380 390 400 410 419 

m238.pep DGEMAGGNKPIKSLP-NSAAEKRKQNFEKFNSNWSSASFDSVHKTLTPNAPGILSPDKVK 

i I I I i I I I : I M : Ml: I 
a238 DGEMAGGNRPPKSITSNSKADASTQ 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 893>: 

g239 . seq 

1 atgttccacc ataaaggtat tgcccgaaac cggcggatgg aggttttgtt 

51 tttccgccgc cgccctgatc gcttcgtgat tcgccaaacg cgcctgttgc 

101 agcctcattt gcgcataatc ctgctccaag gcgatttcct gttttttcgc 

151 cttgtccaaa gctgtgaagt tgagcctgta ctggttttgc tgcatcacaa 

201 cggaaaaagc ggaaacgcac accgcaagca gcagaaagaa attcgatttg 

251 ttcattgccg ctcagacgtt tttctctgtt actattccgg tatcggaccg 

301 gcagtccgct ccgccacacg caaaactgcg ctcctcgccc tcgggttggc 

351 ggcaactccc gcttcacccg gctttaatgc cctgcccacg attttcaggg 

401 gcggatcggg caaatccgct tctctgaccg ccgcccagct cggcaggggc 

451 tcgtgttgcg aatatttttt gacaaactgc ttcacaatgc ggtcttccaa 

501 cgaacggaaa gcaatgaccg ccaaacgccc gccctcttrc agacggcaca 

551 tgacctgcgg caataccgcc cctacttctt caagctcgcg gttaataaag 

601 atgcggattg cctggaaggt gcgcgccgca ggatcctgcc cccgcccgcg 

651 agtacggacg ttttgtgcca caatctgcgc cagctcgcgg gttgratcga 

701 ttggactttc cgcccgctgc gcgacaatgg cgcgcacaat ctggcggcta 

751 aaccgctctt caccataa 

This corresponds to the amino acid sequence <SEQ ID 894; ORF 239. ng>: 

g239 .pep 

1 MFHHKGIARN RRMEVLFFCR RPDRFVIRQT RLLQPHLRII LLQGDFLFFR 

51 LVQSCEVEPV LVLLHHNGKS GNAHRKQQKE IRFVHCRSDV FLCYYSGIGP 

101 AVRSATRKTA LLALGLAAIS ASPGFNALPT IFRGGSGKSA SLTAAQL.GRG 

151 SCCEYFLTNC FTMRSSNEWK AMTAKRPPSF RRHMTCGNTA PTSSSSRLIK 

2 01 MRIAWKVRVA GSCPRSRVR T FCATICASLR WSIGLSARC ATMARTIWRL 
251 NRSSP* 

The following partial DNA sequence was identified in N. meningitidis <SEQ ID 895>: 

m239 . seq 

1 ATGCTCCACC ATAAAGGTmy kGCCCGAAAC CGGCkGATGG AGGTTTTGTT 

51 TTTCTGCCGC CGCCCTGATC GCTTCGTGGT TCGCCAAACG CGCCTGTTGC 

101 AGCCTCATTT GCGCATAATC CTGCTCCAAG GCGATTTCCT GTTTTTTCGC 

151 CTTATCCAAA GCTGTGAAAT TGAGCCTGTA CTGGTTTTGC TGCATCACAA 

201 CGGAAAAAGC GGAAACGCAC ACCGCAAGCA GCAGAAGGAA ATTCAATTTG 

251 TTCATTGCCA TTCAGACGTT TTTCTCTGTG ATTGTTCCGG TATCGGACCG 

3 01 GCAGTCCGCT CCGCCACACG CAAAACCGCA CTTCTCGCCC TCGGATTGGC 
351 GGCAATTTCC GCCTCACCCG GCTTTAATGC CCTGCCCACG ATTTTCAGGG 
401 GCAGCTCGGG CAAATCCGCT TCCCTGaCCG CCGCCCAGCG CGGCAGGGGC 
451 GCGTGTTGCG AATATTTTTT GACAAACTGC TTCACAATGC GATCTTCCAA 
501 CGAATGGAAA GCAATGACCG CCAAACGTCC GCCCTCTTTC AGACGACACA 

5 51 TGACCTGCGG CAATACTGCC CCTACTTCTT CAAGCTCGCG GTTAATAAAG 

6 01 ATGCGGACCG CCTGGAAGGT GCGCGTCGCA GGATCCTGCC CCCGCTCGCG 
651 AGTACGGACG TTTTGTGCCA CGATCTGCGC CAGCTTGCGG GTTGTATCGA 
701 TTGGACTTTC CGCCCGTTGC GCAACAATGG CGCGCGCAAT cCGGCGGCTa 
751 AACCGCTCTT cACCATAA 

This corresponds to the amino acid sequence <SEQ ID 896; ORF 239>: 
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This corresponds to the amino acid sequence <SEQ ID 898; ORF 239.a>: 

a239 .pep 

I MLHHKGIARN RRMEVLFFCR RPDRFWRQT RLLQPHLRI I LLQGDFLFFR 

51 LIQSCEVEPV LVLLHHNGKS GNAKRKQQKE IQFVHCHSDV FLCDCSGIG? 

101 AVRSATRKTA LLALGLAAI S ASPGFNALPA I FRGGS GKSA SLTAAQRGRG 

151 ACCEYFLTNC FTMRSSNEWK AMTAKRPPSF RRHMTCGNTA PTSSSSRLIK 

201 MRIAWKVRVA GSCPRSRVRT FCATICASLR VVSIGLSARC ATMARAIWRL 

251 NRSSP* 

m239/a239 97.3% identity in 255 aa overlap 

10 20 30 40 50 60 

m23 9 pep MLHHKGXARNRXMEVLFFCRRPDRFVVRQTRLLQPHLRI ILLQGDFLFFRLIQSCEIEPV 

I | I I [ I I I I I I I I i I I i I I I I t I I I I I I t i I I I I I M I I I I I I 1 I 1 i t i I I i : II f 
a2 3 9 MLHHKGIARNRRMEVLFFCRRPDRFVVRQTRLLQPHLRI I LLQGDFLFFRLIQSCEVEPV 

10 20 30 40 50 60 

70 80 90 100 110 120 

m23 9 pep LVLLHHNGKSGN^HRKQQKEIQFVHCHSDVFLCDCSGIGPAVRSATRKTALLALGLAAIS 
I | M M | | I I I II I I I I I I I t I I I I II I if I I I i I I I ! I I I I t i I I M ! I I I I I I I I I I I 
a 2 3 9 LVLLHHNGKSGNAHRKQQKEIQFVHCHSDVFLCDCSGIGPAVRSATRKTALLALGLAAIS 
70 80 90 100 110 120 

130 140 150 160 170 180 

m239.pep ASPGFNALPTIFRGSSGKSASLTAAQRGRGACCEYFLTNCFTMRSSNEWKAMTAKRPPSF 
i j I I I I I I I : I I I 1 : M I I t I 1 I I I I I I I I I M ■ I M I ! I ! M I I I I I M ! U I I I I I I f 
a2 39 ASPGFNALPAIFRGGSGKSASLTAAQRGRGACCEYFLTNCFTMRSSNEWKAMTAKRPPSF 

130 140 150 160 170 180 

190 200 210 220 230 240 

m23 9 pep RRHMTCGNTAPTSSSSRLIKMRTAWKVRVAGSCPRSRVRTFCATICASLRWS IGLSARC 

M | | | | | M I! I I ! I I I I I I I I I I I I I I I I I i I I I I I M M I I I I M I i I i I 1 I I i I I ! 
a2 3 9 RRHMTCGNTAPTSSSSRLIKMRIAWKVRVAGSCPRSRVRTFCATICASLRVVS IGLSARC 

190 200 210 220 230 240 

250 

m23 9 . pep ATMARAIRRLNRSSPX 
I I I 1 I t I I I M I! I I 
a23 9 ATMARAIWRLNRSSPX 
250 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 899>: 

g240 . seq 

1 atgatagaag tcatacattt cttcggcgcc gaaacgcgca gacagtttgc 

51 ttgtgccgac gttggacgat ttctgcataa tgccgcgcac atccaaagag 

101 gggtaaacat gggtatcatc gcgcacggga gacggtccga ttttataagg 

151 ctgcgtattc agccgtccgt tcaaatcggt tttgcccgca tccaatgcct 

201 tcgcaatcac gaacggtttg attgccgaac caggttcgat catatcggtt 

251 acggcacggt cgcgccgccg ttcgctgtct gcccggccgg gtctgttggg 

301 atcgtaggcg ggcgtattgg ccaaggcgag gatttccccc gtgcgggcat 

351 ccaaaaccac caccgttccg gcttctgcct gatggtattc gaccgccttg 

401 ttcaactctt cataggccaa ggtctgaatc ccctgatcga gggaaaggat 

451 gatgtctttg ccgttttgcg gtgctttatt gcgcggggag tccaagctgt 

501 ccacaatatt gccctgccgg tcccgcaaaa caacttccgc gccgtcttcg 

551 ccatacaggc tgtcttcaag cgaaagttcc aaaccttccc gacctttgcc 

601 gtcaatatcg gtaaacccga tgacgtgtgc aaacaggttg cccatcgggt 

651 aatggcgctt taa 

This corresponds to the amino acid sequence <SEQ ID 900; ORF 240.ng>: 

g24 0 .pep 

1 MIEVIHFFGA ETRRQFACAD VGRFLHNAAH IQRGVNMGI I AHGRRSDFIR 

51 LRIQPFVQIG FARIQCLRNH ERFDCRTRFD HIGYGT VAPL FAVCPAGSVG 

101 IVGGRIGQGE DFPRAGIQNH HRSGFCLMVF DRLVQLF I GQ GLNPLIEGKD 

151 DVFAVLRCFI ARGVQAVHNI ALPVPQNNFR AVFAIQAVFK RKFQTFLTFA 
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2 51 GCACGGTTGC GCCGCTGTTC GCTGTCTGCC CGGCCGGGCC TGTTGGGATC 

301 GTAGGCGGGC GTATTGGCCA AGGCGAGGAT TTCCCCCGTG CGGGCATCCA 

351 AAACCACCAC CGTTCCGGCT TTTGCCTGAT GGTATTCGAC CGCCTTGTTC 

4 01 AACTCTTCAT AGGCCAAGGT CTGAATCCTC TGATCGAGGG AAAGGATGAT 

4 51 GTCTTTGCCG TTTTTCGGGG CTTTATTGCG CGGGGAGTCC AAGCTGTCCA 

501 CAATATTGCC CTGCCGGTCC CGCAAAACGA CTTCCGCGCC GTCTTCGCCA 

551 TGCAGGCTGT CTTCAAGCGA AAGTTCCAAA CCTTCCTGAC CTTTGCCGTC 

601 AATATCGGTA AATCCGATGA CGTGTGCAAA CAGGTTGCCC ATCGGGTAAT 

651 GGCGTTTTAA 

This corresponds to the amino acid sequence <SEQ ID 904; ORF 240. a>: 

a240 .pep 

1 MIEVIHFFGT ETRRQFACAD VGRFLHDAAH IQRGVNMGIA HGRRSDFIRL 

51 RIQPFVQIGF ARIQCLRNHK RFDCRTGFDH IGYGT VAPLF AVCPAGPVGI 

101 VGGRIGQGED FPRAGIQNHH R3GFCLMVFD RLVQLFIGOG LNPLIEGKDD 

151 VFAVFRGFIA RGVQAVHMIA LPVPQNDFRA VFAMQAVFKR KFQTFLTFAV 

201 NIGKSDDVCK QVAHRVMAF* 

m240/a240 99. 1 % identity in 2 1 9 aa overlap 

10 20 30 40 50 60 

m2 4 0, pep MIEVIHFFGTETRRQFACADVGRFLHDAAHIQRGVNMGIAHGRRSDFIRLRIQPFVQIGF 
I 1 I II I I I I I 1 I i I I i I i I i I I I i I I 1 1 I I M I I i i t I ! I ! I M i I I M I I I I 1 I II t I I 
a2 4 0 MIEVIHFFGTETRRQFACADVGRFLHDAAHIQRGVNMGIAHGRRSDFIRLRIQPFVQIGF 

10 20 30 40 50 60 

70 80 90 100 110 120 

m24 0.pep ARIQCLRNKKRFDCRTGFDHIGYGTVAPLFAVCPAGPVGIVGGRIGQGEDFPRAGIQXHH 
I I i I M ( ) If I I I M M I I I I I I I I M 1 I I f I I I i I I I I I M I I f I I I I I I I I M M II 
a 24 0 ARIQCLRNHKRFDCRTGFDHIGYGTVAPLFAVCPAGPVGIVGGRIGQGEDFPRAGIQNHH 
70 80 90 100 110 120 

130 140 150 160 170 180 

m24 0 .pep RSGFCLMVFDRLVQLFIGQGLNPLIEGKDDVFAVFRGFXARGVQAVHM1ALPVPQNDFRA 
! M I I M I 1 I i I II I 1 I I I ( I 1 ! I I I I I I I II M I I I I II I II I I I II I I M I M M M 
a24 0 RSGFCLMVFDRLVQLFIGQGLNPLIEGKDDVFAVFRGFIARGVQAVHNIALPVPQNDFRA 
130 140 150 160 170 180 

190 200 210 220 

m240. pep V FAMQAVFKRK FQT FLT FAVN IGKS DDVCKQVAH R VMAFX 

I I i I I I I I I I I t I i 1 I I I I I I I It II I II I I 1 I i I I I I I I 
a2 4 0 V F AMQAV FKRKFQT FLT FAVN I GKS D D VCKQV AHR VMA FX 

190 200 210 220 



The following partial DNA sequence was identified in N. gonorrhoeae <SEQ ID 905>: 



g241 . seq 












1 


ATGATAGAAG 


TCATACATTT 


CTTCGGCACC 


GAAACGCGCA 


GACAGTTTGC 


51 


TTGTGCCGAC 


GTTGGACGAT 


TTCTGCATGA 


TGCCGCGCAC 


ATCCAAAGAG 


101 


GGGTAAACAT 


GGGTATCGCG 


CACGGGAGAC 


GGTCCGATTT 


TATAAGGCTG 


151 


CGTATTCAGC 


CGTTCGTTCA 


AATCGGTTTT 


GCCCGCATCC 


AATGCCTTCG 


201 


CAATCACAAA 


CGGTTTGATT 


GCCGAACCGG 


GTTCGATCAT 


ATCGGTTACG 


251 


GCACGGTTGC 


GCCGCTGTTC 


GCTGTCTGCC 


CGGCCGGGCC 


TGTTGGGATC 


301 


GTAGGCGGGC 


GTATTGGCCA 


AGGCGAGGAT 


TTCCCCCGTG 


CGGGCATCCA 


351 


ArACCACCAC 


CGTTCCGGCT 


TTTGCCTGAT 


GGTATTCGAC 


CGCCTTGTTC 


401 


AACTCTTCAT 


AGGCCAAGGT 


CTGAATCCTC 


TGATCGAGGG 


AAAGGATGAT 


451 


GTCTTTGCCG 


TTTTTCGGGG 


CTTTAkTGCG 


CGGGGAGTCC 


AAGCTGTCCA 


501 


CAATATTGCC 


CTGCCGGTCC 


CGCAAAACGA 


CTTCCGCGCC 


GTCTTCGCCA 


551 


TGCAAGCTGT 


CTTCAAGCGA 


AAGTTCCAAA 


CCTTCCTGAC 


CTTTGCCGTC 


601 


AATATCGGTA 


AATCCGATGA 


CGTGTGCAAA 


CAGGTTGCCC 


ATCGGGTAAT 


651 


GGCGTTTTAA 











This corresponds to the amino acid sequence <SEQ ID 906; ORF 241. ng>: 

g241 -pep 

1 MPTRPTRAAN PPTPTTWLQT AYCPRPPYRP PSVQTHTPHE PASSTCAAKS 



